TCC - Bacharelado em Ciência da Computação (Sede)
URI permanente para esta coleçãohttps://arandu.ufrpe.br/handle/123456789/415
Navegar
2 resultados
Resultados da Pesquisa
Item Avaliação de métodos de imputação de valores ausentes para a predição de interações fármaco-proteína(2024-03-08) Santos, Victor Vidal dos; Nascimento, André Câmara Alves do; http://lattes.cnpq.br/0622594061462533; http://lattes.cnpq.br/7999257997046465In the last decade, the study of pharmacological networks has received a lot of attention given its relevance drug discovery process. Many different approaches for predicting biological interactions have been proposed, especially in the area of multiple kernel learning (MKL). Such methods comprise integrative approaches that can handle heterogeneous data sources, but suffer from the missing data problem. Techniques to handle missing values in the base kernel matrices can be used, usually based on simple techniques, such as imputing zeroes, mean and median of the matrix. In this work, techniques for handling missing values were evaluated in the context of bipartite networks. Our analyzes showed that the, depending on the amount of missing data, k-NN and SVD technique performed much better than the other techniques, bringing encouraging results, while zero-fill showed the worst performance in relation to all other evaluated methods.Item Tratamento de Kernels incompletos em redes bipartidas na predição de interações em redes biológicas(2020-10-30) Bastos, Tássia Laís Barros; Nascimento, André Câmara Alves do; http://lattes.cnpq.br/0622594061462533; http://lattes.cnpq.br/2065961687962702In the last decade, the study of pharmacological networks has received a lot of attention given its relevance to the production of new drugs. The studies were made possible by the large volume of biological data generated, making it possible to understand and extract knowledge from them. However, although interesting, this is a process that brings with it some barriers in terms of viability, particularly when the data appear heterogeneously and contain missing information. Many different approaches for predicting biological interactions have been proposed, especially in the area of multiple kernel learning (Multiple Kernel Learning (MKL). The use of MKL methods in biological data is also compromised by the heterogeneity of data sources, but associated with the methods, techniques for complementing missing values in the base kernel matrices can be used, this filling process is usually done with simple techniques, such as imputing zeroes, mean and median of the matrix. In this work, techniques for handling false values were evaluated in the context of bipartite networks to solve the limitations related to the heterogeneity of the data. We used three single value imputation techniques (mean, median and zero) and a more complex predictive imputation technique (SVD). All the aforementioned techniques have already been used for matrix completeness in the context of unipartite networks. Our analyzes showed that the SVD technique performed much better than the other techniques in evaluative metrics, bringing encouraging results for the use of the technique in models that use bipartite networks. The average and median techniques showed similar performances, but lower than the SVD. And filling with zero showed the worst performance in relation to all other applied techniques.