학술논문

Combining K Nearest Neighbor With Nonnegative Matrix Factorization for Predicting Circrna-Disease Associations
Document Type
Periodical
Source
IEEE/ACM Transactions on Computational Biology and Bioinformatics IEEE/ACM Trans. Comput. Biol. and Bioinf. Computational Biology and Bioinformatics, IEEE/ACM Transactions on. 20(5):2610-2618 Jan, 2023
Subject
Bioengineering
Computing and Processing
Diseases
Semantics
Kernel
Databases
RNA
Cancer
Computational modeling
Gaussian kernel similarity
semantic similarity
++%24K%24<%2Ftex-math>+++K<%2Fmml%3Ami>+<%2Fmml%3Amath>++<%2Falternatives>+<%2Finline-formula>+<%2Fnamed-content>+nearest+neighbor%22"> $K$ K nearest neighbor
circRNA-disease association
nonnegative Matrix Factorization
Language
ISSN
1545-5963
1557-9964
2374-0043
Abstract
Accumulating evidences show that circular RNAs (circRNAs) play an important role in regulating gene expression, and involve in many complex human diseases. Identifying associations of circRNA with disease helps to understand the pathogenesis, treatment and diagnosis of complex diseases. Since inferring circRNA-disease associations by biological experiments is costly and time-consuming, there is an urgently need to develop a computational model to identify the association between them. In this paper, we proposed a novel method named KNN-NMF, which combines $K\ $Knearest neighbors with nonnegative matrix factorization to infer associations between circRNA and disease (KNN-NMF). Frist, we compute the Gaussian Interaction Profile (GIP) kernel similarity of circRNA and disease, the semantic similarity of disease, respectively. Then, the circRNA-disease new interaction profiles are established using weight $K$K nearest neighbors to reduce the false negative association impact on prediction performance. Finally, Nonnegative Matrix Factorization is implemented to predict associations of circRNA with disease. The experiment results indicate that the prediction performance of KNN-NMF outperforms the competing methods under five-fold cross-validation. Moreover, case studies of two common diseases further show that KNN-NMF can identify potential circRNA-disease associations effectively.