Boosting (machine learning)
Computer science
Association (psychology)
Artificial intelligence
Psychology
Psychotherapist
Authors
Xi Tang, Menglu Li, Wei Zhang, Junfeng Xia
Identifier
DOI:10.1145/3386052.3386056
Abstract
There is increasing evidence that long non-coding RNAs (lncRNAs) play an important role in many significant biological processes. Detecting associations between lncRNAs and human diseases with computational models aids the identification of biomarkers and the discovery of drugs for the diagnosis, treatment, and prognosis of human diseases. In this study, we propose PrLDA (Predicting LncRNA-Disease Association based on extreme gradient boosting), a method for predicting potential lncRNA-disease associations using eXtreme Gradient Boosting (XGBoost). First, we compute the semantic similarity of diseases and the sequence similarity of lncRNAs. Then, we extract feature vectors by concatenating these similarities horizontally. Finally, the feature matrix, after dimension reduction, is used as the input to XGBoost, which yields a score for the association of an lncRNA with a specific disease. Computational results indicate that our method predicts lncRNA-disease associations more accurately than previous methods. Furthermore, a case study shows that our method can effectively predict candidate lncRNAs for breast cancer, with 80% of the top 10 predictions confirmed by experiments. Therefore, PrLDA is a useful computational method for lncRNA-disease association prediction.
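The feature-extraction step described in the abstract can be illustrated with a minimal sketch. It assumes, since the abstract does not say so explicitly, that the feature vector for an lncRNA-disease pair is the lncRNA's row of the sequence-similarity matrix followed by the disease's row of the semantic-similarity matrix. The function name `build_pair_features` and the toy matrices are hypothetical, and the dimension reduction and XGBoost steps are left out.

```python
from itertools import product


def build_pair_features(lnc_sim, dis_sim):
    """Return {(i, j): feature_vector} for every lncRNA i and disease j.

    Assumption: the vector for a pair is row i of the lncRNA similarity
    matrix followed by row j of the disease similarity matrix
    (horizontal concatenation).
    """
    features = {}
    for i, j in product(range(len(lnc_sim)), range(len(dis_sim))):
        features[(i, j)] = list(lnc_sim[i]) + list(dis_sim[j])
    return features


# Toy, made-up similarity matrices: 3 lncRNAs and 2 diseases.
lnc_sim = [
    [1.0, 0.2, 0.5],
    [0.2, 1.0, 0.1],
    [0.5, 0.1, 1.0],
]
dis_sim = [
    [1.0, 0.3],
    [0.3, 1.0],
]

features = build_pair_features(lnc_sim, dis_sim)
```

Each pair ends up with a vector whose length is the number of lncRNAs plus the number of diseases. In a pipeline like the one the abstract outlines, these vectors would then be reduced in dimension and passed to a gradient-boosting classifier to score each pair.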