矩阵分解
计算机科学
表达式(计算机科学)
因式分解
非负矩阵分解
计算生物学
算法
生物
物理
量子力学
特征向量
程序设计语言
出处
期刊:IEEE Access
[Institute of Electrical and Electronics Engineers]
日期:2024-01-01
卷期号:12: 70297-70304
被引量:8
标识
DOI:10.1109/access.2024.3401005
摘要
Long non-coding RNAs (lncRNAs) play significant roles in multiple biological processes and contribute to the progression and development of various human diseases. Therefore, it is necessary to decipher novel lncRNA-disease associations from the perspective of biomarker detection. Numerous computational models have been designed to identify lncRNA-disease associations using machine learning. However, many of these models fail to effectively incorporate heterogeneous biological datasets, which can lead to reduced model accuracy and performance. In this study, we propose a novel lncRNA expression profile-based matrix factorization method that applies lncRNA expression profiles to identify lncRNA-disease associations (EMFLDA). Matrix factorization is a machine learning method that exhibits excellent performance not only in recommender systems, but also in various scientific areas. We also applied lncRNA expression profiles as weights for the proposed model, which allowed for the integration of heterogeneous information and thereby improved performance. As a result, EMFLDA outperformed the four previous models in terms of AUC scores, achieving scores of 0.9042 and 0.8841 based on leave-one-out cross-validation and five-fold cross-validation, respectively. Thus, the proposed model, EMFLDA, not only serves as an effective tool for identifying disease-related lncRNAs, but also plays a pivotal role in extracting disease biomarkers.
科研通智能强力驱动
Strongly Powered by AbleSci AI