非编码RNA
深度学习
核糖核酸
人工智能
计算机科学
计算生物学
机器学习
鉴定(生物学)
长非编码RNA
生物
编码(社会科学)
基因
遗传学
统计
数学
生态学
作者
Noorul Amin,Annette McGrath,Yi‐Ping Phoebe Chen
标识
DOI:10.1038/s42256-019-0051-2
摘要
Non-coding (nc) RNA plays a vital role in biological processes and has been associated with diseases such as cancer. Classification of ncRNAs is necessary for understanding the underlying mechanisms of the diseases and to design effective treatments. Recently, deep learning has been employed for ncRNA identification and classification and has shown promising results. In this study, we review the progress of ncRNA type classification, specifically lncRNA, lincRNA, circular RNA and small ncRNA, and present a comprehensive comparison of six deep learning based classification methods published in the past two years. We identify research gaps and challenges of ncRNA types, such as the classification of subclasses of lncRNA, transcript length and compositional variation, dependency on database searches and the high false positive rate of existing approaches. We suggest future directions for cross-species performance deviation, deep learning model selection and sequence intrinsic features. Many functions of RNA strands that do not code for proteins are still to be deciphered. Methods to classify different groups of non-coding RNA increasingly use deep learning, but the landscape is diverse and methods need to be categorized and benchmarked to move forward. The authors take a close look at six state-of-the-art deep learning non-coding RNA classifiers and compare their performance and architecture.
科研通智能强力驱动
Strongly Powered by AbleSci AI