计算机科学
人工智能
机器学习
编码(社会科学)
集成学习
数学
统计
作者
Yanzhen Xu,Xiaohan Zhao,Shuai Liu,Shichao Liu,Yanqing Niu,Wen Zhang,Leyi Wei
出处
期刊:Bioinformatics and Biomedicine
日期:2019-11-01
被引量:4
标识
DOI:10.1109/bibm47256.2019.8982948
摘要
A large number of transcripts have been generated by the development of high throughput sequencing technologies. Predicting lncRNA from transcripts is a challenging and important task. In this paper, we propose LncPred-IEL, an iterative ensemble learning long non-coding RNA prediction method. LncPred-IEL not only considers features widely used for the lncRNA prediction, but also take into account sequence-derived features used in the RNA sequence classification, so as to make use of diverse information. LncPred-IEL builds base predictors based on different groups of features, and employs a supervised iterative way to combine base predictors and build ensemble models. Our studies demonstrate that supervised iterative way can learn the representations that help to separate lncRNA and protein-coding transcripts, and further improve the performances. Experiments demonstrate that LncPred-IEL outperforms several state-of-the-art methods when evaluated by 10-fold cross-validation. The capability of LncPred-IEL for the cross-species prediction is also tested. As complementary to wet experiments, LncPred-IEL is a useful computational tool for lncRNA prediction.
科研通智能强力驱动
Strongly Powered by AbleSci AI