核糖核酸
适体
计算机科学
核酸二级结构
核酸结构
蛋白质二级结构
计算生物学
深度学习
人工智能
折叠(DSP实现)
核糖开关
假结
机器学习
数据挖掘
生物
非编码RNA
工程类
遗传学
电气工程
基因
生物化学
作者
Shamsudin S. Nasaev,Artem Mukanov,Ivan Kuznetsov,A. V. Veselovsky
标识
DOI:10.1002/minf.202300113
摘要
Nowadays there are numerous discovered natural RNA variations participating in different cellular processes and artificial RNA, e. g., aptamers, riboswitches. One of the required tasks in the investigation of their functions and mechanism of influence on cells and interaction with targets is the prediction of RNA secondary structures. The classic thermodynamic-based prediction algorithms do not consider the specificity of biological folding and deep learning methods that were designed to resolve this issue suffer from homology-based methods problems. Herein, we present a method for RNA secondary structure prediction based on deep learning - AliNA (ALIgned Nucleic Acids). Our method successfully predicts secondary structures for non-homologous to train-data RNA families thanks to usage of the data augmentation techniques. Augmentation extends existing datasets with easily-accessible simulated data. The proposed method shows a high quality of prediction across different benchmarks including pseudoknots. The method is available on GitHub for free (https://github.com/Arty40m/AliNA).
科研通智能强力驱动
Strongly Powered by AbleSci AI