计算生物学
计算机科学
聚类分析
图形
主题(音乐)
人工智能
核糖核酸
共识序列
鉴定(生物学)
机器学习
生物
基因
理论计算机科学
遗传学
肽序列
物理
植物
声学
作者
Guodong Li,Bo-Wei Zhao,Xiaorui Su,Yue Yang,Pengwei Hu,Xi Zhou,Lun Hu
标识
DOI:10.1109/jbhi.2024.3357979
摘要
As a pivotal post-transcriptional modification of RNA, N6-methyladenosine (m6A) has a substantial influence on gene expression modulation and cellular fate determination. Although a variety of computational models have been developed to accurately identify potential m6A modification sites, few of them are capable of interpreting the identification process with insights gained from consensus knowledge. To overcome this problem, we propose a deep learning model, namely M6A-DCR, by discovering consensus regions for interpretable identification of m6A modification sites. In particular, M6A-DCR first constructs an instance graph for each RNA sequence by integrating specific positions and types of nucleotides. The discovery of consensus regions is then formulated as a graph clustering problem in light of aggregating all instance graphs. After that, M6A-DCR adopts a motif-aware graph reconstruction optimization process to learn high-quality embeddings of input RNA sequences, thus achieving the identification of m6A modification sites in an end-to-end manner. Experimental results demonstrate the superior performance of M6A-DCR by comparing it with several state-of-the-art identification models. The consideration of consensus regions empowers our model to make interpretable predictions at the motif level. The analysis of cross validation through different species and tissues further verifies the consistency between the identification results of M6A-DCR and the evolutionary relationships among species
科研通智能强力驱动
Strongly Powered by AbleSci AI