清晨好,您是今天最早来到科研通的研友!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您科研之路漫漫前行!

An Effective Algorithm Based on Sequence and Property Information for N4-methylcytosine Identification in Multiple Species

鉴定(生物学) 序列(生物学) 5-甲基胞嘧啶 财产(哲学) 化学 算法 计算生物学 计算机科学 生物化学 生物 基因 DNA甲基化 植物 基因表达 哲学 认识论
作者
Lichao Zhang,Xueting Wang,Kang Xiao,Liang Kong
出处
期刊:Letters in Organic Chemistry [Bentham Science Publishers]
卷期号:21 (8): 695-706
标识
DOI:10.2174/0115701786277281231228093405
摘要

Abstract: N4-methylcytosine (4mC) is one of the most important epigenetic modifications, which plays a significant role in biological progress and helps explain biological functions. Although biological experiments can identify potential 4mC sites, they are limited due to the experimental environment and labor-intensive process. Therefore, it is crucial to construct a computational model to identify the 4mC sites. Some computational methods have been proposed to identify the 4mC sites, but some problems should not be ignored, such as those presented as follows: (1) a more accurate algorithm is required to improve the prediction, especially for Matthew’s correlation coefficient (MCC); (2) easier method is needed for clinical research to design medicine or treat disease. Considering these aspects, an effective algorithm using comprehensible encoding in multiple species was proposed in this study. Since nucleotide arrangement and its property information could reflect the sequence structure and function, several feature vectors have been developed based on nucleotide energy information, trinucleotide energy information, and nucleotide chemical property information. Besides, feature effect has been analyzed to select the optimal feature vectors for multiple species. Finally, the optimal feature vectors were inputted into the CatBoost algorithm to construct the identification model. The evaluation results showed that our study obtained the highest MCC, i.e., 2.5%~11.1%, 1.4%~17.8%, 1.1%~7.6%, and 2.3%~18.0% higher than previous models for the A. thaliana, C. elegans, D. melanogaster, and E. coli datasets, respectively. These satisfactory results reflect that the proposed method is available to identify 4mC sites in multiple species, especially for MCC. It could provide a reasonable supplement for biological research.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
livialiu发布了新的文献求助10
3秒前
miki完成签到 ,获得积分10
6秒前
成就的香菇完成签到,获得积分10
18秒前
Vintoe完成签到 ,获得积分10
19秒前
23秒前
禾木完成签到,获得积分10
28秒前
30秒前
禾木发布了新的文献求助10
33秒前
沉梦昂志完成签到 ,获得积分10
36秒前
翟庆春完成签到,获得积分10
38秒前
螺丝炒钉子完成签到,获得积分10
44秒前
CodeCraft应助livialiu采纳,获得10
1分钟前
Hao完成签到,获得积分10
1分钟前
羞涩的问兰完成签到,获得积分10
1分钟前
naczx完成签到,获得积分0
1分钟前
1分钟前
零丁完成签到,获得积分10
1分钟前
小田完成签到 ,获得积分10
1分钟前
梁梁完成签到 ,获得积分10
2分钟前
领导范儿应助livialiu采纳,获得10
2分钟前
Autin完成签到,获得积分10
2分钟前
帅气的芷文完成签到,获得积分10
2分钟前
2分钟前
赖氨酸完成签到,获得积分10
2分钟前
深情安青应助livialiu采纳,获得10
2分钟前
琳io完成签到 ,获得积分10
3分钟前
紫熊完成签到,获得积分10
3分钟前
3分钟前
胡杨树2006完成签到,获得积分10
3分钟前
livialiu发布了新的文献求助10
3分钟前
老石完成签到 ,获得积分0
3分钟前
隐形曼青应助livialiu采纳,获得10
3分钟前
3分钟前
4分钟前
4分钟前
4分钟前
livialiu发布了新的文献求助10
4分钟前
qwq完成签到,获得积分10
4分钟前
4分钟前
科研通AI2S应助科研通管家采纳,获得10
4分钟前
高分求助中
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
Cybercrime: The Transformation of Crime in the Information Age, 2nd Edition 400
Moore's Clinically Oriented Anatomy 10th Edition 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6612488
求助须知:如何正确求助?哪些是违规求助? 8377993
关于积分的说明 17924117
捐赠科研通 5777491
什么是DOI,文献DOI怎么找? 2958286
邀请新用户注册赠送积分活动 1933549
关于科研通互助平台的介绍 1835514