IntroUNET: identifying introgressed alleles via semantic segmentation

分割 等位基因 人工智能 自然语言处理 计算机科学 生物 遗传学 进化生物学 基因
作者
Dylan D. Ray,Lex E. Flagel,Daniel R. Schrider
标识
DOI:10.1101/2023.02.07.527435
摘要

A growing body of evidence suggests that gene flow between closely related species is a widespread phenomenon. Alleles that introgress from one species into a close relative are typically neutral or deleterious, but sometimes confer a significant fitness advantage. Given the potential relevance to speciation and adaptation, numerous methods have therefore been devised to identify regions of the genome that have experienced introgression. Recently, supervised machine learning approaches have been shown to be highly effective for detecting introgression. One especially promising approach is to treat population genetic inference as an image classification problem, and feed an image representation of a population genetic alignment as input to a deep neural network that distinguishes among evolutionary models (i.e. introgression or no introgression). However, if we wish to investigate the full extent and fitness effects of introgression, merely identifying genomic regions in a population genetic alignment that harbor introgressed loci is insufficient---ideally we would be able to infer precisely which individuals have introgressed material and at which positions in the genome. Here we adapt a deep learning algorithm for semantic segmentation, the task of correctly identifying the type of object to which each individual pixel in an image belongs, to the task of identifying introgressed alleles. Our trained neural network is thus able to infer, for each individual in a two-population alignment, which of those individual's alleles were introgressed from the other population. We use simulated data to show that this approach is highly accurate, and that it can be readily extended to identify alleles that are introgressed from an unsampled "ghost" population, performing comparably to a supervised learning method tailored specifically to that task. Finally, we apply this method to data from Drosophila , showing that it is able to accurately recover introgressed haplotypes from real data. This analysis reveals that introgressed alleles are typically confined to lower frequencies within genic regions, suggestive of purifying selection, but are found at much higher frequencies in a region previously shown to be affected by adaptive introgression. Our method's success in recovering introgressed haplotypes in challenging real-world scenarios underscores the utility of deep learning approaches for making richer evolutionary inferences from genomic data.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
hd完成签到,获得积分10
1秒前
星星草发布了新的文献求助10
1秒前
大民王发布了新的文献求助10
1秒前
5秒前
白云完成签到,获得积分10
7秒前
科研通AI2S应助chcmuer采纳,获得10
8秒前
无敌鱼发布了新的文献求助10
9秒前
12秒前
reece完成签到,获得积分10
17秒前
小马宝莉发布了新的文献求助10
23秒前
23秒前
xzw完成签到,获得积分10
26秒前
聪明的灵寒完成签到 ,获得积分10
26秒前
30秒前
shitou2023发布了新的文献求助10
30秒前
32秒前
YCS完成签到,获得积分10
33秒前
小马宝莉完成签到,获得积分20
34秒前
34秒前
34秒前
36秒前
hideinrubbish发布了新的文献求助10
38秒前
早早入眠完成签到,获得积分10
38秒前
陶梦欣发布了新的文献求助20
39秒前
格子完成签到,获得积分10
39秒前
Polarbeer发布了新的文献求助10
39秒前
40秒前
bkagyin应助古乙丁三雨采纳,获得10
40秒前
Jasper应助聪明的灵寒采纳,获得10
41秒前
1.1发布了新的文献求助30
43秒前
45秒前
欢语完成签到,获得积分10
47秒前
酷炫问玉发布了新的文献求助10
48秒前
48秒前
Polarbeer完成签到,获得积分20
48秒前
48秒前
科里斯皮尔应助vision0000采纳,获得10
49秒前
赵赵完成签到,获得积分10
51秒前
寒烟完成签到,获得积分10
51秒前
Owen应助ri_290采纳,获得10
51秒前
高分求助中
【本贴是提醒信息,请勿应助】请在求助之前详细阅读求助说明!!!! 20000
One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000
The Three Stars Each: The Astrolabes and Related Texts 900
Yuwu Song, Biographical Dictionary of the People's Republic of China 800
Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600
Challenges, Strategies, and Resiliency in Disaster and Risk Management 500
Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2481776
求助须知:如何正确求助?哪些是违规求助? 2144384
关于积分的说明 5469750
捐赠科研通 1866895
什么是DOI,文献DOI怎么找? 927899
版权声明 563039
科研通“疑难数据库(出版商)”最低求助积分说明 496404