A scalable artificial intelligence platform that automatically finds copy number variations (CNVs) in journal articles and transforms them into a database: CNV extraction, transformation, and loading AI (CNV-ETLAI)

拷贝数变化 计算机科学 人工智能 生物 遗传学 基因组 基因
作者
Jongmun Choi,Soomin Jeon,Doyun Kim,Michelle Chua,Synho Do
出处
期刊:Computers in Biology and Medicine [Elsevier]
卷期号:144: 105332-105332 被引量:1
标识
DOI:10.1016/j.compbiomed.2022.105332
摘要

Although copy number variations (CNVs) are infrequent, each anomaly is unique, and multiple CNVs can appear simultaneously. Growing evidence suggests that CNVs contribute to a wide range of diseases. When CNVs are detected, assessment of their clinical significance requires a thorough literature review. This process can be extremely time-consuming and may delay disease diagnosis. Therefore, we have developed CNV Extraction, Transformation, and Loading Artificial Intelligence (CNV-ETLAI), an innovative tool that allows experts to classify and interpret CNVs accurately and efficiently.We combined text, table, and image processing algorithms to develop an artificial intelligence platform that automatically extracts, transforms, and organizes CNV information into a database. To validate CNV-ETLAI, we compared its performance to ground truth datasets labeled by a human expert. In addition, we analyzed the CNV data, which was collected using CNV-ETLAI via a crowdsourcing approach.In comparison to a human expert, CNV-ETLAI improved CNV detection accuracy by 4% and performed the analysis 60 times faster. This performance can improve even further with upscaling of the CNV-ETLAI database as usage increases. 5,800 CNVs from 2,313 journal articles were collected. Total CNV frequency for the whole chromosome was highest for chromosome X, whereas CNV frequency per 1 Mb of genomic length was highest for chromosome 22.We have developed, tested, and shared CNV-ETLAI for research and clinical purposes (https://lmic.mgh.harvard.edu/CNV-ETLAI). Use of CNV-ETLAI is expected to ease and accelerate diagnostic classification and interpretation of CNVs.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
2秒前
基一啊佳发布了新的文献求助10
3秒前
哇哈哈完成签到,获得积分10
5秒前
Brian发布了新的文献求助10
6秒前
田様应助夕荀采纳,获得10
7秒前
9秒前
基一啊佳完成签到,获得积分10
9秒前
Ono完成签到,获得积分20
9秒前
英姑应助宣依云采纳,获得10
10秒前
cae哈哈哈完成签到,获得积分10
12秒前
兴钬完成签到,获得积分10
13秒前
13秒前
APS发布了新的文献求助10
15秒前
结实无施发布了新的文献求助10
16秒前
快看文献完成签到 ,获得积分10
16秒前
17秒前
17秒前
lxj发布了新的文献求助10
18秒前
18秒前
小七辅助发布了新的文献求助10
20秒前
21秒前
22秒前
慕青应助欢乐的零采纳,获得10
22秒前
24秒前
杨一完成签到 ,获得积分10
24秒前
黄sir发布了新的文献求助10
24秒前
宣依云发布了新的文献求助10
25秒前
27秒前
pwang_ecust完成签到,获得积分20
28秒前
多余完成签到,获得积分10
29秒前
shen发布了新的文献求助10
29秒前
29秒前
大个应助黄sir采纳,获得10
30秒前
pwang_ecust发布了新的文献求助10
31秒前
hawaii66完成签到,获得积分10
31秒前
ou应助翊然甜周采纳,获得10
31秒前
33秒前
34秒前
欢乐的零完成签到,获得积分20
34秒前
夕荀发布了新的文献求助10
34秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Chinese-English Translation Lexicon Version 3.0 500
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
薩提亞模式團體方案對青年情侶輔導效果之研究 400
[Lambert-Eaton syndrome without calcium channel autoantibodies] 400
Statistical Procedures for the Medical Device Industry 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2380043
求助须知:如何正确求助?哪些是违规求助? 2087323
关于积分的说明 5240774
捐赠科研通 1814497
什么是DOI,文献DOI怎么找? 905230
版权声明 558734
科研通“疑难数据库(出版商)”最低求助积分说明 483250