Multi-center verification of the influence of data ratio of training sets on test results of an AI system for detecting early gastric cancer based on the YOLO-v4 algorithm

试验装置 卷积神经网络 集合(抽象数据类型) 人工智能 尤登J统计 训练集 计算机科学 癌症 数据集 接收机工作特性 模式识别(心理学) 机器学习 医学 程序设计语言 内科学
作者
Tao Jin,Yancai Jiang,Boneng Mao,Xing Wang,Bo Lü,Min Ji,Hutao Zhou,Tieliang Ma,Yefei Zhang,Sisi Li,Yun Shi,Zhendong Yao
出处
期刊:Frontiers in Oncology [Frontiers Media]
卷期号:12: 953090-953090 被引量:5
标识
DOI:10.3389/fonc.2022.953090
摘要

Objective: Convolutional Neural Network(CNN) is increasingly being applied in the diagnosis of gastric cancer. However, the impact of proportion of internal data in the training set on test results has not been sufficiently studied. Here, we constructed an artificial intelligence (AI) system called EGC-YOLOV4 using the YOLO-v4 algorithm to explore the optimal ratio of training set with the power to diagnose early gastric cancer. Design: A total of 22,0918 gastroscopic images from Yixing People's Hospital were collected. 7 training set models were established to identify 4 test sets. Respective sensitivity, specificity, Youden index, accuracy, and corresponding thresholds were tested, and ROC curves were plotted. Results: 1. The EGC-YOLOV4 system completes all tests at an average reading speed of about 15 ms/sheet; 2. The AUC values in training set 1 model were 0.8325, 0.8307, 0.8706, and 0.8279, in training set 2 model were 0.8674, 0.8635, 0.9056, and 0.9249, in training set 3 model were 0.8544, 0.8881, 0.9072, and 0.9237, in training set 4 model were 0.8271, 0.9020, 0.9102, and 0.9316, in training set 5 model were 0.8249, 0.8484, 0.8796, and 0.8931, in training set 6 model were 0.8235, 0.8539, 0.9002, and 0.9051, in training set 7 model were 0.7581, 0.8082, 0.8803, and 0.8763. Conclusion: EGC-YOLOV4 can quickly and accurately identify the early gastric cancer lesions in gastroscopic images, and has good generalization.The proportion of positive and negative samples in the training set will affect the overall diagnostic performance of AI.In this study, the optimal ratio of positive samples to negative samples in the training set is 1:1~ 1:2.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
现实的小蚂蚁完成签到,获得积分10
1秒前
桃花扇完成签到,获得积分10
2秒前
weila完成签到 ,获得积分10
4秒前
w0r1d完成签到 ,获得积分10
6秒前
hdcf完成签到 ,获得积分10
11秒前
kook完成签到 ,获得积分10
14秒前
所所应助芹123采纳,获得10
15秒前
幸福妙柏完成签到 ,获得积分10
16秒前
Jason完成签到 ,获得积分10
16秒前
tigger完成签到,获得积分10
18秒前
weng完成签到,获得积分10
20秒前
llhh2024完成签到,获得积分10
22秒前
Tomyyh完成签到,获得积分10
24秒前
奥斯卡完成签到,获得积分0
29秒前
Jzag完成签到 ,获得积分10
30秒前
Elaine完成签到 ,获得积分10
33秒前
赘婿应助Apricity采纳,获得10
33秒前
ygmygqdss完成签到 ,获得积分10
38秒前
杆杆完成签到 ,获得积分10
38秒前
科研通AI2S应助科研通管家采纳,获得10
41秒前
cdd完成签到,获得积分10
41秒前
hhh完成签到 ,获得积分10
43秒前
朴素的曼文完成签到,获得积分10
44秒前
舒适刺猬完成签到 ,获得积分10
46秒前
大气糖豆完成签到 ,获得积分10
47秒前
FZz完成签到 ,获得积分10
48秒前
孙一完成签到,获得积分10
48秒前
茹茹完成签到 ,获得积分10
49秒前
星星完成签到 ,获得积分10
49秒前
刚子完成签到 ,获得积分0
52秒前
茅十八完成签到,获得积分10
53秒前
白白不喽完成签到 ,获得积分10
53秒前
GXW完成签到,获得积分10
1分钟前
zhuxd完成签到 ,获得积分10
1分钟前
20250702完成签到 ,获得积分10
1分钟前
amen发布了新的文献求助10
1分钟前
flypipidan完成签到,获得积分10
1分钟前
Sunbrust完成签到 ,获得积分10
1分钟前
kaiz完成签到,获得积分10
1分钟前
幸运嘟嘟完成签到 ,获得积分10
1分钟前
高分求助中
Introduction to Helicopter and Tiltrotor Flight Simulation, Second Edition 2000
Overcoming Stigma and Bias in Obesity Management 800
Malcolm Fraser : a biography 700
Signals, Systems, and Signal Processing 610
Materials selection in mechanical design 500
Bounds for Statistical Estimation in Semiparametric Models 500
Forced degradation and stability indicating LC method for Letrozole: A stress testing guide 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6487208
求助须知:如何正确求助?哪些是违规求助? 8285503
关于积分的说明 17670930
捐赠科研通 5575792
什么是DOI,文献DOI怎么找? 2913521
邀请新用户注册赠送积分活动 1890466
关于科研通互助平台的介绍 1748008