接收机工作特性
人工智能
判别式
支持向量机
计算机科学
耳鼻咽喉科
变压器
精确性和召回率
F1得分
模式识别(心理学)
机器学习
医学
外科
工程类
电压
电气工程
作者
Alberto Paderno,Anita Rau,Nikita Bedi,Paolo Bossi,Giuseppe Mercante,Cesare Piazza,F. Christopher Holsinger
摘要
Objectives To evaluate the performance of vision transformer‐derived image embeddings for distinguishing between normal and neoplastic tissues in the oropharynx and to investigate the potential of computer vision (CV) foundation models in medical imaging. Methods Computational study using endoscopic frames with a focus on the application of a self‐supervised vision transformer model (DINOv2) for tissue classification. High‐definition endoscopic images were used to extract image patches that were then normalized and processed using the DINOv2 model to obtain embeddings. These embeddings served as input for a standard support vector machine (SVM) to classify the tissues as neoplastic or normal. The model's discriminative performance was validated using an 80–20 train‐validation split. Results From 38 endoscopic NBI videos, 327 image patches were analyzed. The classification results in the validation cohort demonstrated high accuracy (92%) and precision (89%), with a perfect recall (100%) and an F1‐score of 94%. The receiver operating characteristic (ROC) curve yielded an area under the curve (AUC) of 0.96. Conclusion The use of large vision model‐derived embeddings effectively differentiated between neoplastic and normal oropharyngeal tissues. This study supports the feasibility of employing CV foundation models like DINOv2 in the endoscopic evaluation of mucosal lesions, potentially augmenting diagnostic precision in Otorhinolaryngology. Level of Evidence 4 Laryngoscope , 2024
科研通智能强力驱动
Strongly Powered by AbleSci AI