MetDIT: Transforming and Analyzing Clinical Metabolomics Data with Convolutional Neural Networks

过度拟合代谢组学卷积神经网络判别式模式识别（心理学）降维主成分分析特征（语言学）源代码维数之咒机器学习深度学习支持向量机随机森林编码（集合论）人工神经网络人工智能生物信息学计算机科学生物语言学哲学集合（抽象数据类型）程序设计语言操作系统

作者

Yuyang Sha,Weiyu Meng,Gang Luo,Xiaobing Zhai,Henry H.Y. Tong,Yuefei Wang,Kefeng Li

出处

期刊：Analytical Chemistry [American Chemical Society]
日期：2024-02-07 被引量：7

链接

nih.govdoi.org

标识

DOI：10.1021/acs.analchem.3c04607

摘要

Clinical metabolomics is growing as an essential tool for precision medicine. However, classical machine learning algorithms struggle to comprehensively encode and analyze the metabolomics data due to their high dimensionality and complex intercorrelations. This article introduces a new method called MetDIT, designed to analyze intricate metabolomics data effectively using deep convolutional neural networks (CNN). MetDIT comprises two components: TransOmics and NetOmics. Since CNN models have difficulty in processing one-dimensional (1D) sequence data efficiently, we developed TransOmics, a framework that transforms sequence data into two-dimensional (2D) images while maintaining a one-to-one correspondence between the sequences and images. NetOmics, the second component, leverages a CNN architecture to extract more discriminative representations from the transformed samples. To overcome the overfitting due to the small sample size and class imbalance, we introduced a feature augmentation module (FAM) and a loss function to improve the model performance. Furthermore, we systematically optimized the model backbone and image resolution to balance the model parameters and computational costs. To demonstrate the performance of the proposed MetDIT, we conducted extensive experiments using three different clinical metabolomics data sets and achieved better classification performance than classical machine learning methods used in metabolomics, including Random Forest, SVM, XGBoost, and LightGBM. The source code is available at the GitHub repository at https://github.com/Li-OmicsLab/MetDIT, and the WebApp can be found at http://metdit.bioinformatics.vip/.

求助该文献

最长约 10秒，即可获得该文献文件

MetDIT: Transforming and Analyzing Clinical Metabolomics Data with Convolutional Neural Networks

今日热心研友