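Microbiome
Taxon
Taxonomy (biology)
Biology
Taxonomic rank
Table (database)
Hierarchy
Operational taxonomic unit
Computational biology
Evolutionary biology
Computer science
Ecology
Bioinformatics
Data mining
Genetics
16S ribosomal RNA
Gene
Economy
Market economy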
Authors
S Kim, Nayeon Kang, Taesung Park
Identifier
DOI:10.1109/tcbb.2020.3039326
Abstract
The recent advent of high-throughput sequencing technology has made it possible to study associations between the human microbiome and disease. The DNA sequences of microbiome samples are clustered into operational taxonomic units (OTUs) according to their similarity. The resulting OTU table, which contains the count of each OTU present in each sample, is used to measure correlations between OTUs and disease status and to find key microbes for predicting disease status. Various statistical methods have been proposed for such microbiome data analysis. However, none of these methods reflects the hierarchical structure of the taxonomy. In this paper, we propose a hierarchical structural component model for microbiome data (HisCoM-microb) that uses taxonomy information as well as OTU table data. HisCoM-microb consists of two layers: one for OTUs and the other for taxa at a higher taxonomic level. The coefficients of the OTUs and of the taxa in these two layers are estimated simultaneously within the hierarchical model. Through this analysis, we can infer the association between taxa or OTUs and disease status while accounting for the impact of taxonomic structure on disease status. Both a simulation study and an analysis of real microbiome data show that HisCoM-microb can reveal the relationship between each taxon and disease status and, at the same time, identify the key OTUs for the disease.
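The two-layer structure described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' estimation procedure: the OTU weights and taxon coefficients below are fixed, hypothetical values rather than estimates, and the function names, OTU names, and genus labels are all assumptions made for illustration. The sketch only shows how OTU counts could roll up into taxon-level scores (layer 1), how those scores could combine into one predictor of disease status (layer 2), and how an OTU's overall contribution is then the product of its weight and its taxon's coefficient.

```python
from collections import defaultdict


def taxon_scores(otu_counts, otu_to_taxon, otu_weights):
    """Layer 1: collapse one sample's OTU counts into taxon-level scores."""
    scores = defaultdict(float)
    for otu, count in otu_counts.items():
        scores[otu_to_taxon[otu]] += otu_weights[otu] * count
    return dict(scores)


def linear_predictor(scores, taxon_coefs, intercept=0.0):
    """Layer 2: combine taxon scores into a single linear predictor."""
    return intercept + sum(taxon_coefs[t] * s for t, s in scores.items())


def effective_otu_coefs(otu_to_taxon, otu_weights, taxon_coefs):
    """Overall effect of each OTU: its weight times its taxon's coefficient."""
    return {
        otu: otu_weights[otu] * taxon_coefs[taxon]
        for otu, taxon in otu_to_taxon.items()
    }


# Hypothetical toy inputs (not taken from the paper).
otu_to_taxon = {"OTU1": "GenusA", "OTU2": "GenusA", "OTU3": "GenusB"}
otu_weights = {"OTU1": 0.5, "OTU2": 0.25, "OTU3": 1.0}
taxon_coefs = {"GenusA": 2.0, "GenusB": -1.0}
sample = {"OTU1": 4, "OTU2": 8, "OTU3": 3}

scores = taxon_scores(sample, otu_to_taxon, otu_weights)
eta = linear_predictor(scores, taxon_coefs)
effective = effective_otu_coefs(otu_to_taxon, otu_weights, taxon_coefs)

print(scores)     # taxon-level scores for this sample
print(eta)        # linear predictor for disease status
print(effective)  # per-OTU effect implied by the two layers
```

In a real analysis the weights and coefficients would be estimated jointly from the OTU table and disease labels; fixing them here keeps the example focused on the hierarchy itself.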