生物
计算生物学
瓶颈
基因组
染色体构象捕获
矩阵分解
杠杆(统计)
标杆管理
非负矩阵分解
计算机科学
人工智能
基因
遗传学
基因表达
物理
特征向量
增强子
量子力学
营销
业务
嵌入式系统
作者
Da-Inn Lee,Sushmita Roy
出处
期刊:Genome Research
[Cold Spring Harbor Laboratory Press]
日期:2025-03-20
卷期号:: gr.279930.124-gr.279930.124
标识
DOI:10.1101/gr.279930.124
摘要
Three-dimensional (3D) genome organization, which determines how the DNA is packaged inside the nucleus, has emerged as a key component of the gene regulation machinery. High-throughput chromosome conformation datasets, such as Hi-C, have become available across multiple conditions and timepoints, offering a unique opportunity to examine changes in 3D genome organization and link them to phenotypic changes in normal and diseases processes. However, systematic detection of higher-order structural changes across multiple Hi-C datasets remains a major challenge. Existing computational methods either do not model higher-order structural units or cannot model dynamics across more than two conditions of interest. We address these limitations with Tree-Guided Integrated Factorization (TGIF), a generalizable multitask Non-negative Matrix Factorization (NMF) approach that can be applied to time series or hierarchically related biological conditions. TGIF can identify large-scale changes at compartment or subcompartment levels, as well as local changes at boundaries of topologically associated domains (TADs). Based on benchmarking in simulated and real Hi-C data, TGIF boundaries are more accurate and reproducible across differential levels of noise and sources of technical artifacts, and more enriched in CTCF. Application to three multisample mammalian datasets shows TGIF can detect differential regions at compartment, subcompartment, and boundary levels that are associated with significant changes in regulatory signals and gene expression enriched in tissue-specific processes. Finally, we leverage TGIF boundaries to prioritize sequence variants for multiple phenotypes from the NHGRI GWAS catalog. Taken together, TGIF is a flexible tool to examine 3D genome organization dynamics across disease and developmental processes.
科研通智能强力驱动
Strongly Powered by AbleSci AI