Lasso(编程语言)
全基因组关联研究
遗传关联
计算机科学
选择(遗传算法)
人工智能
单核苷酸多态性
人口
机器学习
特征选择
计算生物学
数据挖掘
生物
遗传学
医学
基因型
环境卫生
基因
万维网
作者
Nisha Puthiyedth,Nuoyi Zhang,Ziqing Wang,Yan Yan
标识
DOI:10.1109/bibm52615.2021.9669510
摘要
GWAS are popular approaches to associate genetic variations on a population of individuals with particular traits. Despite widely applications, GWAS are heavily depend on the accuracy of statistical models and they have been criticized for missing important genetic markers. Least absolute shrinkage and selection operator (LASSO) and its variants have become an important machine learning methods to find significant features from complex datasets. The regression models used in LASSO make it so useful for selecting features from the dataset where there are many features but only a few relevant ones. In this study, we investigate how LASSO-based methods can be used as an alternative of GWAS to find significant genetic markers. This study provides guidance on selecting LASSO-based programs with different data used in selecting significant SNPs.
科研通智能强力驱动
Strongly Powered by AbleSci AI