Canonical Correlation Analysis and Partial Least Squares for Identifying Brain–Behavior Associations: A Tutorial and a Comparative Study

典型相关 偏最小二乘回归 过度拟合 虚假关系 降维 样本量测定 计算机科学 多元统计 人类连接体项目 偏相关 维数之咒 人工智能 相关性 超参数 数据挖掘 机器学习 统计 数学 心理学 人工神经网络 几何学 神经科学 功能连接
作者
Agoston Mihalik,James Chapman,Rick A. Adams,Nils R. Winter,Fabio S. Ferreira,John Shawe-Taylor,Janaina Mourão-Miranda
出处
期刊:Biological Psychiatry: Cognitive Neuroscience and Neuroimaging [Elsevier]
卷期号:7 (11): 1055-1067 被引量:2
标识
DOI:10.1016/j.bpsc.2022.07.012
摘要

Canonical correlation analysis (CCA) and partial least squares (PLS) are powerful multivariate methods for capturing associations across 2 modalities of data (e.g., brain and behavior). However, when the sample size is similar to or smaller than the number of variables in the data, standard CCA and PLS models may overfit, i.e., find spurious associations that generalize poorly to new data. Dimensionality reduction and regularized extensions of CCA and PLS have been proposed to address this problem, yet most studies using these approaches have some limitations. This work gives a theoretical and practical introduction into the most common CCA/PLS models and their regularized variants. We examine the limitations of standard CCA and PLS when the sample size is similar to or smaller than the number of variables. We discuss how dimensionality reduction and regularization techniques address this problem and explain their main advantages and disadvantages. We highlight crucial aspects of the CCA/PLS analysis framework, including optimizing the hyperparameters of the model and testing the identified associations for statistical significance. We apply the described CCA/PLS models to simulated data and real data from the Human Connectome Project and Alzheimer's Disease Neuroimaging Initiative (both of n > 500). We use both low- and high-dimensionality versions of these data (i.e., ratios between sample size and variables in the range of ∼1-10 and ∼0.1-0.01, respectively) to demonstrate the impact of data dimensionality on the models. Finally, we summarize the key lessons of the tutorial.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
建议保存本图,每天支付宝扫一扫(相册选取)领红包
实时播报
刚刚
chirouoru完成签到 ,获得积分10
1秒前
今后应助Ainra采纳,获得50
2秒前
shizx发布了新的文献求助10
2秒前
潇洒的达发布了新的文献求助10
2秒前
Wangjialin完成签到 ,获得积分10
3秒前
5秒前
7秒前
8秒前
爆米花应助科研通管家采纳,获得10
8秒前
CipherSage应助科研通管家采纳,获得10
8秒前
所所应助科研通管家采纳,获得10
8秒前
闵白柏发布了新的文献求助10
11秒前
Maestro_S应助冷艳的心锁采纳,获得10
11秒前
11秒前
xinyi完成签到,获得积分10
13秒前
科研通AI2S应助J_C_Van采纳,获得10
13秒前
爆米花应助潇洒的达采纳,获得10
14秒前
Grayball应助kankan katz采纳,获得10
16秒前
17秒前
19秒前
20秒前
21秒前
会飞的猪发布了新的文献求助10
23秒前
23秒前
23秒前
小钟发布了新的文献求助10
24秒前
小贱牛发布了新的文献求助10
25秒前
mirr关注了科研通微信公众号
25秒前
Ainra发布了新的文献求助50
28秒前
完美世界应助kankan katz采纳,获得10
30秒前
wwec发布了新的文献求助10
32秒前
思源应助会飞的猪采纳,获得10
33秒前
彭于晏应助wodetaiyangLLL采纳,获得10
37秒前
高小谦完成签到 ,获得积分10
38秒前
八宝糖发布了新的文献求助10
41秒前
科研通AI2S应助时光纠缠采纳,获得10
44秒前
闵白柏完成签到,获得积分10
45秒前
46秒前
xingyi完成签到,获得积分10
46秒前
高分求助中
【重要提醒】请驳回机器人应助,等待人工应助!!!! 20000
Teaching Social and Emotional Learning in Physical Education 1000
Multifunctionality Agriculture: A New Paradigm for European Agriculture and Rural Development 500
grouting procedures for ground source heat pump 500
A Monograph of the Colubrid Snakes of the Genus Elaphe 300
An Annotated Checklist of Dinosaur Species by Continent 300
The Chemistry of Carbonyl Compounds and Derivatives 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2340800
求助须知:如何正确求助?哪些是违规求助? 2033457
关于积分的说明 5085570
捐赠科研通 1777792
什么是DOI,文献DOI怎么找? 889054
版权声明 556172
科研通“疑难数据库(出版商)”最低求助积分说明 474054