数学
空分布
投影(关系代数)
独立性(概率论)
单变量
检验统计量
空(SQL)
距离相关
无效假设
随机投影
统计的
统计假设检验
投影寻踪
随机变量
多元随机变量
样本量测定
相关性
算法
渐近分布
统计
多元统计
估计员
计算机科学
数据挖掘
几何学
出处
期刊:Biometrika
[Oxford University Press]
日期:2023-11-15
卷期号:111 (3): 1013-1027
被引量:11
标识
DOI:10.1093/biomet/asad070
摘要
Summary Testing independence between high-dimensional random vectors is fundamentally different from testing independence between univariate random variables. Taking the projection correlation as an example, it suffers from at least three problems. First, it has a high computational complexity of O{n3(p+q)}, where n, p and q are the sample size and dimensions of the random vectors; this limits its usefulness substantially when n is extremely large. Second, the asymptotic null distribution of the projection correlation test is rarely tractable; therefore, random permutations are often suggested as a means of approximating the asymptotic null distribution, which further increases the complexity of implementing independence tests. Third, the power performance of the projection correlation test deteriorates in high dimensions. To address these issues, the projection correlation is improved by using a modified weight function, which reduces the complexity to O{n2(p+q)}. We estimate the improved projection correlation with U-statistic theory. Importantly, its asymptotic null distribution is standard normal, thanks to the high dimesnionality of the random vectors. This expedites the implementation of independence tests substantially. To enhance the power performance in high dimensions, we propose incorporating a cross-validation procedure with feature screening into the projection correlation test. The implementation efficacy and power enhancement are confirmed through extensive numerical studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI