数学
度量(数据仓库)
三角形不等式
成对比较
重采样
排列(音乐)
排名(信息检索)
组合数学
统计
离散数学
人工智能
数据挖掘
声学
计算机科学
物理
作者
Peter Hall,Nader Tajvidi
出处
期刊:Biometrika
[Oxford University Press]
日期:2002-06-01
卷期号:89 (2): 359-374
被引量:141
标识
DOI:10.1093/biomet/89.2.359
摘要
Motivated by applications in high‐dimensional settings, we suggest a test of the hypothesis H0 that two sampled distributions are identical. It is assumed that two independent datasets are drawn from the respective populations, which may be very general. In particular, the distributions may be multivariate or infinite‐dimensional, in the latter case representing, for example, the distributions of random functions from one Euclidean space to another. Our test uses a measure of distance between data. This measure should be symmetric but need not satisfy the triangle inequality, so it is not essential that it be a metric. The test is based on ranking the pooled dataset, with respect to the distance and relative to any fixed data value, and repeating this operation for each fixed datum. A permutation argument enables a critical point to be chosen such that the test has concisely known significance level, conditional on the set of all pairwise distances.
科研通智能强力驱动
Strongly Powered by AbleSci AI