马尔科夫蒙特卡洛
混合蒙特卡罗
计算机科学
蒙特卡罗方法
算法
推论
贝叶斯概率
贝叶斯推理
吉布斯抽样
采用蒙地卡罗积分法
人工智能
统计推断
数学
统计
作者
Nariankadu D. Shyamalkumar,Sanvesh Srivastava
出处
期刊:Stat
[Wiley]
日期:2021-10-14
卷期号:11 (1)
被引量:5
摘要
Monte Carlo algorithms, such as Markov chain Monte Carlo (MCMC) and Hamiltonian Monte Carlo (HMC), are routinely used for Bayesian inference; however, these algorithms are prohibitively slow in massive data settings because they require multiple passes through the full data in every iteration. Addressing this problem, we develop a scalable extension of these algorithms using the divide‐and‐conquer (D&C) technique that divides the data into a sufficiently large number of subsets, draws parameters in parallel on the subsets using a powered likelihood and produces Monte Carlo draws of the parameter by combining parameter draws obtained from each subset. The combined parameter draws play the role of draws from the original sampling algorithm. Our main contributions are twofold. First, we demonstrate through diverse simulated and real data analyses focusing on generalized linear models (GLMs) that our distributed algorithm delivers comparable results as the current state‐of‐the‐art D&C algorithms in terms of statistical accuracy and computational efficiency. Second, providing theoretical support for our empirical observations, we identify regularity assumptions under which the proposed algorithm leads to asymptotically optimal inference. We also provide illustrative examples focusing on normal linear and logistic regressions where parts of our D&C algorithm are analytically tractable.
科研通智能强力驱动
Strongly Powered by AbleSci AI