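Outlier
Computer science
Cluster analysis
Gaussian process
Artificial intelligence
Data mining
Bayesian information criterion
Pattern recognition (psychology)
Noise (video)
Data modeling
Regression
Machine learning
Gaussian distribution
Statistics
Mathematics
Database
Image (mathematics)
Physics
Quantum mechanics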
Authors
Oliver Stegle, S. V. Fallert, David Mackay, Søren Brage
Identifier
DOI:10.1109/tbme.2008.923118
Abstract
Heart rate data collected during nonlaboratory conditions present several data-modeling challenges. First, the noise in such data is often poorly described by a simple Gaussian; it has outliers and errors come in bursts. Second, in large-scale studies the ECG waveform is usually not recorded in full, so one has to deal with missing information. In this paper, we propose a robust postprocessing model for such applications. Our model to infer the latent heart rate time series consists of two main components: unsupervised clustering followed by Bayesian regression. The clustering component uses auxiliary data to learn the structure of outliers and noise bursts. The subsequent Gaussian process regression model uses the cluster assignments as prior information and incorporates expert knowledge about the physiology of the heart. We apply the method to a wide range of heart rate data and obtain convincing predictions along with uncertainty estimates. In a quantitative comparison with existing postprocessing methodology, our model achieves a significant increase in performance.
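The two-stage idea in the abstract (cluster first, then regress with the cluster assignments as prior information) can be illustrated with a toy sketch. This is not the authors' model: the abstract does not specify the clustering algorithm, the auxiliary data, or the kernel. The sketch assumes a hypothetical per-sample quality score as the auxiliary signal, swaps in a plain 1-D two-means clustering, and uses a squared-exponential Gaussian process whose per-point noise level depends on the cluster label. The function names, the prior mean of 70 bpm, and all hyperparameters are illustrative assumptions.

```python
import math

def two_means(values, iters=20):
    """1-D k-means with k=2 (a stand-in for the paper's clustering step).
    Returns 0/1 labels; label 1 is the cluster with the higher center."""
    centers = [min(values), max(values)]
    labels = [0] * len(values)
    for _ in range(iters):
        labels = [0 if abs(v - centers[0]) <= abs(v - centers[1]) else 1
                  for v in values]
        for k in (0, 1):
            members = [v for v, lab in zip(values, labels) if lab == k]
            if members:
                centers[k] = sum(members) / len(members)
    return labels

def rbf(t1, t2, amp=10.0, length=5.0):
    """Squared-exponential kernel; amp and length are assumed values."""
    return amp ** 2 * math.exp(-0.5 * ((t1 - t2) / length) ** 2)

def cholesky(A):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(A)
    L = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(L[i][k] * L[j][k] for k in range(j))
            if i == j:
                L[i][j] = math.sqrt(A[i][i] - s)
            else:
                L[i][j] = (A[i][j] - s) / L[j][j]
    return L

def chol_solve(L, b):
    """Solve (L L^T) x = b by forward then backward substitution."""
    n = len(b)
    y = [0.0] * n
    for i in range(n):
        y[i] = (b[i] - sum(L[i][k] * y[k] for k in range(i))) / L[i][i]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (y[i] - sum(L[k][i] * x[k] for k in range(i + 1, n))) / L[i][i]
    return x

def gp_mean(times, obs, noise_sd, query, prior_mean=70.0):
    """GP posterior mean with a separate noise level per observation."""
    n = len(times)
    K = [[rbf(times[i], times[j]) + (noise_sd[i] ** 2 if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    alpha = chol_solve(cholesky(K), [y - prior_mean for y in obs])
    return [prior_mean + sum(rbf(q, t) * a for t, a in zip(times, alpha))
            for q in query]

# Synthetic example: a smooth heart rate with a three-sample error burst.
times = list(range(20))
truth = [70 + 5 * math.sin(t / 5) for t in times]
burst = {8, 9, 10}
observed = [140.0 if t in burst else truth[t] for t in times]
quality = [0.9 if t in burst else 0.1 for t in times]  # hypothetical auxiliary score

labels = two_means(quality)                                    # stage 1: cluster
noise_sd = [50.0 if lab == 1 else 1.0 for lab in labels]       # labels set the noise prior
estimate = gp_mean(times, observed, noise_sd, times)           # stage 2: regress
```

Because samples in the suspect cluster are assigned a large noise standard deviation, the regression effectively interpolates across the burst from the surrounding clean samples rather than following the corrupted readings.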