Multivariate statistics
Series (stratigraphy)
Computer science
Multivariate analysis
Time series
Statistics
Mathematics
Geology
Paleontology
Authors
Aoqian Zhang,Zexue Wu,Yifeng Gong,Ye Yuan,Guoren Wang
Source
Journal: Cornell University - arXiv
Date: 2024-11-02
Citations: 1
Identifier
DOI:10.48550/arxiv.2411.01214
Abstract
Errors are common in time series due to unreliable sensor measurements. Existing methods focus on univariate data but do not utilize the correlation between dimensions. Cleaning each dimension separately may lead to a less accurate result, as some errors can only be identified in the multivariate case. We also point out that the widely used minimum change principle is not always the best choice. Instead, we try to change the smallest number of data to avoid a significant change in the data distribution. In this paper, we propose MTCSC, the constraint-based method for cleaning multivariate time series. We formalize the repair problem, propose a linear-time method to employ online computing, and improve it by exploiting data trends. We also support adaptive speed constraint capturing. We analyze the properties of our proposals and compare them with SOTA methods in terms of effectiveness, efficiency versus error rates, data sizes, and applications such as classification. Experiments on real datasets show that MTCSC can have higher repair accuracy with less time consumption. Interestingly, it can be effective even when there are only weak or no correlations between the dimensions.