数学
模糊聚类
分位数
模糊逻辑
聚类分析
主成分分析
多元统计
系列(地层学)
数据挖掘
模式识别(心理学)
人工智能
统计
计算机科学
生物
古生物学
作者
Ángel López-Oriona,José A. Vilar,Pierpaolo-D'Urso
标识
DOI:10.1016/j.fss.2022.02.015
摘要
A novel procedure to perform fuzzy clustering of multivariate time series generated from different dependence models is proposed. Different amounts of dissimilarity between the generating models or changes on the dynamic behaviours over time are some arguments justifying a fuzzy approach, where each series is associated to all the clusters with specific membership levels. Our procedure considers quantile-based cross-spectral features and consists of three stages: (i) each element is characterized by a vector of proper estimates of the quantile cross-spectral densities, (ii) principal component analysis is carried out to capture the main differences reducing the effects of the noise, and (iii) the squared Euclidean distance between the first retained principal components is used to perform clustering through the standard fuzzy C -means and fuzzy C -medoids algorithms. The performance of the proposed approach is evaluated in a broad simulation study where several types of generating processes are considered, including linear, nonlinear and dynamic conditional correlation models. Assessment is done in two different ways: by directly measuring the quality of the resulting fuzzy partition and by taking into account the ability of the technique to determine the overlapping nature of series located equidistant from well-defined clusters. The procedure is compared with the few alternatives suggested in the literature, substantially outperforming all of them whatever the underlying process and the evaluation scheme. Two specific applications involving air quality and financial databases illustrate the usefulness of our approach. • A novel approach to perform fuzzy clustering of multivariate time series. • Detecting general types of dependence in multivariate processes. • Effective and efficient clustering in time series datasets. • Combining the quantile-based metric and the fuzzy logic. • Meaningful clustering of the companies in the S&P 500 index.
科研通智能强力驱动
Strongly Powered by AbleSci AI