动态时间归整
离群值
计算机科学
模式识别(心理学)
人工智能
代表(政治)
序列(生物学)
匹配(统计)
相似性(几何)
算法
数学
图像(数学)
统计
政治
生物
遗传学
法学
政治学
作者
Nikita Dvornik,Isma Hadji,Konstantinos G. Derpanis,Animesh Garg,Allan D. Jepson
出处
期刊:Cornell University - arXiv
日期:2021-08-26
被引量:26
标识
DOI:10.48550/arxiv.2108.11996
摘要
In this work, we consider the problem of sequence-to-sequence alignment for signals containing outliers. Assuming the absence of outliers, the standard Dynamic Time Warping (DTW) algorithm efficiently computes the optimal alignment between two (generally) variable-length sequences. While DTW is robust to temporal shifts and dilations of the signal, it fails to align sequences in a meaningful way in the presence of outliers that can be arbitrarily interspersed in the sequences. To address this problem, we introduce Drop-DTW, a novel algorithm that aligns the common signal between the sequences while automatically dropping the outlier elements from the matching. The entire procedure is implemented as a single dynamic program that is efficient and fully differentiable. In our experiments, we show that Drop-DTW is a robust similarity measure for sequence retrieval and demonstrate its effectiveness as a training loss on diverse applications. With Drop-DTW, we address temporal step localization on instructional videos, representation learning from noisy videos, and cross-modal representation learning for audio-visual retrieval and localization. In all applications, we take a weakly- or unsupervised approach and demonstrate state-of-the-art results under these settings.
科研通智能强力驱动
Strongly Powered by AbleSci AI