计算机科学
BitTorrent跟踪器
人工智能
相似性(几何)
跟踪(教育)
卷积(计算机科学)
计算机视觉
特征提取
利用
模式识别(心理学)
眼动
图像(数学)
人工神经网络
心理学
教育学
计算机安全
作者
Ziang Cao,Ziyuan Huang,Liang Pan,Shiwei Zhang,Ziwei Liu,Changhong Fu
出处
期刊:Cornell University - arXiv
日期:2022-01-01
被引量:1
标识
DOI:10.48550/arxiv.2203.01885
摘要
Temporal contexts among consecutive frames are far from being fully utilized in existing visual trackers. In this work, we present TCTrack, a comprehensive framework to fully exploit temporal contexts for aerial tracking. The temporal contexts are incorporated at \textbf{two levels}: the extraction of \textbf{features} and the refinement of \textbf{similarity maps}. Specifically, for feature extraction, an online temporally adaptive convolution is proposed to enhance the spatial features using temporal information, which is achieved by dynamically calibrating the convolution weights according to the previous frames. For similarity map refinement, we propose an adaptive temporal transformer, which first effectively encodes temporal knowledge in a memory-efficient way, before the temporal knowledge is decoded for accurate adjustment of the similarity map. TCTrack is effective and efficient: evaluation on four aerial tracking benchmarks shows its impressive performance; real-world UAV tests show its high speed of over 27 FPS on NVIDIA Jetson AGX Xavier.
科研通智能强力驱动
Strongly Powered by AbleSci AI