计算机视觉
人工智能
光流
计算机科学
跟踪(教育)
视频跟踪
对象(语法)
特征(语言学)
目标检测
帧(网络)
匹配(统计)
运动(物理)
模式识别(心理学)
图像(数学)
数学
心理学
电信
教育学
语言学
哲学
统计
作者
Mufeng Yao,Jiaqi Wang,Jinlong Peng,Mingmin Chi,Chao Liu
出处
期刊:Cornell University - arXiv
日期:2023-01-01
标识
DOI:10.48550/arxiv.2308.07207
摘要
Multiple object tracking (MOT) has been successfully investigated in computer vision. However, MOT for the videos captured by unmanned aerial vehicles (UAV) is still challenging due to small object size, blurred object appearance, and very large and/or irregular motion in both ground objects and UAV platforms. In this paper, we propose FOLT to mitigate these problems and reach fast and accurate MOT in UAV view. Aiming at speed-accuracy trade-off, FOLT adopts a modern detector and light-weight optical flow extractor to extract object detection features and motion features at a minimum cost. Given the extracted flow, the flow-guided feature augmentation is designed to augment the object detection feature based on its optical flow, which improves the detection of small objects. Then the flow-guided motion prediction is also proposed to predict the object's position in the next frame, which improves the tracking performance of objects with very large displacements between adjacent frames. Finally, the tracker matches the detected objects and predicted objects using a spatially matching scheme to generate tracks for every object. Experiments on Visdrone and UAVDT datasets show that our proposed model can successfully track small objects with large and irregular motion and outperform existing state-of-the-art methods in UAV-MOT tasks.
科研通智能强力驱动
Strongly Powered by AbleSci AI