人工智能
计算机科学
计算机视觉
运动估计
插值(计算机图形学)
图像扭曲
特征(语言学)
运动插值
帧(网络)
帧间
弹道
运动(物理)
线性插值
运动补偿
参考坐标系
模式识别(心理学)
块匹配算法
视频处理
视频跟踪
电信
哲学
语言学
物理
天文
作者
Meiqin Liu,Chenming Xu,Chao Yao,Chunyu Lin,Yao Zhao
标识
DOI:10.1109/tip.2023.3315122
摘要
Video frame interpolation (VFI) aims to generate predictive frames by motion-warping from bidirectional references. Most examples of VFI utilize spatiotemporal semantic information to realize motion estimation and interpolation. However, due to variable acceleration, irregular movement trajectories, and camera movement in real-world cases, they can not be sufficient to deal with non-linear middle frame estimation. In this paper, we present a reformulation of the VFI as a joint non-linear motion regression (JNMR) strategy to model the complicated inter-frame motions. Specifically, the motion trajectory between the target frame and multiple reference frames is regressed by a temporal concatenation of multi-stage quadratic models. Then, a comprehensive joint distribution is constructed to connect all temporal motions. Moreover, to reserve more contextual details for joint regression, the feature learning network is devised to explore clarified feature expressions with dense skip-connection. Later, a coarse-to-fine synthesis enhancement module is utilized to learn visual dynamics at different resolutions with multi-scale textures. The experimental VFI results show the effectiveness and significant improvement of joint motion regression over the state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI