计算机科学
运动补偿
人工智能
卷积(计算机科学)
利用
计算机视觉
时间分辨率
图像分辨率
卷积神经网络
参考坐标系
接头(建筑物)
像素
运动估计
帧(网络)
模式识别(心理学)
人工神经网络
电信
建筑工程
物理
计算机安全
量子力学
工程类
作者
José Caballero,Christian Ledig,Andrew P. Aitken,Alejandro Acosta,Johannes Totz,Zehan Wang,Wenzhe Shi
标识
DOI:10.1109/cvpr.2017.304
摘要
Convolutional neural networks have enabled accurate image super-resolution in real-time. However, recent attempts to benefit from temporal correlations in video super-resolution have been limited to naive or inefficient architectures. In this paper, we introduce spatio-temporal sub-pixel convolution networks that effectively exploit temporal redundancies and improve reconstruction accuracy while maintaining real-time speed. Specifically, we discuss the use of early fusion, slow fusion and 3D convolutions for the joint processing of multiple consecutive video frames. We also propose a novel joint motion compensation and video super-resolution algorithm that is orders of magnitude more efficient than competing methods, relying on a fast multi-resolution spatial transformer module that is end-to-end trainable. These contributions provide both higher accuracy and temporally more consistent videos, which we confirm qualitatively and quantitatively. Relative to single-frame models, spatio-temporal networks can either reduce the computational cost by 30% whilst maintaining the same quality or provide a 0.2dB gain for a similar computational cost. Results on publicly available datasets demonstrate that the proposed algorithms surpass current state-of-the-art performance in both accuracy and efficiency.
科研通智能强力驱动
Strongly Powered by AbleSci AI