Learning Degradation-Robust Spatiotemporal Frequency-Transformer for Video Super-Resolution

计算机科学 人工智能 频域 计算机视觉 时频分析 帧速率 特征提取 变压器 模式识别(心理学) 工程类 电压 电气工程 滤波器(信号处理)
作者
Zhongwei Qiu,Huan Yang,Jianlong Fu,Daochang Liu,Chang Xu,Dongmei Fu
出处
期刊:IEEE Transactions on Pattern Analysis and Machine Intelligence [Institute of Electrical and Electronics Engineers]
卷期号:45 (12): 14888-14904 被引量:2
标识
DOI:10.1109/tpami.2023.3312166
摘要

Video Super-Resolution (VSR) aims to restore high-resolution (HR) videos from low-resolution (LR) videos. Existing VSR techniques usually recover HR frames by extracting pertinent textures from nearby frames with known degradation processes. Despite significant progress, grand challenges remain to effectively extract and transmit high-quality textures from high-degraded low-quality sequences, such as blur, additive noises, and compression artifacts. This work proposes a novel degradation-robust Frequency-Transformer (FTVSR++) for handling low-quality videos that carry out self-attention in a combined space-time-frequency domain. First, video frames are split into patches and each patch is transformed into spectral maps in which each channel represents a frequency band. It permits a fine-grained self-attention on each frequency band so that real visual texture can be distinguished from artifacts. Second, a novel dual frequency attention (DFA) mechanism is proposed to capture the global and local frequency relations, which can handle different complicated degradation processes in real-world scenarios. Third, we explore different self-attention schemes for video processing in the frequency domain and discover that a "divided attention" which conducts joint space-frequency attention before applying temporal-frequency attention, leads to the best video enhancement quality. Extensive experiments on three widely-used VSR datasets show that FTVSR++ outperforms state-of-the-art methods on different low-quality videos with clear visual margins.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
wwwww发布了新的文献求助10
刚刚
Leon发布了新的文献求助10
1秒前
小t要读top博完成签到 ,获得积分10
1秒前
2秒前
2秒前
sumu完成签到,获得积分10
2秒前
出过门完成签到 ,获得积分10
2秒前
3秒前
丹丹发布了新的文献求助10
4秒前
月亮上的猫完成签到,获得积分10
4秒前
11发布了新的文献求助10
4秒前
希望天下0贩的0应助wxnice采纳,获得10
4秒前
5秒前
wwwww发布了新的文献求助10
5秒前
zxm098发布了新的文献求助10
5秒前
Leon完成签到,获得积分10
6秒前
wo发布了新的文献求助10
7秒前
CY03完成签到,获得积分10
7秒前
9秒前
11秒前
11秒前
ElbingX应助小福星采纳,获得10
11秒前
CY03发布了新的文献求助10
12秒前
顾矜应助俏皮南风采纳,获得10
12秒前
12秒前
13秒前
Lucas应助柏特瑞采纳,获得10
13秒前
WELXCNK完成签到,获得积分10
13秒前
难过的蘑菇完成签到,获得积分20
13秒前
14秒前
筱筱完成签到,获得积分10
15秒前
15秒前
15秒前
Radiant完成签到,获得积分20
16秒前
16秒前
mdjsf完成签到,获得积分10
17秒前
Camel完成签到,获得积分10
18秒前
18秒前
雾散完成签到,获得积分10
18秒前
18秒前
高分求助中
Teaching Social and Emotional Learning in Physical Education 900
Plesiosaur extinction cycles; events that mark the beginning, middle and end of the Cretaceous 500
Two-sample Mendelian randomization analysis reveals causal relationships between blood lipids and venous thromboembolism 500
Chinese-English Translation Lexicon Version 3.0 500
[Lambert-Eaton syndrome without calcium channel autoantibodies] 440
薩提亞模式團體方案對青年情侶輔導效果之研究 400
3X3 Basketball: Everything You Need to Know 310
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2387865
求助须知:如何正确求助?哪些是违规求助? 2094376
关于积分的说明 5272747
捐赠科研通 1821076
什么是DOI,文献DOI怎么找? 908483
版权声明 559300
科研通“疑难数据库(出版商)”最低求助积分说明 485355