人工智能
计算机科学
失真(音乐)
整改
特征(语言学)
计算机视觉
特征学习
图像校正
模式识别(心理学)
像素
工程类
电气工程
哲学
语言学
电压
放大器
带宽(计算)
计算机网络
作者
Zhaokang Liao,Wengang Zhou,Houqiang Li
标识
DOI:10.1109/tcsvt.2023.3315967
摘要
This paper focuses on fisheye image rectification. Existing learning-based solutions learn image representations that mix distortion features and content features. Since the distortion feature dominates the rectification process, we propose a novel distortion-aware representation learning framework, which decouples the distortion feature from the content feature, for fisheye image rectification. Specifically, we first pre-train a Vision Transformer with a supervised pre-text task, which regresses the distortion distribution map of a distorted image. The pre-training equips the Vision Transformer with the ability to capture distortion-related patterns. After that, the pre-trained model is fine-tuned to predict the pixel-wise flow map to rectify the fisheye images. Extensive experiments are conducted to evaluate our approach and verify our idea of feature decoupling. The experiment results demonstrate the state-of-the-art performance of our approach compared to existing algorithms, as well as its generality on real-world images. Our source code is publicly available at https://github.com/lzk9508/DaFIR.
科研通智能强力驱动
Strongly Powered by AbleSci AI