计算机科学
人工智能
花键(机械)
整改
图像校正
单应性
非线性系统
计算机视觉
特征学习
失真(音乐)
数学
电压
放大器
计算机网络
统计
物理
投射试验
结构工程
带宽(计算)
量子力学
射影空间
工程类
作者
Kang Liao,Lang Nie,Chunyu Lin,Zishuo Zheng,Yao Zhao
标识
DOI:10.1109/iccv51070.2023.00991
摘要
The wide-angle lens shows appealing applications in VR technologies, but it introduces severe radial distortion into its captured image. To recover the realistic scene, previous works devote to rectifying the content of the wide-angle image. However, such a rectification solution inevitably distorts the image boundary, which changes related geometric distributions and misleads the current vision perception models. In this work, we explore constructing a win-win representation on both content and boundary by contributing a new learning model, i.e., Rectangling Rectification Network (RecRecNet). In particular, we propose a thin-plate spline (TPS) module to formulate the nonlinear and non-rigid transformation for rectangling images. By learning the control points on the rectified image, our model can flexibly warp the source structure to the target domain and achieves an end-to-end unsupervised deformation. To relieve the complexity of structure approximation, we then inspire our RecRecNet to learn the gradual deformation rules with a DoF (Degree of Freedom)-based curriculum learning. By increasing the DoF in each curriculum stage, namely, from similarity transformation (4-DoF) to homography transformation (8-DoF), the network is capable of investigating more detailed deformations, offering fast convergence on the final rectangling task. Experiments show the superiority of our solution over the compared methods on both quantitative and qualitative evaluations. The code and dataset are available at https://github.com/KangLiao929/RecRecNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI