计算机科学
残余物
编码器
人工神经网络
人工智能
水下
计算机视觉
解码方法
模式识别(心理学)
语音识别
算法
海洋学
地质学
操作系统
作者
Jingchao Cao,Wangzhen Peng,Yutao Liu,Junyu Dong,Patrick Le Callet,Sam Kwong
标识
DOI:10.1109/tcsvt.2025.3556203
摘要
In underwater environments, the absorption and scattering of light often result in various types of degradation in captured images, including color cast, low contrast, low brightness, and blurriness. These undesirable effects pose significant challenges for both underwater photography and downstream tasks such as object detection, recognition, and navigation. To address these challenges, we propose a novel end-to-end underwater image enhancement (UIE) network via the multistage and mixed attention mechanism and a residual-based feature refinement module, called ERD. Specifically, our network includes an encoder stage for extracting features from input underwater images with channel, spatial, and patch attention modules to emphasize degraded channels and regions for restoration; a residual stage for further purification of informative features through sufficient feature learning; and a decoder stage for effective image reconstruction. Inspired by visual perception mechanism, we design the frequency domain loss and edge details loss to retain more high-frequency information and object details while ensuring that the enhanced image approximates the reference image in terms of color tone while preserving content and structure. To comprehensively evaluate our proposed UIE model, we also curated three additional underwater image datasets through online collection and generation using Cycle-GAN. Rigorous experiments conducted on a total of eight underwater image datasets demonstrate that the proposed ERD model outperforms state-of-the-art methods in enhancing both real-world and generated underwater images. Our code and datasets are available at https://github.com/fansuregrin/ERD.
科研通智能强力驱动
Strongly Powered by AbleSci AI