自编码
失真(音乐)
计算机科学
人工智能
代表(政治)
加权
图像质量
模式识别(心理学)
质量(理念)
图像(数学)
计算机视觉
数据挖掘
人工神经网络
带宽(计算)
认识论
政治
医学
放大器
计算机网络
哲学
政治学
法学
放射科
作者
Zehong Zhou,Fei Zhou,Guoping Qiu
标识
DOI:10.1109/tcsvt.2023.3299328
摘要
The visual quality of an image mainly relies on its content and its distortions. However, the adaptability between their contributions to the image quality has not be well investigated yet. Besides, albeit of many promising efforts, lacking sufficient labeled data still hinders the robust representation of quality-related information. In this work, we first design a self-supervised architecture, named collaborative autoencoder (COAE), to separately represent the content and the distortion information, and then develop a Self-Adaptive Weighting based quAlity predictoR (SAWAR) to balance the individual representations of the content and the distortions in the prediction of image quality. Specifically, the COAE is trained with large-scale unlabeled data, consisting of a content autoencoder (CAE) and a distortion autoencoder (DAE) that work collaboratively and individually. While the CAE is a standard autoencoder for the content representation, the design of the DAE is unique. We introduce the CAE-encoded content representation as an extra input to the decoder of the DAE to learn to reconstruct distorted images, thus effectively forcing it to extract the distortion representation. The SAWAR, whose parameter number is much smaller than that of the COAE, is trained with labeled data in existing IQA datasets. It takes advantage of the interaction between the image content and the distortions to adaptively balance their contributions. Extensive experiments show that the COAE effectively extracts quality-related representations and the SAWAR achieves the state-of-the-art performance.
科研通智能强力驱动
Strongly Powered by AbleSci AI