计算机科学
变压器
人工智能
棱锥(几何)
图像质量
质量得分
特征提取
数据挖掘
注意力网络
模式识别(心理学)
计算机视觉
图像(数学)
数学
工程类
电压
电气工程
几何学
公制(单位)
运营管理
作者
Jiliang Ma,Yihua Chen,Lv Chen,Zhenjun Tang
标识
DOI:10.1016/j.eswa.2024.125008
摘要
No-Reference Image Quality Assessment (NR-IQA) is a fundamental and important task in the field of computer vision. Most NR-IQA methods have limitation in making desirable NR-IQA performance due to the lack of sufficiently rich features. To address this problem, we propose a dual-attention pyramid Transformer network for NR-IQA. In the proposed method, a feature extraction module is firstly used to extract multi-scale features which contain rich distortion and semantic information. Then, a pyramid Transformer network with channel and spatial attentions is designed to learn multi-scale global features from spatial and channel aspects. The combination of pyramid structure and dual attentions enables our network to focus on features in different regions of the image and learn richer and more comprehensive global features. This in turn improves the quality score prediction performance. Finally, the score prediction module predicts the quality scores in different stages of the pyramid Transformer network by channel adaptive prediction branches and determines the final quality score by aggregating these quality scores. Extensive experiments performed on four widely used public databases show that our proposed method is superior to some state-of-the-art NR-IQA methods in perceiving image quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI