Another scale-guided parallel transformer for image aesthetic assessment

计算机科学 人工智能 安全性令牌 变压器 卷积神经网络 特征提取 像素 模式识别(心理学) 特征(语言学) 计算机视觉 特征学习 工程类 电气工程 哲学 语言学 计算机安全 电压
作者
Lili Shen,XU Shao-hu,Jing Zhang,Bo Peng
出处
期刊:Journal of Electronic Imaging [SPIE]
卷期号:32 (02)
标识
DOI:10.1117/1.jei.32.2.023035
摘要

Image aesthetic assessment (IAA) is a challenging task in computer vision fields, which aims to automatically evaluate image beauty by simulating human perception on image aesthetic. With the development of deep learning, although convolutional neural network (CNN)-based IAA approaches have achieved extraordinary progress, CNN experiences difficulty to capture long-distance relationships among visual elements. There is a strong correlation between image layout and image semantic information for image aesthetic. In order to solve this problem, an another scale-guided parallel transformer is proposed, including a multiscale local feature extractor (ME), a feature projection (FP), and an another scale-guided parallel feature fusion transformer (AST). The ME captures primary local features with classic ResNet at multiple scales. The FP performs dimension transformation on feature maps for each scale, which can obtain feature token and aesthetic token. The AST with two parallel transformer encoders is exploited to highlight the significant regions in the holistic image, in which the feature tokens and the aesthetic token from another scale are grouped together to obtain interscale guidance. The final score distribution is achieved by weighting multiple aesthetic tokens with learnable parameters for unified aesthetics assessment. Extensive experiments on two public datasets, including aesthetic visual analysis and aesthetics and attributes database, demonstrate that the proposed method outperforms the state-of-the-art methods across three different tasks.

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
CodeCraft应助DandanHan0916采纳,获得10
刚刚
充电宝应助鲤鱼导师采纳,获得10
刚刚
1秒前
1秒前
量子星尘发布了新的文献求助10
1秒前
跳脚的虾完成签到 ,获得积分10
2秒前
12发布了新的文献求助10
2秒前
举个栗子发布了新的文献求助10
4秒前
VTMS完成签到,获得积分10
4秒前
阳佟雪旋发布了新的文献求助10
5秒前
粥粥完成签到 ,获得积分10
5秒前
欣慰阑悦完成签到,获得积分20
6秒前
你好发布了新的文献求助10
6秒前
兔兔发布了新的文献求助10
6秒前
科研通AI5应助科研通管家采纳,获得10
6秒前
菜菜爸爸完成签到,获得积分10
6秒前
6秒前
顾矜应助科研通管家采纳,获得10
6秒前
7秒前
7秒前
酷波er应助科研通管家采纳,获得10
7秒前
星辰大海应助科研通管家采纳,获得10
7秒前
7秒前
7秒前
7秒前
浮游应助科研通管家采纳,获得10
7秒前
王子心发布了新的文献求助10
7秒前
我是老大应助me采纳,获得10
7秒前
KAIDOHARA完成签到,获得积分10
8秒前
8秒前
9秒前
完美世界应助顺心飞雪采纳,获得10
9秒前
小蘑菇应助solitude采纳,获得10
9秒前
11秒前
12秒前
LD完成签到,获得积分10
13秒前
13秒前
善学以致用应助低空飞行采纳,获得10
13秒前
杨迪祥完成签到 ,获得积分10
13秒前
14秒前
高分求助中
合成生物食品制造技术导则,团体标准,编号:T/CITS 396-2025 1000
The Leucovorin Guide for Parents: Understanding Autism’s Folate 1000
Pipeline and riser loss of containment 2001 - 2020 (PARLOC 2020) 1000
Critical Thinking: Tools for Taking Charge of Your Learning and Your Life 4th Edition 500
Comparing natural with chemical additive production 500
Atlas of Liver Pathology: A Pattern-Based Approach 500
Phylogenetic study of the order Polydesmida (Myriapoda: Diplopoda) 500
热门求助领域 (近24小时)
化学 医学 生物 材料科学 工程类 有机化学 内科学 生物化学 物理 计算机科学 纳米技术 遗传学 基因 复合材料 化学工程 物理化学 病理 催化作用 免疫学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 5241165
求助须知:如何正确求助?哪些是违规求助? 4407970
关于积分的说明 13720750
捐赠科研通 4276970
什么是DOI,文献DOI怎么找? 2346822
邀请新用户注册赠送积分活动 1343948
关于科研通互助平台的介绍 1302074