A study of the evaluation metrics for generative images containing combinational creativity

一致性(知识库) 公制(单位) 排名(信息检索) 组合逻辑 创造力 生成语法 计算机科学 图像(数学) 秩(图论) 人工智能 机器学习 数学 算法 工程类 逻辑门 心理学 组合数学 社会心理学 运营管理
作者
Boheng Wang,Yunhuai Zhu,Liuqing Chen,Jingcheng Liu,Lingyun Sun,Peter R.N. Childs
出处
期刊:Artificial intelligence for engineering design, analysis and manufacturing [Cambridge University Press]
卷期号:37
标识
DOI:10.1017/s0890060423000069
摘要

Abstract In the field of content generation by machine, the state-of-the-art text-to-image model, DALL⋅E, has advanced and diverse capacities for the combinational image generation with specific textual prompts. The images generated by DALL⋅E seem to exhibit an appreciable level of combinational creativity close to that of humans in terms of visualizing a combinational idea. Although there are several common metrics which can be applied to assess the quality of the images generated by generative models, such as IS, FID, GIQA, and CLIP, it is unclear whether these metrics are equally applicable to assessing images containing combinational creativity. In this study, we collected the generated image data from machine (DALL⋅E) and human designers, respectively. The results of group ranking in the Consensual Assessment Technique (CAT) and the Turing Test (TT) were used as the benchmarks to assess the combinational creativity. Considering the metrics’ mathematical principles and different starting points in evaluating image quality, we introduced coincident rate (CR) and average rank variation (ARV) which are two comparable spaces. An experiment to calculate the consistency of group ranking of each metric by comparing the benchmarks then was conducted. By comparing the consistency results of CR and ARV on group ranking, we summarized the applicability of the existing evaluation metrics in assessing generative images containing combinational creativity. In the four metrics, GIQA performed the closest consistency to the CAT and TT. It shows the potential as an automated assessment for images containing combinational creativity, which can be used to evaluate the images containing combinational creativity in the relevant task of design and engineering such as conceptual sketch, digital design image, and prototyping image.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
哎呦喂完成签到,获得积分10
刚刚
Xing完成签到,获得积分10
1秒前
白石磊发布了新的文献求助10
3秒前
zhuxd完成签到,获得积分10
3秒前
昕昕完成签到,获得积分10
5秒前
6秒前
小林完成签到,获得积分10
7秒前
Jasper应助lxlcx采纳,获得10
8秒前
日川冈坂完成签到 ,获得积分10
8秒前
昕昕发布了新的文献求助10
10秒前
进击的研狗完成签到 ,获得积分10
11秒前
shuangyanli完成签到,获得积分10
11秒前
11秒前
活泼的烙完成签到 ,获得积分10
12秒前
庸尘完成签到,获得积分10
14秒前
头号玩家发布了新的文献求助10
15秒前
Lucas应助多情赛君采纳,获得10
17秒前
17秒前
Mao完成签到,获得积分10
18秒前
teborlee完成签到,获得积分10
18秒前
carly完成签到 ,获得积分10
20秒前
希望天下0贩的0应助variant采纳,获得10
20秒前
11完成签到 ,获得积分10
21秒前
22秒前
哇咔咔完成签到 ,获得积分10
23秒前
wintersss完成签到,获得积分10
24秒前
852应助wqqq采纳,获得10
24秒前
25秒前
罗尔与柯西完成签到,获得积分10
25秒前
28秒前
variant完成签到,获得积分10
30秒前
多情赛君完成签到,获得积分10
32秒前
32秒前
variant发布了新的文献求助10
33秒前
Jc完成签到 ,获得积分10
34秒前
终于花开日完成签到 ,获得积分10
37秒前
sengow完成签到 ,获得积分10
37秒前
AI_S完成签到,获得积分10
38秒前
多情赛君发布了新的文献求助10
38秒前
小白完成签到,获得积分10
38秒前
高分求助中
Introduction to Strong Mixing Conditions Volumes 1-3 500
Tip60 complex regulates eggshell formation and oviposition in the white-backed planthopper, providing effective targets for pest control 400
Optical and electric properties of monocrystalline synthetic diamond irradiated by neutrons 320
共融服務學習指南 300
Essentials of Pharmacoeconomics: Health Economics and Outcomes Research 3rd Edition. by Karen Rascati 300
Peking Blues // Liao San 300
Political Ideologies Their Origins and Impact 13 edition 240
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3801112
求助须知:如何正确求助?哪些是违规求助? 3346777
关于积分的说明 10330165
捐赠科研通 3063151
什么是DOI,文献DOI怎么找? 1681349
邀请新用户注册赠送积分活动 807519
科研通“疑难数据库(出版商)”最低求助积分说明 763726