发布文献求助

Aligning Text-to-Image Models using Human Feedback

计算机科学图像（数学）集合（抽象数据类型）生成模型忠诚生成语法人工智能功能（生物学）机器学习计算机视觉模式识别（心理学）电信进化生物学生物程序设计语言

作者

Kimin Lee,Hao Liu,Moonkyung Ryu,Olivia Watkins,Yuqing Du,Craig Boutilier,Pieter Abbeel,Mohammad Ghavamzadeh,Shixiang Gu

出处

期刊：Cornell University - arXiv 日期：2023-01-01 被引量：28

链接

arxiv.org datacite.orgdoi.org

标识

DOI：10.48550/arxiv.2302.12192

摘要

Deep generative models have shown impressive results in text-to-image synthesis. However, current text-to-image models often generate images that are inadequately aligned with text prompts. We propose a fine-tuning method for aligning such models using human feedback, comprising three stages. First, we collect human feedback assessing model output alignment from a set of diverse text prompts. We then use the human-labeled image-text dataset to train a reward function that predicts human feedback. Lastly, the text-to-image model is fine-tuned by maximizing reward-weighted likelihood to improve image-text alignment. Our method generates objects with specified colors, counts and backgrounds more accurately than the pre-trained model. We also analyze several design choices and find that careful investigations on such design choices are important in balancing the alignment-fidelity tradeoffs. Our results demonstrate the potential for learning from human feedback to significantly improve text-to-image models.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: lanlan完成签到，获得积分10

刚刚; wa完成签到，获得积分20

刚刚; 毛毛虫完成签到，获得积分10

刚刚; 蒸蒸日上发布了新的文献求助10

3秒前; yyy完成签到，获得积分10

3秒前; Zephyr完成签到，获得积分10

5秒前; YY-Bubble完成签到，获得积分10

7秒前; Docsiwen完成签到，获得积分10

7秒前; wanci上传了应助文件

8秒前; 小蘑菇上传了应助文件

10秒前; f1ame完成签到，获得积分10

11秒前; 健壮丝袜完成签到，获得积分10

11秒前; yusuf发布了新的文献求助10

11秒前; 可爱的函函的应助被无辜秋珊采纳，获得10

12秒前; 健壮丝袜发布了新的文献求助10

13秒前; aokaoji发布了新的文献求助10

15秒前; 典雅的俊驰发布了新的文献求助10

15秒前; kgmilan完成签到，获得积分20

16秒前; 我是老大的应助被IKUN采纳，获得10

18秒前; 合适靖儿完成签到，获得积分10

20秒前; 孤独的盼曼完成签到，获得积分10

23秒前; 我是老大上传了应助文件

24秒前; danna的应助被Annie采纳，获得10

24秒前; dingz完成签到，获得积分10

24秒前; 李爱国的应助被zhaowenxian采纳，获得10

25秒前; ls完成签到，获得积分10

26秒前; lf发布了新的文献求助10

26秒前; persi完成签到，获得积分10

28秒前; IKUN发布了新的文献求助10

29秒前; aokaoji完成签到，获得积分20

29秒前; 情怀上传了应助文件

33秒前; 爆米花的应助被笔尖划痕采纳，获得10

35秒前; yusuf发布了新的文献求助10

39秒前; Jasper上传了应助文件

39秒前; 乐乐上传了应助文件

39秒前; 充电宝的应助被AixGnad采纳，获得10

40秒前; lf完成签到，获得积分10

41秒前; Lucas的应助被刘静采纳，获得10

42秒前; 企鹅发布了新的文献求助10

42秒前; 劼大大完成签到，获得积分10

43秒前

高分求助中: Assessing and Diagnosing Young Children with Neurodevelopmental Disorders (2nd Edition) 700; The Elgar Companion to Consumer Behaviour and the Sustainable Development Goals 540; The Martian climate revisited: atmosphere and environment of a desert planet 500; Images that translate 500; Transnational East Asian Studies 400; Towards a spatial history of contemporary art in China 400; Mapping the Stars: Celebrity, Metonymy, and the Networked Politics of Identity 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3843823; 求助须知：如何正确求助？哪些是违规求助？ 3386203; 关于积分的说明 10544094; 捐赠科研通 3106943; 什么是DOI，文献DOI怎么找？ 1711344; 邀请新用户注册赠送积分活动 824042; 科研通“疑难数据库（出版商）”最低求助积分说明 774409

今日热心研友

昏睡的蟠桃

无语的安白

收手吧大哥

小茄子爷爷

jenningseastera

可千万不要躺平呀

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通