Align Generative Artificial Intelligence with Human Preferences: A Novel Large Language Model Fine-Tuning Method for Online Review Management

计算机科学 人工智能 生成语法 偏爱 构造(python库) 点(几何) 机器学习 人类智力 数据科学 语言模型 人类行为 生成模型 幻觉 猛增 人工智能应用 修剪 路径(计算) 人机交互 知识管理
作者
Yanan Wang,Yong Ge
出处
期刊:Information Systems Research [Institute for Operations Research and the Management Sciences]
标识
DOI:10.1287/isre.2024.1518
摘要

Online reviews can shape where people stay, eat, and shop, but businesses often struggle to keep up with the flood of customer feedback. Although generative artificial intelligence (AI) offers a promising solution, general-purpose models are not designed for the specific judgment, tone, and accuracy required in customer review responses. This study introduces a new fine-tuning method that helps large language models generate review replies that better match human preferences in real business settings. The paper makes several technical advances. It identifies why review-response systems hallucinate and introduces a context-augmentation strategy to reduce factual errors. It also develops a theory-driven way to automatically construct preference data from existing review-response records, overcoming a major barrier in preference fine-tuning. In addition, the study proposes a curriculum learning design and a new support-constraint method that reduces the overconservatism of existing offline optimization approaches, with stronger theoretical guarantees. Tests on hotel reviews show that the method produces better responses than leading alternatives in both automated evaluations and human judgments. The findings point to a practical path for using AI to help firms respond faster and more consistently to customers while also underscoring the need for safeguards, human oversight, and domain-specific model alignment in customer-facing AI systems.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
田様应助不知道采纳,获得10
刚刚
丘比特应助哈哈镜阿姐采纳,获得10
1秒前
NexusExplorer应助yowar采纳,获得10
1秒前
2秒前
2秒前
辣辣发布了新的文献求助10
2秒前
3秒前
4秒前
章yy完成签到,获得积分10
4秒前
4秒前
不安梦桃发布了新的文献求助10
4秒前
AW完成签到,获得积分10
5秒前
Carsen完成签到,获得积分10
5秒前
狂野太兰完成签到,获得积分10
7秒前
7秒前
kakafan发布了新的文献求助10
7秒前
AN发布了新的文献求助10
7秒前
学术文献互助应助哎健身采纳,获得100
9秒前
10秒前
11秒前
yy完成签到,获得积分10
11秒前
辣辣完成签到,获得积分10
11秒前
fafa完成签到,获得积分10
11秒前
12秒前
wanci应助Kevin采纳,获得10
12秒前
qifeng完成签到,获得积分10
13秒前
chengxue完成签到,获得积分10
13秒前
14秒前
管箴发布了新的文献求助10
14秒前
xiumu应助小丸子采纳,获得10
14秒前
exosome完成签到,获得积分10
15秒前
Jasper应助AN采纳,获得10
15秒前
15秒前
16秒前
hhh发布了新的文献求助10
17秒前
ben完成签到,获得积分10
17秒前
追寻傲云完成签到 ,获得积分10
18秒前
莽兽鳞上最黑的皮完成签到,获得积分10
18秒前
大樗发布了新的文献求助100
19秒前
21秒前
高分求助中
Signals, Systems, and Signal Processing 610
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500
Cold War Transcended: Australia's China Policy, 1949-1990 470
Cybercrime: The Transformation of Crime in the Information Age, 2nd Edition 400
Moore's Clinically Oriented Anatomy 10th Edition 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6619754
求助须知:如何正确求助?哪些是违规求助? 8383702
关于积分的说明 17934722
捐赠科研通 5791188
什么是DOI,文献DOI怎么找? 2960657
邀请新用户注册赠送积分活动 1935864
关于科研通互助平台的介绍 1841564