计算机科学
人工智能
生成语法
偏爱
构造(python库)
点(几何)
机器学习
人类智力
数据科学
语言模型
人类行为
生成模型
幻觉
猛增
人工智能应用
修剪
路径(计算)
人机交互
知识管理
标识
DOI:10.1287/isre.2024.1518
摘要
Online reviews can shape where people stay, eat, and shop, but businesses often struggle to keep up with the flood of customer feedback. Although generative artificial intelligence (AI) offers a promising solution, general-purpose models are not designed for the specific judgment, tone, and accuracy required in customer review responses. This study introduces a new fine-tuning method that helps large language models generate review replies that better match human preferences in real business settings. The paper makes several technical advances. It identifies why review-response systems hallucinate and introduces a context-augmentation strategy to reduce factual errors. It also develops a theory-driven way to automatically construct preference data from existing review-response records, overcoming a major barrier in preference fine-tuning. In addition, the study proposes a curriculum learning design and a new support-constraint method that reduces the overconservatism of existing offline optimization approaches, with stronger theoretical guarantees. Tests on hotel reviews show that the method produces better responses than leading alternatives in both automated evaluations and human judgments. The findings point to a practical path for using AI to help firms respond faster and more consistently to customers while also underscoring the need for safeguards, human oversight, and domain-specific model alignment in customer-facing AI systems.
科研通智能强力驱动
Strongly Powered by AbleSci AI