Mitigating Age-Related Bias in Large Language Models: Strategies for Responsible Artificial Intelligence Development

计算机科学 人工智能 机器学习
作者
Zhuang Liu,S. Qian,Shuirong Cao,Tianyu Shi
出处
期刊:Informs Journal on Computing
标识
DOI:10.1287/ijoc.2024.0645
摘要

The increasing popularity of large language models (LLMs) in digital platforms elevates the urgency to address inherent biases, particularly age-related biases, which can significantly skew the model’s fairness and performance. This paper introduces a novel two-stage bias mitigation approach utilizing LLM’s empathy ability, reinforcement learning, and human-in-the-loop mechanisms to identify and correct age-related biases without altering model parameters. There are two modes for our bias mitigation strategy. Self-bias mitigation in the loop allows LLMs to self-assess and adjust their outputs autonomously, promoting inherent bias awareness and correction. Alternatively, cooperative bias mitigation in the loop leverages collaborative filtering among multiple LLMs to debate and mitigate biases through consensus. Furthermore, we introduce the empathetic perspective exchange strategy, which can further refine the answers by changing the perspective in the context information given to the LLM. In this way, more suitable responses applicable to different ages are generated. Our comprehensive evaluation across several data sets demonstrates that our trained model, FairLLM, significantly reduces age bias, outperforming existing techniques in fairness metrics. These findings underscore the effectiveness of our proposed framework in fostering the development of more equitable artificial intelligence systems, potentially benefiting a broader demographic spectrum by reducing digital ageism. History: This paper has been accepted by Kaushik Dutta for the Special Issue on Responsible AI and Data Science for Social Good. Funding: This work was supported by the National Natural Science Foundation of China [Grants 71971046, 72172029, 72403033, 72272028, and 72442025]. Supplemental Material: The software that supports the findings of this study is available within the paper and its Supplemental Information ( https://pubsonline.informs.org/doi/suppl/10.1287/ijoc.2024.0645 ) as well as from the IJOC GitHub software repository ( https://github.com/INFORMSJoC/2024.0645 ). The complete IJOC Software and Data Repository is available at https://informsjoc.github.io/ .
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
长生完成签到 ,获得积分10
刚刚
CCC完成签到 ,获得积分10
6秒前
13秒前
14秒前
蕾蕾发布了新的文献求助10
20秒前
丘比特应助亓昂采纳,获得10
20秒前
20秒前
21秒前
22秒前
24秒前
Kristine完成签到 ,获得积分10
24秒前
25秒前
swy发布了新的文献求助10
25秒前
26秒前
丸子发布了新的文献求助10
28秒前
28秒前
猪猪猪完成签到,获得积分10
29秒前
刘老哥6发布了新的文献求助10
30秒前
swy完成签到,获得积分10
33秒前
34秒前
CC完成签到 ,获得积分10
35秒前
oMayii完成签到 ,获得积分10
35秒前
科研通AI2S应助刘老哥6采纳,获得10
35秒前
天天快乐应助刘老哥6采纳,获得10
35秒前
陈志宏发布了新的文献求助10
36秒前
honey完成签到,获得积分10
36秒前
36秒前
田様应助maxinyu采纳,获得10
36秒前
37秒前
慕青应助mr_chxb82采纳,获得10
39秒前
科研圣体完成签到,获得积分10
40秒前
41秒前
41秒前
葱葱完成签到,获得积分10
41秒前
亓昂发布了新的文献求助10
42秒前
科研小白发布了新的文献求助10
42秒前
tcmlida完成签到,获得积分10
42秒前
烟花应助Wxj246801采纳,获得10
43秒前
44秒前
鱼鱼mm完成签到,获得积分20
45秒前
高分求助中
(应助此贴封号)【重要!!请各位详细阅读】【科研通的精品贴汇总】 10000
Les Mantodea de Guyane: Insecta, Polyneoptera [The Mantids of French Guiana] 3000
Determination of the boron concentration in diamond using optical spectroscopy 600
The Netter Collection of Medical Illustrations: Digestive System, Volume 9, Part III - Liver, Biliary Tract, and Pancreas (3rd Edition) 600
Founding Fathers The Shaping of America 500
A new house rat (Mammalia: Rodentia: Muridae) from the Andaman and Nicobar Islands 500
2025-2031全球及中国蛋黄lgY抗体行业研究及十五五规划分析报告(2025-2031 Global and China Chicken lgY Antibody Industry Research and 15th Five Year Plan Analysis Report) 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 生物化学 物理 纳米技术 计算机科学 内科学 化学工程 复合材料 物理化学 基因 催化作用 遗传学 冶金 电极 光电子学
热门帖子
关注 科研通微信公众号,转发送积分 4536886
求助须知:如何正确求助?哪些是违规求助? 3971922
关于积分的说明 12305219
捐赠科研通 3638764
什么是DOI,文献DOI怎么找? 2003448
邀请新用户注册赠送积分活动 1038853
科研通“疑难数据库(出版商)”最低求助积分说明 928264