民主
机制(生物学)
捐赠
机构设计
强化学习
收入
集体智慧
价值(数学)
人工智能
经济
计算机科学
政治学
微观经济学
法学
机器学习
认识论
财务
哲学
政治
作者
Raphael Köster,Jan Balaguer,Andrea Tacchetti,Ari Weinstein,Tina Zhu,Oliver Hauser,Duncan Williams,Lucy Campbell-Gillingham,Phoebe Thacker,Matthew Botvinick,Christopher Summerfield
标识
DOI:10.1038/s41562-022-01383-x
摘要
Abstract Building artificial intelligence (AI) that aligns with human values is an unsolved problem. Here we developed a human-in-the-loop research pipeline called Democratic AI, in which reinforcement learning is used to design a social mechanism that humans prefer by majority. A large group of humans played an online investment game that involved deciding whether to keep a monetary endowment or to share it with others for collective benefit. Shared revenue was returned to players under two different redistribution mechanisms, one designed by the AI and the other by humans. The AI discovered a mechanism that redressed initial wealth imbalance, sanctioned free riders and successfully won the majority vote. By optimizing for human preferences, Democratic AI offers a proof of concept for value-aligned policy innovation.
科研通智能强力驱动
Strongly Powered by AbleSci AI