AI makes you smarter but none the wiser: The disconnect between performance and metacognition

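Metacognition; Task (project management); Psychology; Cognitive psychology; Logical reasoning; Literacy; Task analysis; Norm (philosophy); Computer science; Artificial intelligence; Generative grammar; Generative model; Elementary cognitive task; Psycholinguistics; Natural language processing; Knowledge level; Analytic reasoning; Cognitive science; Cognitive load; Multi-task learning; Cognition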

Authors

Danielle Priscila Bueno Fernandes, Steeven Villa, Salla Nicholls, Otso Haavisto, Daniel Buschek, Albrecht Schmidt, Thomas Kosch, Chenxinran Shen, Robin Welsch

Source

Journal: Computers in Human Behavior [Elsevier BV]
Date: 2025-10-09 Volume: 175, article 108779 Citations: 11

Identifier

DOI: 10.1016/j.chb.2025.108779

Abstract

Optimizing human-AI interaction requires users to reflect on their performance critically, yet little is known about generative AI systems’ effect on users’ metacognitive judgments. In two large-scale studies, we investigate how AI usage is associated with users’ metacognitive monitoring and performance in logical reasoning tasks. Specifically, our paper examines whether people using AI to complete tasks can accurately monitor how well they perform. In Study 1, participants (N = 246) used AI to solve 20 logical reasoning problems from the Law School Admission Test. While their task performance improved by three points compared to a norm population, participants overestimated their task performance by four points. Interestingly, higher AI literacy correlated with lower metacognitive accuracy, suggesting that those with more technical knowledge of AI were more confident but less precise in judging their own performance. Using a computational model, we explored individual differences in metacognitive accuracy and found that the Dunning-Kruger effect, usually observed in this task, ceased to exist with AI use. Study 2 (N = 452) replicates these findings. We discuss how AI levels cognitive and metacognitive performance in human-AI interaction and consider the consequences of performance overestimation for designing interactive AI systems that foster accurate self-monitoring, avoid overreliance, and enhance cognitive performance.

• People are not able to accurately assess their performance when using AI.
• Large Language Model usage levels the Dunning–Kruger effect.
• Higher AI literacy correlates with lower self-assessment accuracy.
• Higher confidence correlates with lower self-assessment accuracy.
• LLM use improves human reasoning performance in the Law School Admission Test.
