元认知
任务(项目管理)
心理学
认知心理学
逻辑推理
读写能力
任务分析
规范(哲学)
计算机科学
人工智能
生成语法
生成模型
基本认知任务
心理语言学
自然语言处理
知识水平
分析推理
认知科学
认知负荷
多任务学习
认知
作者
Danielle Priscila Bueno Fernandes,Steeven Villa,Salla Nicholls,Otso Haavisto,Daniel Buschek,Albrecht Schmidt,Thomas Kosch,Chenxinran Shen,Robin Welsch
标识
DOI:10.1016/j.chb.2025.108779
摘要
Optimizing human-AI interaction requires users to reflect on their performance critically, yet little is known about generative AI systems’ effect on users’ metacognitive judgments. In two large-scale studies, we investigate how AI usage is associated with users’ metacognitive monitoring and performance in logical reasoning tasks. Specifically, our paper examines whether people using AI to complete tasks can accurately monitor how well they perform. In Study 1, participants (N = 246) used AI to solve 20 logical reasoning problems from the Law School Admission Test. While their task performance improved by three points compared to a norm population, participants overestimated their task performance by four points. Interestingly, higher AI literacy correlated with lower metacognitive accuracy, suggesting that those with more technical knowledge of AI were more confident but less precise in judging their own performance. Using a computational model, we explored individual differences in metacognitive accuracy and found that the Dunning-Kruger effect, usually observed in this task, ceased to exist with AI use. Study 2 (N = 452) replicates these findings. We discuss how AI levels cognitive and metacognitive performance in human-AI interaction and consider the consequences of performance overestimation for designing interactive AI systems that foster accurate self-monitoring, avoid overreliance, and enhance cognitive performance. • People are not able to accurately assess their performance when using AI. • Large Language Model usage levels the Dunning–Kruger effect. • Higher AI literacy correlates with lower self-assessment accuracy. • Higher confidence correlates with lower self-assessment accuracy. • LLM use improves human reasoning performance in the Law School Admission Test.
科研通智能强力驱动
Strongly Powered by AbleSci AI