计算机科学
生成语法
人类智力
劳动力
社会智力
知识管理
基础(证据)
人工智能
数据科学
环境智能
管理科学
人机交互
推荐系统
机器学习
认知
劳动力发展
生成模型
电池(电)
劳动力规划
美国劳动力的老龄化
社会学习
工作表现
作者
Wen Wang,Siqi Pei,Tianshu Sun
标识
DOI:10.1287/isre.2023.0487
摘要
This study introduces a novel, human-centered framework for evaluating the holistic intelligence of large language models (LLMs), using behavioral theory and experimental benchmarks drawn from human intelligence. Through extensive online experiments, the framework reveals that GPT-4 outperforms humans in cognitive, emotional, and creative intelligence, but falls short in social intelligence, especially in social interest, self-efficacy, and understanding mental states. Beyond theoretical insight, the study validates this framework by assessing GPT-4’s impact across diverse job roles, finding results consistent with established labor market research. It also offers a reusable tool for firms and policymakers to evaluate LLM intelligence and forecast job-level impacts. This enables informed decisions about where and how to integrate LLMs, match models to specific job requirements, and identify risks in socially intensive roles. The framework provides a foundation for responsible LLM deployment, ensuring alignment with human-centered structures and supporting strategic workforce planning.
科研通智能强力驱动
Strongly Powered by AbleSci AI