引用
计算机科学
对比度(视觉)
干预(咨询)
人工智能
数据科学
机器学习
自然语言处理
心理学
精神科
万维网
作者
Muhammad Hasnain,Khursheed Aurangzeb,Musaed Alhussein,Imran Ghani,Muhammad Hamza Mahmood
标识
DOI:10.3389/frai.2025.1579375
摘要
Introduction The advent of large language models and their applications have gained significant attention due to their strengths in natural language processing. Methods In this study, ChatGPT and DeepSeek are utilized as AI models to assist in diagnosis based on the responses generated to clinical questions. Furthermore, ChatGPT, Claude, and DeepSeek are used to analyze images to assess their potential diagnostic capabilities, applying the various sensitivity analyses described. We employ prompt engineering techniques and evaluate their abilities to generate high quality responses. We propose several prompts and use them to answer important information on conjunctivitis. Results Our findings show that DeepSeek excels in offering precise and comprehensive information on specific topics related to conjunctivitis. DeepSeek provides detailed explanations and in depth medical insights. In contrast, the ChatGPT model provides generalized public information on the infection, which makes it more suitable for broader and less technical discussions. In this study, DeepSeek achieved a better performance with a 7% hallucination rate compared to ChatGPT's 13%. Claude demonstrated perfect 100% accuracy in binary classification, significantly outperforming ChatGPT's 62.5% accuracy. Discussion DeepSeek showed limited performance in understanding images dataset on conjunctivitis. This comparative analysis serves as an insightful reference for scholars and health professionals applying these models in varying medical contexts.
科研通智能强力驱动
Strongly Powered by AbleSci AI