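Medical diagnosis
Software deployment
Nursing
Taxonomy (biology)
Protection
Medicine
Psychology
MEDLINE
Emergency medical care
Process management
Patient safety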
Authors
Metin Tuncer,Turgay Yalcinkaya
Abstract
AIM: To evaluate the capability of large language models to generate nursing diagnoses based on NANDA-I Taxonomy II and assess their performance across domains and overall.
BACKGROUND: Large language models are emerging tools in nursing, showing potential to aid in diagnosis generation and education. However, their accuracy and applicability in clinical and educational settings remain underexplored.
METHODS: This cross-sectional comparative study used 10 realistic patient scenarios based on NANDA-I Taxonomy II, covering 12 domains. The study aimed to evaluate the capability of four models to generate nursing diagnoses based on patient scenarios. The responses were assessed by five nursing experts for accuracy and alignment with NANDA-I Taxonomy II in a single-blind evaluation process.
RESULTS: All models demonstrated similar performance across different domains and overall, with Claude attaining the highest overall performance score. Expert evaluations indicated moderate interrater reliability.
DISCUSSION: Small variations between models and occasional omissions suggest that expert review is still required before clinical use.
CONCLUSIONS: Large language models are not yet sufficiently reliable for independent use in clinical settings and nursing education. Their application as supportive tools necessitates a cautious approach. Moreover, the development of specialized models designed to address the unique demands of the nursing field would be advantageous.
IMPLICATIONS FOR NURSING: When large language models are used in nursing practice, their limitations should be considered, and the outputs they produce should be verified by nurses.
IMPLICATIONS FOR NURSING POLICY: Ensuring the safe integration of artificial intelligence tools into nursing necessitates the establishment of robust regulatory policies to safeguard patient safety, the deployment of effective systems to monitor models' performance, and the development of comprehensive guidelines and training programs.