Assessing the Accuracy of Artificial Intelligence Models in Scoliosis Classification and Suggested Therapeutic Approaches

Cited by: 8
Authors
Fabijan, Artur [1]
Zawadzka-Fabijan, Agnieszka [2]
Fabijan, Robert
Zakrzewski, Krzysztof [1]
Nowoslawska, Emilia [1]
Polis, Bartosz [1]
Affiliations
[1] Polish Mothers Mem Hosp Res Inst, Dept Neurosurg, PL-93338 Lodz, Poland
[2] Med Univ Lodz, Fac Hlth Sci, Dept Rehabil Med, PL-90419 Lodz, Poland
Keywords
scoliosis; artificial intelligence; PMC-LLaMA; ChatGPT 4; clinical decision support systems
DOI
10.3390/jcm13144013
Chinese Library Classification (CLC) number
R5 [Internal Medicine]
Discipline classification code
100201 [Internal Medicine]
Abstract
Background: Open-source artificial intelligence models (OSAIMs) are increasingly being applied in various fields, including IT and medicine, offering promising solutions for diagnostic and therapeutic interventions. In response to the growing interest in AI for clinical diagnostics, we evaluated several OSAIMs (ChatGPT 4, Microsoft Copilot, Gemini, PopAi, You Chat, Claude, and the specialized PMC-LLaMA 13B), assessing their ability to classify scoliosis severity and recommend treatments based on radiological descriptions from AP radiographs. Methods: Our study employed a two-stage methodology, in which descriptions of single-curve scoliosis were analyzed by AI models after being evaluated by two independent neurosurgeons. Statistical analysis involved the Shapiro-Wilk test for normality, with non-normal distributions described using medians and interquartile ranges. Inter-rater reliability was assessed using Fleiss' kappa, and performance metrics, such as accuracy, sensitivity, specificity, and F1 scores, were used to evaluate the AI systems' classification accuracy. Results: The analysis indicated that although some AI systems, such as ChatGPT 4, Copilot, and PopAi, accurately reflected the recommended Cobb angle ranges for disease severity and treatment, others, such as Gemini and Claude, required further calibration. In particular, PMC-LLaMA 13B expanded the classification range for moderate scoliosis, potentially influencing clinical decisions and delaying interventions. Conclusions: These findings highlight the need for the continuous refinement of AI models to enhance their clinical applicability.
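The abstract names its evaluation measures (accuracy, sensitivity, specificity, F1, and Fleiss' kappa) without showing how they are computed. The following is a minimal sketch of the standard definitions, not the authors' analysis code. It assumes one-vs-rest scoring for a single severity class. The function names, the severity labels, and the toy data are hypothetical and do not come from the paper.

```python
from collections import Counter


def binary_metrics(y_true, y_pred, positive):
    """One-vs-rest accuracy, sensitivity, specificity, and F1 for one class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0
    specificity = tn / (tn + fp) if tn + fp else 0.0
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    f1 = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f1": f1,
    }


def fleiss_kappa(ratings):
    """Fleiss' kappa; `ratings` holds one list of labels per subject,
    with the same number of raters for every subject."""
    n_subjects = len(ratings)
    n_raters = len(ratings[0])
    counts = [Counter(subject) for subject in ratings]
    categories = set().union(*counts)
    # Observed agreement: mean over subjects of the share of agreeing rater pairs.
    p_bar = sum(
        (sum(c * c for c in cnt.values()) - n_raters) / (n_raters * (n_raters - 1))
        for cnt in counts
    ) / n_subjects
    # Chance agreement from the overall proportion of each category.
    p_e = sum(
        (sum(cnt[cat] for cnt in counts) / (n_subjects * n_raters)) ** 2
        for cat in categories
    )
    if p_e == 1.0:  # every rating falls in one category; kappa is undefined
        return float("nan")
    return (p_bar - p_e) / (1 - p_e)


# Hypothetical toy data: reference labels vs. one model's answers.
reference = ["mild", "moderate", "severe", "moderate"]
model_out = ["mild", "moderate", "moderate", "moderate"]
moderate_scores = binary_metrics(reference, model_out, positive="moderate")

# Hypothetical toy data: three raters labelling two cases.
kappa = fleiss_kappa([["mild", "mild", "severe"], ["mild", "severe", "severe"]])
```

In a multi-class setting such as severity grading, `binary_metrics` would be called once per class; how the paper aggregated across classes is not stated in this record.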
Pages: 25