학술논문

Can AI pass the written European Board Examination in Neurological Surgery? - Ethical and practical issues
Document Type
article
Source
Brain and Spine, Vol 4, Iss , Pp 102765- (2024)
Subject
Neurosurgery board examination
Artificial intelligence
Chat gpt
Bing
Bard
EANS
Neurology. Diseases of the nervous system
RC346-429
Language
English
ISSN
2772-5294
Abstract
Introduction: Artificial intelligence (AI) based large language models (LLM) contain enormous potential in education and training. Recent publications demonstrated that they are able to outperform participants in written medical exams. Research question: We aimed to explore the accuracy of AI in the written part of the EANS board exam. Material and methods: Eighty-six representative single best answer (SBA) questions, included at least ten times in prior EANS board exams, were selected by the current EANS board exam committee. The questions’ content was classified as 75 text-based (TB) and 11 image-based (IB) and their structure as 50 interpretation-weighted, 30 theory-based and 6 true-or-false. Questions were tested with Chat GPT 3.5, Bing and Bard. The AI and participant results were statistically analyzed through ANOVA tests with Stata SE 15 (StataCorp, College Station, TX). P-values of