학술논문

Comparing ChatGPT-4 to BARD for Accuracy and Completeness of Responses to Questions Derived from the International Consensus Statement on Endoscopic Skull Base Surgery.
Document Type
Article
Source
Journal of Neurological Surgery. Part B. Skull Base. 2024 Supplement, Vol. 85, pS1-S398. 398p.
Subject
*SKULL base
*CHATGPT
*SKULL surgery
*LANGUAGE models
Language
ISSN
2193-6331
Abstract
This article discusses the use of artificial intelligence (AI) language models, specifically Chat Generative Pre-Trained Transformer 4 (GPT-4) by OpenAI and Bard by Google, in answering questions and providing information to the public. The study compares the accuracy and completeness of responses generated by GPT-4 and Bard to questions based on the International Consensus Statement on Endoscopic Skull Base Surgery (ICAR:SB) guidelines. The results show that GPT-4 had higher accuracy and completeness scores compared to Bard. The study concludes that AI language models have the potential to be effective tools for disseminating information in the future. [Extracted from the article]