학술논문

A practical guide to the implementation of AI in orthopaedic research, Part 6: How to evaluate the performance of AI research?
Document Type
Academic Journal
Author
Oettl FC; Hospital for Special Surgery New York New York USA.; Schulthess Klinik Zurich Switzerland.; Pareek A; Sports Medicine and Shoulder Institute, Hospital for Special Surgery New York New York USA.; Winkler PW; Department for Orthopaedics and Traumatology, Kepler University Hospital GmbH Johannes Kepler University Linz Linz Austria.; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy University of Gothenburg Gothenburg Sweden.; Sahlgrenska Sports Medicine Center Göteborg Sweden.; Zsidai B; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy University of Gothenburg Gothenburg Sweden.; Sahlgrenska Sports Medicine Center Göteborg Sweden.; Pruneski JA; Department of Orthopaedic Surgery Tripler Army Medical Center Honolulu Hawaii USA.; Senorski EH; Sahlgrenska Sports Medicine Center Göteborg Sweden.; Department of Health and Rehabilitation, Institute of Neuroscience and Physiology, Sahlgrenska Academy University of Gothenburg Gothenburg Sweden.; Kopf S; Center of Orthopaedics and Traumatology, University Hospital Brandenburg an der Havel, Brandenburg Medical School Theodor Fontane Germany.; Ley C; Department of Mathematics University of Luxembourg Esch-sur-Alzette Luxembourg.; Herbst E; Department of Trauma, Hand and Reconstructive Surgery University Hospital Muenster Muenster Germany.; Oeding JF; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy University of Gothenburg Gothenburg Sweden.; Mayo Clinic Alix School of Medicine, Mayo Clinic Rochester Minnesota USA.; Grassi A; IIa Clinica Ortopedica e Traumatologica, IRCCS Istituto Ortopedico Rizzoli Bologna Italy.; Hirschmann MT; Department of Orthopaedic Surgery and Traumatology Kantonsspital Baselland Bruderholz Switzerland.; University of Basel Basel Switzerland.; Musahl V; Department of Orthopaedic Surgery, UPMC Freddie Fu Sports Medicine Center University of Pittsburgh Pittsburgh Pennsylvania USA.; Samuelsson K; Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy University of Gothenburg Gothenburg Sweden.; Sahlgrenska Sports Medicine Center Göteborg Sweden.; Department of Orthopaedics Sahlgrenska University Hospital Mölndal Sweden.; Tischer T; Department of Orthopaedic Surgery Universitymedicine Rostock Rostock Germany.; Department of Orthopaedic and Trauma Surgery Malteser Waldkrankenhaus Erlangen Erlangen Germany.; Feldt R; Department of Computer Science and Engineering Chalmers University of Technology Gothenburg Sweden.
Source
Publisher: Wiley Country of Publication: United States NLM ID: 101653750 Publication Model: eCollection Cited Medium: Print ISSN: 2197-1153 (Print) Linking ISSN: 21971153 NLM ISO Abbreviation: J Exp Orthop Subsets: PubMed not MEDLINE
Subject
Language
English
ISSN
2197-1153
Abstract
Artificial intelligence's (AI) accelerating progress demands rigorous evaluation standards to ensure safe, effective integration into healthcare's high-stakes decisions. As AI increasingly enables prediction, analysis and judgement capabilities relevant to medicine, proper evaluation and interpretation are indispensable. Erroneous AI could endanger patients; thus, developing, validating and deploying medical AI demands adhering to strict, transparent standards centred on safety, ethics and responsible oversight. Core considerations include assessing performance on diverse real-world data, collaborating with domain experts, confirming model reliability and limitations, and advancing interpretability. Thoughtful selection of evaluation metrics suited to the clinical context along with testing on diverse data sets representing different populations improves generalisability. Partnering software engineers, data scientists and medical practitioners ground assessment in real needs. Journals must uphold reporting standards matching AI's societal impacts. With rigorous, holistic evaluation frameworks, AI can progress towards expanding healthcare access and quality.
Level of Evidence: Level V.
Competing Interests: The authors declare no conflict of interest.
(© 2024 The Author(s). Journal of Experimental Orthopaedics published by John Wiley & Sons Ltd on behalf of European Society of Sports Traumatology, Knee Surgery and Arthroscopy.)