학술논문

Federated learning for predicting clinical outcomes in patients with COVID-19
Document Type
article
Author
Dayan, IttaiRoth, Holger RZhong, AoxiaoHarouni, AhmedGentili, AmilcareAbidin, Anas ZLiu, AndrewCosta, Anthony BeardsworthWood, Bradford JTsai, Chien-SungWang, Chih-HungHsu, Chun-NanLee, CKRuan, PeiyingXu, DaguangWu, DufanHuang, EddieKitamura, Felipe CamposLacey, Griffinde Antônio Corradi, Gustavo CésarNino, GustavoShin, Hao-HsinObinata, HirofumiRen, HuiCrane, Jason CTetreault, JesseGuan, JiahuiGarrett, John WKaggie, Joshua DPark, Jung GilDreyer, KeithJuluru, KrishnaKersten, KristopherRockenbach, Marcio Aloisio Bezerra CavalcantiLinguraru, Marius GeorgeHaider, Masoom AAbdelMaseeh, MeenaRieke, NicolaDamasceno, Pablo Fe Silva, Pedro Mario CruzWang, PochuanXu, ShengKawano, ShuichiSriswasdi, SiraPark, Soo YoungGrist, Thomas MBuch, VarunJantarabenjakul, WatsamonWang, WeichungTak, Won YoungLi, XiangLin, XihongKwon, Young JoonQuraini, AboodFeng, AndrewPriest, Andrew NTurkbey, BarisGlicksberg, BenjaminBizzo, BernardoKim, Byung SeokTor-Díez, CarlosLee, Chia-ChengHsu, Chia-JungLin, ChinLai, Chiu-LingHess, Christopher PCompas, ColinBhatia, DeepekshaOermann, Eric KLeibovitz, EvanSasaki, HisashiMori, HitoshiYang, IsaacSohn, Jae HoMurthy, Krishna Nand KeshavaFu, Li-Chende Mendonça, Matheus Ribeiro FurtadoFralick, MikeKang, Min KyuAdil, MohammadGangai, NatalieVateekul, PeeraponElnajjar, PierreHickman, SarahMajumdar, SharmilaMcLeod, Shelley LReed, SheridanGräf, StefanHarmon, StephanieKodama, TatsuyaPuthanakit, ThanyaweeMazzulli, Tonyde Lavor, Vitor LimaRakvongthai, YothinLee, Yu RimWen, YuhongGilbert, Fiona JFlores, Mona GLi, Quanzheng
Source
Nature Medicine. 27(10)
Subject
Patient Safety
Good Health and Well Being
COVID-19
Electronic Health Records
Humans
Machine Learning
Outcome Assessment
Health Care
Prognosis
SARS-CoV-2
Medical and Health Sciences
Immunology
Language
Abstract
Federated learning (FL) is a method used for training artificial intelligence models with data from multiple sources while maintaining data anonymity, thus removing many barriers to data sharing. Here we used data from 20 institutes across the globe to train a FL model, called EXAM (electronic medical record (EMR) chest X-ray AI model), that predicts the future oxygen requirements of symptomatic patients with COVID-19 using inputs of vital signs, laboratory data and chest X-rays. EXAM achieved an average area under the curve (AUC) >0.92 for predicting outcomes at 24 and 72 h from the time of initial presentation to the emergency room, and it provided 16% improvement in average AUC measured across all participating sites and an average increase in generalizability of 38% when compared with models trained at a single site using that site's data. For prediction of mechanical ventilation treatment or death at 24 h at the largest independent test site, EXAM achieved a sensitivity of 0.950 and specificity of 0.882. In this study, FL facilitated rapid data science collaboration without data exchange and generated a model that generalized across heterogeneous, unharmonized datasets for prediction of clinical outcomes in patients with COVID-19, setting the stage for the broader use of FL in healthcare.