학술논문

Federated Learning Enables Big Data for Rare Cancer Boundary Detection
Document Type
Working Paper
Author
Pati, SarthakBaid, UjjwalEdwards, BrandonSheller, MicahWang, Shih-HanReina, G AnthonyFoley, PatrickGruzdev, AlexeyKarkada, DeepthiDavatzikos, ChristosSako, ChiharuGhodasara, SatyamBilello, MichelMohan, SuyashVollmuth, PhilippBrugnara, GianlucaPreetha, Chandrakanth JSahm, FelixMaier-Hein, KlausZenk, MaximilianBendszus, MartinWick, WolfgangCalabrese, EvanRudie, JeffreyVillanueva-Meyer, JavierCha, SoonmeeIngalhalikar, MadhuraJadhav, ManaliPandey, UmangSaini, JitenderGarrett, JohnLarson, MatthewJeraj, RobertCurrie, StuartFrood, RussellFatania, KaviHuang, Raymond YChang, KenBalana, CarmenCapellades, JaumePuig, JosepTrenkler, JohannesPichler, JosefNecker, GeorgHaunschmidt, AndreasMeckel, StephanShukla, GauravLiem, SpencerAlexander, Gregory SLombardo, JosephPalmer, Joshua DFlanders, Adam EDicker, Adam PSair, Haris IJones, Craig KVenkataraman, ArchanaJiang, MeiruiSo, Tiffany YChen, ChengHeng, Pheng AnnDou, QiKozubek, MichalLux, FilipMichálek, JanMatula, PetrKeřkovský, MilošKopřivová, TerezaDostál, MarekVybíhal, VáclavVogelbaum, Michael AMitchell, J RossFarinhas, JoaquimMaldjian, Joseph AYogananda, Chandan Ganesh BangalorePinho, Marco CReddy, DivyaHolcomb, JamesWagner, Benjamin CEllingson, Benjamin MCloughesy, Timothy FRaymond, CatalinaOughourlian, TaliaHagiwara, AkifumiWang, ChencaiTo, Minh-SonBhardwaj, SargamChong, CheeAgzarian, MarcFalcão, Alexandre XavierMartins, Samuel BTeixeira, Bernardo C ASprenger, FláviaMenotti, DavidLucio, Diego RLaMontagne, PamelaMarcus, DanielWiestler, BenediktKofler, FlorianEzhov, IvanMetz, MarieJain, RajanLee, MatthewLui, Yvonne WMcKinley, RichardSlotboom, JohannesRadojewski, PiotrMeier, RaphaelWiest, RolandMurcia, DerrickFu, EricHaas, RourkeThompson, JohnOrmond, David RyanBadve, ChaitraSloan, Andrew EVadmal, VachanWaite, KristinColen, Rivka RPei, LinminAk, MuratSrinivasan, AshokBapuraj, J RajivRao, ArvindWang, NicholasYoshiaki, OtaMoritani, ToshioTurk, SevcanLee, JoonsangPrabhudesai, SnehalMorón, FannyMandel, JacobKamnitsas, KonstantinosGlocker, BenDixon, Luke V MWilliams, MatthewZampakis, PeterPanagiotopoulos, VasileiosTsiganos, PanagiotisAlexiou, SotirisHaliassos, IliasZacharaki, Evangelia IMoustakas, KonstantinosKalogeropoulou, ChristinaKardamakis, Dimitrios MChoi, Yoon SeongLee, Seung-KooChang, Jong HeeAhn, Sung SooLuo, BingPoisson, LailaWen, NingTiwari, PallaviVerma, RuchikaBareja, RohanYadav, IpsaChen, JonathanKumar, NeerajSmits, Marionvan der Voort, Sebastian RAlafandi, AhmedIncekara, FatihWijnenga, Maarten MJKapsas, GeorgiosGahrmann, RenskeSchouten, Joost WDubbink, Hendrikus JVincent, Arnaud JPEBent, Martin J van denFrench, Pim JKlein, StefanYuan, YadingSharma, SonamTseng, Tzu-ChiAdabi, SabaNiclou, Simone PKeunen, OlivierHau, Ann-ChristinVallières, MartinFortin, DavidLepage, MartinLandman, BennettRamadass, KarthikXu, KaiwenChotai, SilkyChambless, Lola BMistry, AkshitkumarThompson, Reid CGusev, YuriyBhuvaneshwar, KrithikaSayah, AnoushehBencheqroun, CameliaBelouali, AnasMadhavan, SubhaBooth, Thomas CChelliah, AlyshaModat, MarcShuaib, HarisDragos, CarmenAbayazeed, AlyKolodziej, KennethHill, MichaelAbbassy, AhmedGamal, ShadyMekhaimar, MahmoudQayati, MohamedReyes, MauricioPark, Ji EunYun, JihyeKim, Ho SungMahajan, AbhishekMuzi, MarkBenson, SeanBeets-Tan, Regina G HTeuwen, JonasHerrera-Trujillo, AlejandroTrujillo, MariaEscobar, WilliamAbello, AnaBernal, JoseGómez, JhonChoi, JosephBaek, StephenKim, YusungIsmael, HebaAllen, BryanBuatti, John MKotrotsou, AikateriniLi, HongweiWeiss, TobiasWeller, MichaelBink, AndreaPouymayou, BertrandShaykh, Hassan FSaltz, JoelPrasanna, PrateekShrestha, SampurnaMani, Kartik MPayne, DavidKurc, TahsinPelaez, EnriqueFranco-Maldonado, HeydyLoayza, FrancisQuevedo, SebastianGuevara, PamelaTorche, EstebanMendoza, CristobalVera, FrancoRíos, ElvisLópez, EduardoVelastin, Sergio AOgbole, GodwinOyekunle, DotunOdafe-Oyibotha, OlubunmiOsobu, BabatundeShu'aibu, MustaphaDorcas, AdeleyeSoneye, MayowaDako, FaroukSimpson, Amber LHamghalam, MohammadPeoples, Jacob JHu, RickyTran, AnhCutler, DanielleMoraes, Fabio YBoss, Michael AGimpel, JamesVeettil, Deepak KattilSchmidt, KendallBialecki, BrianMarella, SailajaPrice, CynthiaCimino, LisaApgar, CharlesShah, PrashantMenze, BjoernBarnholtz-Sloan, Jill SMartin, JasonBakas, Spyridon
Source
Subject
Computer Science - Machine Learning
Electrical Engineering and Systems Science - Image and Video Processing
Language
Abstract
Although machine learning (ML) has shown promise in numerous domains, there are concerns about generalizability to out-of-sample data. This is currently addressed by centrally sharing ample, and importantly diverse, data from multiple sites. However, such centralization is challenging to scale (or even not feasible) due to various limitations. Federated ML (FL) provides an alternative to train accurate and generalizable ML models, by only sharing numerical model updates. Here we present findings from the largest FL study to-date, involving data from 71 healthcare institutions across 6 continents, to generate an automatic tumor boundary detector for the rare disease of glioblastoma, utilizing the largest dataset of such patients ever used in the literature (25,256 MRI scans from 6,314 patients). We demonstrate a 33% improvement over a publicly trained model to delineate the surgically targetable tumor, and 23% improvement over the tumor's entire extent. We anticipate our study to: 1) enable more studies in healthcare informed by large and diverse data, ensuring meaningful results for rare diseases and underrepresented populations, 2) facilitate further quantitative analyses for glioblastoma via performance optimization of our consensus model for eventual public release, and 3) demonstrate the effectiveness of FL at such scale and task complexity as a paradigm shift for multi-site collaborations, alleviating the need for data sharing.
Comment: federated learning, deep learning, convolutional neural network, segmentation, brain tumor, glioma, glioblastoma, FeTS, BraTS