학술논문

Machine Learning Characterization of COPD Subtypes Insights From the COPDGene Study
Document Type
article
Author
Castaldi, Peter JBoueiz, AdelYun, JeongSan Jose Estepar, RaulRoss, James CWashko, GeorgeCho, Michael HHersh, Craig PKinney, Gregory LYoung, Kendra ARegan, Elizabeth ALynch, David ACriner, Gerald JDy, Jennifer GRennard, Stephen ICasaburi, RichardMake, Barry JCrapo, JamesSilverman, Edwin KHokanson, John ECrapo, James DBeaty, TerriBegum, FerdouseCho, MichaelDeMeo, Dawn LBoueiz, Adel RForeman, Marilyn GHalper-Stromberg, EitanHayden, Lystra PHetmanski, JacquelineHobbs, Brian DLaird, NanLange, ChristophLutz, Sharon MMcDonald, Merry-LynnParker, Margaret MProkopenko, DmitryQiao, DandiRegan, ElizabethSakornsakolpat, PhuwanatWan, Emily SWon, SunghoCenteno, Juan PabloCharbonnier, Jean-PaulCoxson, Harvey OGalban, Craig JHan, MeiLan KHoffman, Eric AHumphries, StephenJacobson, Francine LJudy, Philip FKazerooni, Ella AKluiber, AlexNardelli, PietroNewell, John DNotary, AleenaOh, AndreaSchroeder, JoyceSieren, JeredStoel, Berend CTschirren, JuergVan Beek, Edwinvan Ginneken, Bramvan Rikxoort, EvaSanchez-Ferrero, Gonzalo VegasVeitel, LucasWashko, George RWilson, Carla GJensen, RobertEverett, DouglasCrooks, JimPratte, KatherineStrand, MattKinney, GregoryBhatt, Surya PBon, JessicaDiaz, Alejandro AMake, BarryMurray, SusanSoler, XavierBowler, Russell PKechris, Katerina
Source
CHEST Journal. 157(5)
Subject
Biomedical and Clinical Sciences
Cardiovascular Medicine and Haematology
Clinical Sciences
Genetics
Chronic Obstructive Pulmonary Disease
Lung
2.1 Biological and endogenous factors
Aetiology
Respiratory
Cluster Analysis
Diagnostic Imaging
Disease Progression
Genetic Predisposition to Disease
Genome-Wide Association Study
Humans
Machine Learning
Molecular Epidemiology
Phenotype
Pulmonary Disease
Chronic Obstructive
Respiratory Function Tests
COPD
emphysema
machine learning
COPDGene Investigators
Respiratory System
Cardiovascular medicine and haematology
Clinical sciences
Language
Abstract
COPD is a heterogeneous syndrome. Many COPD subtypes have been proposed, but there is not yet consensus on how many COPD subtypes there are and how they should be defined. The COPD Genetic Epidemiology Study (COPDGene), which has generated 10-year longitudinal chest imaging, spirometry, and molecular data, is a rich resource for relating COPD phenotypes to underlying genetic and molecular mechanisms. In this article, we place COPDGene clustering studies in context with other highly cited COPD clustering studies, and summarize the main COPD subtype findings from COPDGene. First, most manifestations of COPD occur along a continuum, which explains why continuous aspects of COPD or disease axes may be more accurate and reproducible than subtypes identified through clustering methods. Second, continuous COPD-related measures can be used to create subgroups through the use of predictive models to define cut-points, and we review COPDGene research on blood eosinophil count thresholds as a specific example. Third, COPD phenotypes identified or prioritized through machine learning methods have led to novel biological discoveries, including novel emphysema genetic risk variants and systemic inflammatory subtypes of COPD. Fourth, trajectory-based COPD subtyping captures differences in the longitudinal evolution of COPD, addressing a major limitation of clustering analyses that are confounded by disease severity. Ongoing longitudinal characterization of subjects in COPDGene will provide useful insights about the relationship between lung imaging parameters, molecular markers, and COPD progression that will enable the identification of subtypes based on underlying disease processes and distinct patterns of disease progression, with the potential to improve the clinical relevance and reproducibility of COPD subtypes.