학술논문

Comprehensive tissue deconvolution of cell-free DNA by deep learning for disease diagnosis and monitoring
Document Type
article
Source
Proceedings of the National Academy of Sciences of the United States of America. 120(28)
Subject
Human Genome
Bioengineering
Genetics
Detection
screening and diagnosis
4.1 Discovery and preclinical testing of markers and technologies
Generic health relevance
Humans
Cell-Free Nucleic Acids
Deep Learning
DNA Methylation
Biomarkers
Promoter Regions
Genetic
Biomarkers
Tumor
cell-free DNA
DNA methylation
tissue deconvolution
disease diagnosis
disease monitoring
Language
Abstract
Plasma cell-free DNA (cfDNA) is a noninvasive biomarker for cell death of all organs. Deciphering the tissue origin of cfDNA can reveal abnormal cell death because of diseases, which has great clinical potential in disease detection and monitoring. Despite the great promise, the sensitive and accurate quantification of tissue-derived cfDNA remains challenging to existing methods due to the limited characterization of tissue methylation and the reliance on unsupervised methods. To fully exploit the clinical potential of tissue-derived cfDNA, here we present one of the largest comprehensive and high-resolution methylation atlas based on 521 noncancer tissue samples spanning 29 major types of human tissues. We systematically identified fragment-level tissue-specific methylation patterns and extensively validated them in orthogonal datasets. Based on the rich tissue methylation atlas, we develop the first supervised tissue deconvolution approach, a deep-learning-powered model, cfSort, for sensitive and accurate tissue deconvolution in cfDNA. On the benchmarking data, cfSort showed superior sensitivity and accuracy compared to the existing methods. We further demonstrated the clinical utilities of cfSort with two potential applications: aiding disease diagnosis and monitoring treatment side effects. The tissue-derived cfDNA fraction estimated from cfSort reflected the clinical outcomes of the patients. In summary, the tissue methylation atlas and cfSort enhanced the performance of tissue deconvolution in cfDNA, thus facilitating cfDNA-based disease detection and longitudinal treatment monitoring.