학술논문

CIRCUST: A novel methodology for temporal order reconstruction of molecular rhythms; validation and application towards a daily rhythm gene expression atlas in humans.
Document Type
Article
Source
PLoS Computational Biology. 9/28/2023, Vol. 19 Issue 9, p1-22. 22p. 1 Illustration, 1 Chart, 9 Graphs.
Subject
*GENE expression
*MOLECULAR clock
*RHYTHM
*CLOCK genes
*CHRONOBIOLOGY
*CIRCADIAN rhythms
Language
ISSN
1553-734X
Abstract
The circadian system drives near-24-h oscillations in behaviors and biological processes. The underlying core molecular clock regulates the expression of other genes, and it has been shown that the expression of more than 50 percent of genes in mammals displays 24-h rhythmic patterns, with the specific genes that cycle varying from one tissue to another. Determining rhythmic gene expression patterns in human tissues sampled as single timepoints has several challenges, including the reconstruction of temporal order of highly noisy data. Previous methodologies have attempted to address these challenges in one or a small number of tissues for which clock gene evolutionary conservation is assumed to be preserved. Here we introduce CIRCUST, a novel CIRCular-robUST methodology for analyzing molecular rhythms, that relies on circular statistics, is robust against noise, and requires fewer assumptions than existing methodologies. Next, we validated the method against four controlled experiments in which sampling times were known, and finally, CIRCUST was applied to 34 tissues from the Genotype-Tissue Expression (GTEx) dataset with the aim towards building a comprehensive daily rhythm gene expression atlas in humans. The validation and application shown here indicate that CIRCUST provides a flexible framework to formulate and solve the issues related to the analysis of molecular rhythms in human tissues. CIRCUST methodology is publicly available at https://github.com/yolandalago/CIRCUST/. Author summary: Rhythmic gene expressions determine tissue-specific functional activity regulating processes such as metabolism, endocrine function, and immune function. Expression patterns of rhythmic genes usually display oscillatory shapes with the timing of the peak closely related to the organ's function. The knowledge of these rhythmic expression patterns may be important in the prevention, diagnosis, and treatment of disease. Yet, human molecular rhythm analysis usually relies on post-mortem samples collected from many people because repeated biopsies within an individual across multiple tissues are impractical. Because the biological time in each donor generally is unknown, the sample order and temporal direction need to be reconstructed. This paper describes and validates CIRCUST, a robust, extensible, and open-source statistical framework to address, separately across organs, the molecular analysis of human rhythms with unknown sampling times; see yolandalago/CIRCUST (github.com). In particular, CIRCUST's application to GTEx, a post-mortem human gene expression database, provides the largest human rhythmic gene expression atlas to date. [ABSTRACT FROM AUTHOR]