학술논문

framework for group-wise summarization and comparison of chromatin state annotations.
Document Type
Article
Source
Bioinformatics. Jan2023, Vol. 39 Issue 1, p1-9. 9p.
Subject
*GENE regulatory networks
*ANNOTATIONS
*SOURCE code
*LOGISTIC regression analysis
Language
ISSN
1367-4803
Abstract
Motivation Genome-wide maps of epigenetic modifications are powerful resources for non-coding genome annotation. Maps of multiple epigenetics marks have been integrated into cell or tissue type-specific chromatin state annotations for many cell or tissue types. With the increasing availability of multiple chromatin state maps for biologically similar samples, there is a need for methods that can effectively summarize the information about chromatin state annotations within groups of samples and identify differences across groups of samples at a high resolution. Results We developed CSREP, which takes as input chromatin state annotations for a group of samples. CSREP then probabilistically estimates the state at each genomic position and derives a representative chromatin state map for the group. CSREP uses an ensemble of multi-class logistic regression classifiers that predict the chromatin state assignment of each sample given the state maps from all other samples. The difference in CSREP's probability assignments for the two groups can be used to identify genomic locations with differential chromatin state assignments. Using groups of chromatin state maps of a diverse set of cell and tissue types, we demonstrate the advantages of using CSREP to summarize chromatin state maps and identify biologically relevant differences between groups at a high resolution. Availability and implementation The CSREP source code and generated data are available at http://github.com/ernstlab/csrep. Supplementary information Supplementary data are available at Bioinformatics online. [ABSTRACT FROM AUTHOR]