학술논문

Coherent pathway enrichment estimation by modeling inter-pathway dependencies using regularized regression.
Document Type
Academic Journal
Author
Jablonski KP; Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland.; SIB Swiss Institute of Bioinformatics, Basel 4058, Switzerland.; Beerenwinkel N; Department of Biosystems Science and Engineering, ETH Zurich, Basel 4058, Switzerland.; SIB Swiss Institute of Bioinformatics, Basel 4058, Switzerland.
Source
Publisher: Oxford University Press Country of Publication: England NLM ID: 9808944 Publication Model: Print Cited Medium: Internet ISSN: 1367-4811 (Electronic) Linking ISSN: 13674803 NLM ISO Abbreviation: Bioinformatics Subsets: MEDLINE
Subject
Language
English
Abstract
Motivation: Gene set enrichment methods are a common tool to improve the interpretability of gene lists as obtained, for example, from differential gene expression analyses. They are based on computing whether dysregulated genes are located in certain biological pathways more often than expected by chance. Gene set enrichment tools rely on pre-existing pathway databases such as KEGG, Reactome, or the Gene Ontology. These databases are increasing in size and in the number of redundancies between pathways, which complicates the statistical enrichment computation.
Results: We address this problem and develop a novel gene set enrichment method, called pareg, which is based on a regularized generalized linear model and directly incorporates dependencies between gene sets related to certain biological functions, for example, due to shared genes, in the enrichment computation. We show that pareg is more robust to noise than competing methods. Additionally, we demonstrate the ability of our method to recover known pathways as well as to suggest novel treatment targets in an exploratory analysis using breast cancer samples from TCGA.
Availability and Implementation: pareg is freely available as an R package on Bioconductor (https://bioconductor.org/packages/release/bioc/html/pareg.html) as well as on https://github.com/cbg-ethz/pareg. The GitHub repository also contains the Snakemake workflows needed to reproduce all results presented here.
(© The Author(s) 2023. Published by Oxford University Press.)