Academic Journal

A comparison of methods for multiple degree of freedom testing in repeated measures RNA-sequencing experiments.
Document Type
Journal Article
Source
BMC Medical Research Methodology. 5/28/2022, Vol. 22 Issue 1, p1-17. 17p.
Subject
*DEGREES of freedom
*RNA sequencing
*MULTIPLE comparisons (Statistics)
*FALSE discovery rate
*INTENSIVE care patients
*EXPERIMENTAL design
*SAMPLE size (Statistics)
*SEQUENCE analysis
*RNA
*RESEARCH funding
*LONGITUDINAL method
Language
ISSN
1471-2288
Abstract
Background: As the cost of RNA-sequencing decreases, complex study designs, including paired, longitudinal, and other correlated designs, become increasingly feasible. These studies often include multiple hypotheses and thus multiple degree of freedom tests, or tests that evaluate multiple hypotheses jointly, are often useful for filtering the gene list to a set of interesting features for further exploration while controlling the false discovery rate. Though there are several methods which have been proposed for analyzing correlated RNA-sequencing data, there has been little research evaluating and comparing the performance of multiple degree of freedom tests across methods.
Methods: We evaluated 11 different methods for modelling correlated RNA-sequencing data by performing a simulation study to compare the false discovery rate, power, and model convergence rate across several hypothesis tests and sample size scenarios. We also applied each method to a real longitudinal RNA-sequencing dataset.
Results: Linear mixed modelling using transformed data had the best false discovery rate control while maintaining relatively high power. However, this method had high model non-convergence, particularly at small sample sizes. No method had high power at the lowest sample size. We found a mix of conservative and anti-conservative behavior across the other methods, which was influenced by the sample size and the hypothesis being evaluated. The patterns observed in the simulation study were largely replicated in the analysis of a longitudinal study including data from intensive care unit patients experiencing cardiogenic or septic shock.
Conclusions: Multiple degree of freedom testing is a valuable tool in longitudinal and other correlated RNA-sequencing experiments. Of the methods that we investigated, linear mixed modelling had the best overall combination of power and false discovery rate control. Other methods may also be appropriate in some scenarios. [ABSTRACT FROM AUTHOR]