PROPER: Comprehensive power evaluation for differential expression using RNA-seq

Hao Wu, Chi Wang, Zhijin Wu

Research output: Contribution to journalArticlepeer-review

73 Scopus citations

Abstract

Motivation: RNA-seq has become a routine technique in differential expression (DE) identification. Scientists face a number of experimental design decisions, including the sample size. The power for detecting differential expression is affected by several factors, including the fraction of DE genes, distribution of the magnitude of DE, distribution of gene expression level, sequencing coverage and the choice of type I error control. The complexity and flexibility of RNA-seq experiments, the high-throughput nature of transcriptome-wide expression measurements and the unique characteristics of RNA-seq data make the power assessment particularly challenging. Results: We propose prospective power assessment instead of a direct sample size calculation by making assumptions on all of these factors. Our power assessment tool includes two components: (i) a semi-parametric simulation that generates data based on actual RNAseq experiments with flexible choices on baseline expressions, biological variations and patterns of DE; and (ii) a power assessment component that provides a comprehensive view of power. We introduce the concepts of stratified power and false discovery cost, and demonstrate the usefulness of our method in experimental design (such as sample size and sequencing depth), as well as analysis plan (gene filtering).

Original languageEnglish
Pages (from-to)233-241
Number of pages9
JournalBioinformatics
Volume31
Issue number2
DOIs
StatePublished - Jan 15 2015

Bibliographical note

Publisher Copyright:
© The Author 2014. Published by Oxford University Press. All rights reserved.

Funding

FundersFunder number
National Center for Advancing Translational Sciences (NCATS)
National Institutes of Health (NIH)R01 GM067862
National Science Foundation (NSF)DBI1054905
National Center for Advancing Translational Sciences (NCATS)UL1TR000454

    ASJC Scopus subject areas

    • Statistics and Probability
    • Biochemistry
    • Molecular Biology
    • Computer Science Applications
    • Computational Theory and Mathematics
    • Computational Mathematics

    Fingerprint

    Dive into the research topics of 'PROPER: Comprehensive power evaluation for differential expression using RNA-seq'. Together they form a unique fingerprint.

    Cite this