TY - JOUR
T1 - Statistical implications of pooling RNA samples for microarray experiments
AU - Peng, Xuejun
AU - Wood, Constance L.
AU - Blalock, Eric M.
AU - Chen, Kuey Chu
AU - Landfield, Philip W.
AU - Stromberg, Arnold J.
PY - 2003/6/24
Y1 - 2003/6/24
N2 - Background: Microarray technology has become a very important tool for studying gene expression profiles under various conditions. Biologists often pool RNA samples extracted from different subjects onto a single microarray chip to help defray the cost of microarray experiments as well as to correct for the technical difficulty in getting sufficient RNA from a single subject. However, the statistical, technical and financial implications of pooling have not been explicitly investigated. Results: Modeling the resulting gene expression from sample pooling as a mixture of individual responses, we derived expressions for the experimental error and provided both upper and lower bounds for its value in terms of the variability among individuals and the number of RNA samples pooled. Using "virtual" pooling of data from real experiments and computer simulations, we investigated the statistical properties of RNA sample pooling. Our study reveals that pooling biological samples appropriately is statistically valid and efficient for microarray experiments. Furthermore, optimal pooling design(s) can be found to meet statistical requirements while minimizing total cost. Conclusions: Appropriate RNA pooling can provide equivalent power and improve efficiency and cost-effectiveness for microarray experiments with a modest increase in total number of subjects. Pooling schemes in terms of replicates of subjects and arrays can be compared before experiments are conducted.
AB - Background: Microarray technology has become a very important tool for studying gene expression profiles under various conditions. Biologists often pool RNA samples extracted from different subjects onto a single microarray chip to help defray the cost of microarray experiments as well as to correct for the technical difficulty in getting sufficient RNA from a single subject. However, the statistical, technical and financial implications of pooling have not been explicitly investigated. Results: Modeling the resulting gene expression from sample pooling as a mixture of individual responses, we derived expressions for the experimental error and provided both upper and lower bounds for its value in terms of the variability among individuals and the number of RNA samples pooled. Using "virtual" pooling of data from real experiments and computer simulations, we investigated the statistical properties of RNA sample pooling. Our study reveals that pooling biological samples appropriately is statistically valid and efficient for microarray experiments. Furthermore, optimal pooling design(s) can be found to meet statistical requirements while minimizing total cost. Conclusions: Appropriate RNA pooling can provide equivalent power and improve efficiency and cost-effectiveness for microarray experiments with a modest increase in total number of subjects. Pooling schemes in terms of replicates of subjects and arrays can be compared before experiments are conducted.
KW - Microarray experiment design
KW - Pooling
KW - Power
KW - Replicates
KW - Sample size estimation
UR - http://www.scopus.com/inward/record.url?scp=0347621463&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0347621463&partnerID=8YFLogxK
U2 - 10.1186/1471-2105-4-26
DO - 10.1186/1471-2105-4-26
M3 - Article
C2 - 12823867
AN - SCOPUS:0347621463
SN - 1471-2105
VL - 4
JO - BMC Bioinformatics
JF - BMC Bioinformatics
M1 - 26
ER -