Back to Microarray Analysis Homepage

Tips for Microarray Experimental Design

   (For UC Investigators Only!   If you have any doubts about statistical issues regarding your experimental design, feel free to contact us at 558-6781 or 558-8564.)


Considerations for your Microarray Experimental Design ( two-channel arrays):

  1. If you are comparing 4 or more treatments, tissue types, or genotypes, you may want to use the universal reference design- see example below (Note: Due to dye effects, estimates of change cannot be made between each of the treatments vs. reference!!!)
  2. If you are comparing 3 or less treatments, tissue types, or genotypes, you will probably want to use a flipped dye design- see example below
  3. For experiments involving multiple types of factors (e.g. time points, treatments, and tissue type) you will want to do flipped dyes between the comparisons of greatest importance.  A time course example is shown below.
  4. Replicates are important!!!  Fewer replicates will identify a smaller percentage of the differentially expressed genes.  In addition, biological replicates are more important than technical replicates (see description below), especially when working with individual organisms rather than tissue cultures.
  5. When deciding how to spread the wealth, decide which is more preferrable:

    a) More accurate results comparing a few factors, or

    b) Less accurate results comparing many factors

  6. If possible, do RT-PCR or another type of validation (with the same RNA used for the microarrays) for a gene(s) known to be differentially expressed.

 

 

Comparison of 2 experimental designs involving 3 treatments

 

Boxes = Treatments/factors;  Arrows=microarrays; 

Arrowheads point to cy5 (red)

Universal Reference Design in duplicate (triplicate would be preferred)
  • treatments are only measured twice each
  • Reference sample, which is not of interest, is measured 6 times
  • simpler for visualizing in Genespring, and simpler analysis
  • Recommended for comparing a large number of tissue types, etc.
Flipped Dye Design in duplicate (quadruplicate would be preferred)
  • treatments are each measured 4 times (compared to 2 for U.R.)
  • not as simple for visualization due to the lack of a common denominator
  • optimal statistical design for detecting differential expression
  • Recommended for comparing a small number of sample types

Time Course Example

  • Each timept. is done in duplicate (triplicate or quadruplicate would be preferred) with control
  • Extra flipped dye arrays may be added comparing timepoints of greater interest (e.g. dotted arrows between 24 & 48 hours)

 

Biological vs. Technical Replicates, and Pooling

  • Biological replicates are arrays that use RNA samples from different individual organisms, pools of organisms or flasks of cells, but yet compare the same treatments or control/treatment combination.

  • Technical replicates are arrays that use the same RNA samples and also the same treatments.  Thus the only differences in measurements are due to technical differences in array processing.

  • It is highly recommended that more biological replicates are done than technical replicates, especially when dealing with individual organisms rather than a cell line.

  • Pooling all samples together is NOT recommended.

  • If the degree of variation between biological replicates is thought to be very high, or not enough RNA can be collected from just one individual, then pooling samples may be beneficial.  It is still important to have more than one independent pool of samples in order to estimate biological variance.  Thus, the sample pools may be thought of as individuals when designing your experiment.

 

The role of experimental design in the removal of technical variance

Optimizing your design based on the experimental goal is an important part of a successful microarray experiment.  In fact, the importance of pre-planning cannot be stressed enough.  One question you may want to ask before designing your experiment is how much power you wish to have to detect differentially expressed genes with a ratio greater or equal to x, and how high of a false positive rate (FDR) you are willing to allow.  This will determine the number of replicates you use.  For example, (Wolfinger et al.) found that in order to have 85% power to detect a 2-fold change with a 1:20,000 false positive rate, seven replicates were needed.  This number could change drastically depending on the type of array technology (single or dual channel), quality (reproducibility) of the arrays, the number of genes on each array, and the chosen false positive rate.  Another question you want to ask is what are the most important samples, or comparisons you want to make, and how many experimental factors will be involved?  For single-channel array experiments, it is obvious that more replicates should be done for samples of greater importance.  For dual-channel array experiments, the many possible choices for designs make for a more complex problem. Depending on your answer to the above questions, you will choose from one of two main types of experimental designs- the universal reference design or the flipped dye design.  The flipped dye design is more efficient for simple designs involving few factors, or for designs where one important factor of interest will be compared to many other factors, such as in a time series.  Circular designs are complex versions of the flipped dye design and will be discussed only briefly at the end of this section.  The universal reference design may be more appropriate for designs involving many factors of equal importance, such as comparing the expression profiles of a large number of tissue types, or for experiments that will likely be part of a larger meta-analysis in the future.   In this section we will concentrate on the issues of experimental design as they relate to normalization and the removal of biases. 

The rationale for the flipped dye design is that it allows for the estimation and removal of gene specific dye effects.  These dye effects have been shown to be reproducible across independent arrays by the use of Control vs. Control arrays.  Any deviation from a ratio of 1 in these arrays is due to either dye effect or residual error.  Figure x shows ratio estimates for two genes on six arrays (3 sets of 2 replicates) using three different measurements: the raw data, the estimated ratio after dye effect has been removed by the use of control-vs-control arrays, and the ratio after dye effect has been removed by the flipped dye method (described below).  As can be seen, the extra control array estimates of dye effect are very close to the effect estimated by the arrays themselves.  The simplest flipped dye design consists of merely two arrays: one array with sample A labeled with cy3 and sample B with cy5, and the other array with sample B labeled with cy3 and sample A with cy5.  This allows gene-specific dye effects to be averaged out.  The best estimate of the log transformed treatment effect for any gene x in this experiment would simply be the average of the ratios,

[log(A1/B1) + log(A2/B2)] / 2 = [log(cy3/cy5) - log(cy3/cy5)] / 2,

and so the dye effects are averaged out.  With more replicates, it is possible to obtain both an estimate of the dye effect, and a measurement of variance, or confidence, in your result.  For any balanced design, where each sample is labeled with cy3 and cy5 an equal number of times, the ratio estimates may be calculated as averages, and the dye effects will be removed.  For arrays done in triplicate, a simple average would weight one dye more than the other skewing the results; instead, the effects of dye could be removed using an ANOVA model with dye included as a factor, or as an alternative, table 1 illustrates how dye effects could be normalized by hand.  It is the same process as above with the added step of taking the average of the replicate ratios prior to averaging over the dye flipped estimates.  Dye normalization greatly improves the reproducibility of replicates per gene. 

Including flipped dye arrays in an experiment is one requirement for having a statistically optimal design, which balances all the factors in the experiment- arrays, dyes, and treatments.  (Here, treatments denote any other factor of interest, whether it is a toxin, mouse strain, tissue type, or age group.)  Balanced designs do not confound any two experimental factors, meaning that the effect of each factor can be estimated and normalized out of the data.  When factors are confounded with each other, their effects are indistinguishable.  For example, an experiment done in triplicate without any flipped dye arrays will have treatment completely confounded with dye effects, because each sample is only labeled with one dye each, and there is no way to separate their effects.  Having confounding factors in your experiment is something to avoid, unless neither of the confounded factors are of interest.  This leads us to the universal reference design, where dye is indeed confounded with treatment, but in this case all the treatments of interest are labeled with the same dye, so the dye effects with the reference can be divided out. 

-This page was last updated on 03/09/2004 by Maureen Sartor.

Department of Environmental Health University of Cincinnati
PO Box 670056 Cincinnati, OH 45267-0056   

© Copyright 2002 University of Cincinnati