two-color microarrays: reference designs and reference rnas

27
Two-Color Microarrays: Two-Color Microarrays: Reference Designs and Reference Designs and Reference RNAs. Reference RNAs. Kathleen Kerr Kathleen Kerr Department of Biostatistics Department of Biostatistics University of Washington University of Washington Collaborators: Collaborators: Kyle Serikawa, Kyle Serikawa, Mette Mette Peters, Peters, Caimiao Wei, Caimiao Wei, Roger Bumgarner Roger Bumgarner

Upload: phila

Post on 11-Jan-2016

47 views

Category:

Documents


2 download

DESCRIPTION

Two-Color Microarrays: Reference Designs and Reference RNAs. Kathleen Kerr Department of Biostatistics University of Washington Collaborators: Kyle Serikawa, Mette Peters, Caimiao Wei, Roger Bumgarner. “ Reference Design”. “ Loop Design”. Advantages: Reference Design. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Two-Color Microarrays:  Reference Designs and Reference RNAs

Two-Color Microarrays: Reference Two-Color Microarrays: Reference Designs and Reference RNAs.Designs and Reference RNAs.

Kathleen KerrKathleen Kerr

Department of BiostatisticsDepartment of Biostatistics

University of WashingtonUniversity of Washington

Collaborators:Collaborators: Kyle Serikawa, Kyle Serikawa, Mette Peters, Mette Peters, Caimiao Wei, Caimiao Wei, Roger BumgarnerRoger Bumgarner

Page 2: Two-Color Microarrays:  Reference Designs and Reference RNAs

“Reference Design”

“Loop Design”

Page 3: Two-Color Microarrays:  Reference Designs and Reference RNAs

Advantages: Reference Design

• Simple; easy to execute

• (Relatively) easy to analyze– If a “tonsil” RNA is used as the reference RNA in

a reference design, then measurements on other RNAs can be considered to be measured in “tonsil” units

Page 4: Two-Color Microarrays:  Reference Designs and Reference RNAs

What goes here?

Page 5: Two-Color Microarrays:  Reference Designs and Reference RNAs

Some previous work on reference RNAs

• Gorreta et al, Biotechniques, 2004.

• He et al, Biotechniques, 2004.

• Novoradovskaya et al, BMC Genomics, 2004.

• All assert that a “good” reference RNA gives strong signal for all the genes on the array: most genes expressed “above background” in the reference.

Page 6: Two-Color Microarrays:  Reference Designs and Reference RNAs

• This assertion is based on the conventional wisdom that signals “near background” are unreliable

• (“Unreliable” may overstate the case)

• Consider: the popular methods of data normalization all assume that co-hybridized RNAs are “not too different”

Page 7: Two-Color Microarrays:  Reference Designs and Reference RNAs

Method Validation

• Ideally, methods should be evaluated in terms of how well they answer a scientific question of interest.– Analogous to using clinically relevant endpoints

in clinical trials rather than surrogates.

• The “proportion of spots above background” does not satisfy this ideal.

Page 8: Two-Color Microarrays:  Reference Designs and Reference RNAs

How to validate reference RNAs?

• A better (though still not ideal) criterion is to evaluate whether a comparison of RNAs made through a reference matches the comparison that would have been achieved through direct comparison.

• If the estimates agree well, then the results are not “reference-specific”

Page 9: Two-Color Microarrays:  Reference Designs and Reference RNAs

Experimental Design

Page 10: Two-Color Microarrays:  Reference Designs and Reference RNAs

3 “Test” RNA pairs:1. Placenta

Assumed to be most similar to placenta

reference2. Kidney

A component of commercial reference

3. LungNot a component of

commercial reference

Page 11: Two-Color Microarrays:  Reference Designs and Reference RNAs

3 “Test” RNA pairs:1. Placenta2. Kidney3. Lung

Predictions1. Placenta reference will

work best for the placenta test pair

2. Commercial reference will work better for the kidney test pair than the lung test pair

3. Pool reference will work well overall

Page 12: Two-Color Microarrays:  Reference Designs and Reference RNAs
Page 13: Two-Color Microarrays:  Reference Designs and Reference RNAs
Page 14: Two-Color Microarrays:  Reference Designs and Reference RNAs
Page 15: Two-Color Microarrays:  Reference Designs and Reference RNAs

3 “Test” RNA pairs:1. Placenta2. Kidney3. Lung

How did we do on our predictions?

Predictions1. Placenta reference will work best for placenta test pair2. Commercial reference will work better for the kidney test

pair than the lung test pair3. Pool reference will work well overall

Predictions 1 & 3 were born out; prediction 2 was not.

However, the main result was that choice of reference RNA did not matter as much as we thought.

3 Reference RNAs:1. Placenta2. Commercial3. Pool

Page 16: Two-Color Microarrays:  Reference Designs and Reference RNAs

Compare: data with background subtraction

Page 17: Two-Color Microarrays:  Reference Designs and Reference RNAs

Compare: data without intensity-based normalization

Page 18: Two-Color Microarrays:  Reference Designs and Reference RNAs

Compare: data with background subtraction,No intensity-based normalization

Page 19: Two-Color Microarrays:  Reference Designs and Reference RNAs

Concordance for Low-intensity Genes

Page 20: Two-Color Microarrays:  Reference Designs and Reference RNAs

-Indirect and direct log-ratios are in reasonable agreement for the vast majority of these genes-Some low-intensity genes are reproducibly measured as differentially expressed-Most “highly discrepant genes” are NOT picked out by flagging low-intensity genes

Conclusion: discarding data from low-intensity spots is a very crude filter

Page 21: Two-Color Microarrays:  Reference Designs and Reference RNAs

*Moving average for low intensity-genes only*Moving average for all genes

Conclusion: Measurements on low-intensity genes are less reliable but not

unreliable

Page 22: Two-Color Microarrays:  Reference Designs and Reference RNAs

Is this a reasonable way to evaluate reference RNAs?

• Concordance between reference-based log-ratios and direct log-ratios means that the comparison through the reference is not “reference-specific”

Page 23: Two-Color Microarrays:  Reference Designs and Reference RNAs

• The evaluation is based on a kind of reproducibility, which is not the same as accuracy. However:– A much stronger kind of reproducibility than just

reproducibility among technical replicates.– Though not sufficient, good reproducibility is necessary

for low error– The best kind of “accuracy” with microarray

measurements is an open issue.• Opinion: For most intents and purposes, a low-variance, biased

estimate of the log-ratio is preferable to a high-variance, unbiased estimate

• The results here suggest that, on average, microarrays give an estimate of log-ratios that are proportional to the true log-ratios

– Shi et al, BMC Bioinformatics, 2005

Page 24: Two-Color Microarrays:  Reference Designs and Reference RNAs
Page 25: Two-Color Microarrays:  Reference Designs and Reference RNAs
Page 26: Two-Color Microarrays:  Reference Designs and Reference RNAs

Summary/Conclusions

• To date, evaluations of reference RNAs – Have used criteria distant from scientific objectives– Have not accounted for the assumptions invoked in the

popular methods of data normalization

• We found results to be robust to choice of reference

• Is there such a thing as a “universal” reference?• The concordance between our indirect and direct

logratios is encouraging, but the “linearity” issue needs more attention

Page 27: Two-Color Microarrays:  Reference Designs and Reference RNAs

Kerr KF, Serikawa, Wei, Peters, Bumgarner (2007). What is the best reference RNA? And other questions regarding the design and analysis of two-color microarray experiments.OMICS 11:152-65.

(Pre-print at www.bepress.com/uwbiostat )

Many Thanks to the Conference Organizers!