extracting quantitative information from proteomic 2-d gels lecture in the bioinformatics course...

31
Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John Gustafsson Mathematical Statistics Chalmers

Upload: cecily-mclaughlin

Post on 05-Jan-2016

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Extracting quantitative information from proteomic 2-D gels

Lecture in the bioinformatics course ”Gene expression and cell models”

April 20, 2005

John Gustafsson

Mathematical Statistics

Chalmers

Page 2: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Proteomics lectures:starting points

• Anders’ starting point this Monday:– Let’s say that we want to study life at the protein

level – what technologies do we have at hand?

• Today’s lecture:– How can we get (large-scale) quantitative

measurements of protein amounts? So that we can do statistics and bioinformatics

Page 3: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

• Proteomics• The 2-D gel technology• Extracting quantitative information

– Image analysis of 2-D gels• Comparison with microarrays• Statistic analysis of quantitative 2-D gel data

Content and structure

Page 4: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

ProteomicsDNA

mRNA

ProductionModification Degradation

Localisation

Interaction

ACTIVITY

P

TDP

Co-factors

2-D gels

Page 5: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

2-D gel electrophoresis: Protein separation and quantification

”protein soup”

spot volume protein quantity

mo

lecu

lar

size

molecular charge

acidic alkaline

sma

llla

rge

Page 6: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

A typical 2-D gel experiment

statistical analysis

conclusions

protein extracts

biological experimentcontrol treatment

2-D gel images

2-D gel electrophoresis

quantified data

image analysis

25211511

225221215211

125121115111

mmmm zzzz

zzzz

zzzz

matrix with

spot volume data

rows: proteins (many)

columns: gels (few)

experimental design

Example:

Page 7: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

The image analysis task

• The task1. In each gel image: Find and quantify the protein

spots

2. In the group of gel images: Match protein spots in different images that correspond to the same protein

• Issues– automation– time

Page 8: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Pseudo-color superposition 1(3)0M NaCl 1M NaCl

Page 9: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Pseudo-color superposition 2(3)OM NaCl 1M NaCl

Page 10: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Pseudo-color superposition 3(3)(red: 0M NaCl, blue: 1M NaCl)

Page 11: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

The standard solution – workflow

In each gel image1. Background subtraction

2. Spot detection

3. Spot quantification

In the group of gel images4. Spot pattern matching

Page 12: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

1. Background subtraction

Before After

- =

Page 13: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

2. Spot detection / image segmentation

Page 14: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

3. Spot quantification

spot volume protein quantity

Page 15: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

4. Spot pattern matching

Page 16: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

The typical 2-D gel experiment

statistical analysis

conclusions

protein extracts

biological experimentcontrol treatment

2-D gel images

2-D gel electrophoresis

quantified data

image analysis

25211511

225221215211

125121115111

mmmm zzzz

zzzz

zzzz

matrix with

spot volume data

rows: proteins (many)

columns: gels (few)

experimental design

Example:

Page 17: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Limitations

• Technological– hydrofobic proteins don’t

dissolve– limited pI/size coverage– limited labeling/staining

• Image analytical– Limited global matching

efficiency of automatic algorithms

– Need for time consuming manual guidance

– ”The image analysis bottle-neck”

Page 18: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Limited global matching efficiency

Voss and Haberl (2000)

Page 19: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Incomplete spot detection: Faint spots

Detected

Not detected

Page 20: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Incomplete spot detection:Close spots

Page 21: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

• Proteomics• The 2-D gel technology• Extracting quantitative information

– Image analysis of 2-D gels• Comparison with microarrays• Statistic analysis of quantitative 2-D gel data

Content and structure – revisited

Page 22: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Comparison with microarrays

2-D gels Microarrays

Labeling one channel* one or two-color

Background subtr. yes yes

Spot detection HARD easy

Spot quantitation can be difficult quite easy

Spot matching HARD known

Identification MS or reference atlas known

*) recently also two-color

Page 23: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Variability

normal 1M NaCl

normal 1M NaCl

biol

ogic

al r

eplic

atio

ns

growth condition

Page 24: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Variance versus mean dependence

• A dot in the plot:– the measurement of one

protein

• The quadratic dependence indicates a multiplicative error structure

(2x5 gel set; normal growth condition)

slope=2 variance mean2

Page 25: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Why transform the data?

• A mathematical data transformation can be used to – Make errors more normally distributed– Stabilize variance versus mean dependence

• Then the model on transformed scale is more simple than on original scale

• Simplifies the subsequent analysis

Page 26: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Logarithmic data transformation

• Stabilized variance versus mean dependence after a logarithmic data transformation

(2x5 gel set; normal growth condition)

Page 27: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Statistical analysis of quantitative 2-D gel data

Examples:• Test of differential expression• Cluster analysis

– cluster proteins – cluster cell/tissue samples

• Classification– classify tissue samples (i.e. tumor classes)

Page 28: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

• Proteomics• The 2-D gel technology• Extracting quantitative information

– Image analysis of 2-D gels• Comparison with microarrays• Statistic analysis of quantitative 2-D gel data

Summary

Page 29: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

An alternative approach to the matching problem

• The standard solution– First spot detection– Then matching of point patterns

• An alternative, recent approach– Matching at the pixel level– Computationally heavy

Page 30: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Gel matching at the pixel level

Reference image

Image warping

Original image Aligned image

Page 31: Extracting quantitative information from proteomic 2-D gels Lecture in the bioinformatics course ”Gene expression and cell models” April 20, 2005 John

Future alternatives to quantitative 2-D gels?

• Quantitative masspectrometry• Protein arrays