qtl studies- past, present and future

45
QTL Studies- Past, Present and Future David Evans

Upload: billy

Post on 16-Jan-2016

40 views

Category:

Documents


0 download

DESCRIPTION

QTL Studies- Past, Present and Future. David Evans. Genetic studies of complex diseases have not met anticipated success. Glazier et al, Science (2002) 298:2345-2349. Korstanje & Pagan (2002) Nat Genet. Korstanje & Pagan (2002) Nat Genet. Linkage. Marker. Gene 1. Linkage disequilibrium. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: QTL Studies- Past, Present and Future

QTL Studies- Past, Present and Future

David Evans

Page 2: QTL Studies- Past, Present and Future

Genetic studies of complex diseases have not met anticipated

success

Glazier et al, Science (2002) 298:2345-2349

Page 3: QTL Studies- Past, Present and Future

Korstanje & Pagan (2002) Nat Genet

Korstanje & Pagan (2002) Nat Genet

Page 4: QTL Studies- Past, Present and Future

BUT…Not much success in mapping complex diseases / traits !

Reasons for Failure?

Complex Phenotype

Commonenvironment

Marker Gene1

Individualenvironment

Polygenicbackground

Gene2

Gene3

Linkage

Linkagedisequilibrium

Mode ofinheritance Linkage

Association

Weiss & Terwilliger (2000) Nat Genet

Page 5: QTL Studies- Past, Present and Future

Type 1 diabetes and Insulin VNTR

Bennett & Todd, Ann Rev Genet, 1996

Alzheimers and ApoE4

Roses, Nature 2000

LD Patterns and Allelic Association

Pattern of LD unpredictable

Extent of common genetic variation unknown

Page 6: QTL Studies- Past, Present and Future

Genome-wide Association?

Risch & Merikangas, Science 1996

Page 7: QTL Studies- Past, Present and Future

Multiple Rare Variant Hypothesis?

GWA assumes that common variants underlie common diseases

Page 8: QTL Studies- Past, Present and Future

Enabling Genome-wide Association Studies

HAPlotype MAP

High throughput genotyping

Large cohorts“ALSPAC”

Page 9: QTL Studies- Past, Present and Future

Wellcome Trust Case Control Consortium

Page 10: QTL Studies- Past, Present and Future

CASES1. Type 1 Diabetes2. Type 2 Diabetes3. Crohn’s Disease4. Coronary Heart Disease5. Hypertension6. Bipolar Disorder7. Rheumatoid Arthritis8. Malaria9. Tuberculosis

10.Ankylosing Spondylitis11.Grave’s Disease12.Breast Cancer13.Multiple Sclerosis

CONTROLS1. UK Controls A (1,500 - 1958 BC)2. UK Controls B (1,500 - NBS)3. Gambian controls (2000)

Wellcome Trust Case-Control ConsortiumGenome-Wide Association Across Major Human Diseases

DESIGNCollaboration amongst 26 UK diseaseinvestigators2000 cases each from 9 diseases1000 cases from 4 diseases

GENOTYPINGAffymetrix 500k SNPsIllumina Human NS_12 SNP chip

Page 11: QTL Studies- Past, Present and Future
Page 12: QTL Studies- Past, Present and Future

Ankylosing Spondylitis

Auto-immune arthritis resulting in fusion of vertebrae

Often associated with psoriasis, IBD and uveitis

Prevalence of 0.4% in Caucasians. More common in men.

Ed Sullivan, Mike Atherton

Page 13: QTL Studies- Past, Present and Future

Ankylosing Spondylitis GWAS

WTCCC (2007) Nat Genet

IL23-R ARTS-1

Page 14: QTL Studies- Past, Present and Future

Bre

ast

ca

Pro

stat

e ca

Co

lore

ctal

ca

Cro

hn

’s D

is

IBD

T1D

T2D

Ob

esit

y

Tri

gly

ceri

des

CA

D

Co

elia

c D

is

Rh

eum

ato

id A

An

kylo

sin

g S

Alz

he

ime

r D

is

Gla

uco

ma

Ast

hm

a

FGFR2 SMAD7

TNRC9

MAPKI3

LSP1

2q35

8q24 8q24

NOD2 IL23R INS 9p21 FTO GCKR 9p21 IL2 PTPN22 ARTS GALP LOXL1 ORMDL

IL21 IL23R6q251

2q36

12q24

18p11

IL2RA

FTO

CDKAL1

IGF2BP2

KIAAA035D HHEX

PPARG

SLC30A8

TCF7L2

KCNJ11

5p13

ATG16L1

BSN

IRGM

PHOX2B

NCF4

NKX2-3

FAM92B

PTPN2

MSMB

KLK3

SLC22A3

NUDT11

3des.

11q13

LMTK2

HNF1B

CTBP2

JAZF1

8q24

WFS1

TCF2

PTPN22

CD25

ERBB3

CTLA4

IFIH1

10q21 Common genes = common etiology?Some in gene deserts

Large relative risks does not = successDrug targets

Successes…

Page 15: QTL Studies- Past, Present and Future

What About Quantitative Traits?

Quantitative genetics theory suggests that quantitative traits are the result of many variants of small effect

The corollary is that very large sample sizes will be needed to detect these variants in UNSELECTED samples

0

1

2

3

1 Gene 3 Genotypes 3 Phenotypes

0

1

2

3

2 Genes 9 Genotypes 5 Phenotypes

01234567

3 Genes 27 Genotypes 7 Phenotypes

0

5

10

15

20

4 Genes 81 Genotypes 9 Phenotypes

Central Limit Theorem Normal Distribution

Unselected samples

Page 16: QTL Studies- Past, Present and Future
Page 17: QTL Studies- Past, Present and Future

FTSO

FTO

FTO produces a moderate signal in WTCCC T2D scan

But, no signal in an American T2D scan…?

WTCCC T2D Scan

-American cases and controls matched on BMI

Page 18: QTL Studies- Past, Present and Future

FTO

Replication is critical !!!

Page 19: QTL Studies- Past, Present and Future

Twin, family and adoption studies suggest that, within a population, 90% of variation in height is due to genetic variation

Dizygotic Twins

Twins separated at birth

Monozygotic Twins

Borjeson, Acta Paed, 1976

Height- The Archetypal Polygenic Trait

Page 20: QTL Studies- Past, Present and Future

GWA of Height

Weedon et al. (in press) Nat Genet

Large numbers are needed to detect QTLs !!!

Collaboration is the name of the game !!!

A- 1914 Cases (WTCCC T2D)

B- 4892 Cases (DGI)

C- 6788 Cases (WTCCC HT)

D- 8668 Cases (WTCCC CAD)

E- 12228 Cases (EPIC)

F- 13665 Cases (WTCCC UKBS)

Significant results

Other loci?

Page 21: QTL Studies- Past, Present and Future

A: 1,900

C: 7,200

E: 12,600 F: 14,000

D: 9,100

B: 5,000

Weedon et al. (in press) Nat Genet

Some real hits sit in the bottom of the distribution

Some hits initially look interesting but then go away

Page 22: QTL Studies- Past, Present and Future

Hedgehog signaling, cell cycle, and extra-cellular matrix genes over-represented

Candidate gene Monogenic Knockout mouse Details*

ZBTB38 - - Transcription factor.

CDK6 - Yes Involved in the control of the cell cycle.

HMGA2 Yes Yes Chromatin architectural factors

GDF5 Yes Yes Involved in bone formation

LCORL - - May act as transcription activator

LOC387103 - - Not known

EFEMP1 Yes - Extra-cellular matrix

C6orf106 - - Not known

PTCH1 Yes Yes Hedgehog signalling

SPAG17 - - Not known

SOCS2 - Yes Regulates cytokine signal transduction

HHIP - - Hedgehog signaling

ZNF678 - - Transcription factor

DLEU7 - - Not known

SCMH1 - Yes Polycomb protein

ADAMTSL3 - - Extra-cellular matrix

IHH Yes Yes Hedgehog signaling

ANAPC13 - - Cell cycle

ACAN Yes Yes Extra-cellular matrix

DYM Yes - Not known

Weedon et al. (in press) Nat Genet

Page 23: QTL Studies- Past, Present and Future

The combined impact of the 20 SNPS with a P < 5 x 10-7

• The 20 SNPs explain only ~3% of the variation of height

• Lots more genes to find – but extremely large numbers needed

Weedon et al. (in press) Nat Genet

Page 24: QTL Studies- Past, Present and Future

Perola et al, Plos Genetics, 2007; data available at http://www.genomeutwin.org; Weedon et al.; unpublished data

Height Linkage Regions

Page 25: QTL Studies- Past, Present and Future

Perola et al, Plos Genetics, 2007; data available at http://www.genomeutwin.org; Weedon et al.; unpublished data

Page 26: QTL Studies- Past, Present and Future

What’s Going On?

BUT…what if linkage analysis and association analysis identify different types of loci?

Areas identified by linkage don’t have significant assocation hits over them

Linkage analysis lacks power?

Loci identified by GWAs don’t have linkage peaks over them

Type I error?

Power?

Page 27: QTL Studies- Past, Present and Future

What next?

InitialGenome

Wide Scans

Functional Studies

Other ethnicgroups

Fine mapping

More genes

CNVs

Transcriptomics

Animal models

Mendelian Randomisation

Genomic ProfilingEpigenetics

Genome-wide Sequencing

Page 28: QTL Studies- Past, Present and Future

Distribution of MAFs in HapMap

Genome-wide panels and HapMap biased towards common variantsCommon variants don’t tag rare variants well

Page 29: QTL Studies- Past, Present and Future

FTO

IL23R

HLA-B27

TCF7L2ARTS1

TCF2

PPARG

WFS1

???

???

???

???

???

???

SEMA5A “Rotten Fruit”

“Low hanging fruit”

“High hanging fruit”

Complex Disease Tree

Page 30: QTL Studies- Past, Present and Future

Methods of gene hunting

Eff

ect

Siz

e

Frequency

rare, monogenic (linkage)

common, complex (association)

?

Page 31: QTL Studies- Past, Present and Future

Evans et al. (2008) EJHG

Genome-wide Sequencing

Sequence individuals’ genomes

Will identify rare variants

But will we have enough power?

Page 32: QTL Studies- Past, Present and Future

Genomic Profiling

BUT…if we consider several predisposing genetic and environmental factors, can we predict disease?

Predictive testing in the case of monogenic diseases has been used for years (1300+ tests available) (e.g. Phenylketonuria)

The idea of using genetic information to inform diagnosis

Not possible in complex diseases as effects of an individual variant is so small

Page 33: QTL Studies- Past, Present and Future

(from Yang et al. 2003 AJHG)

(from Janssens et al. 2004 AJHG)

Genomic Profiling

=> Give up and go home?

Page 34: QTL Studies- Past, Present and Future

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

PRIOR DISEASE PROBABILITY

PO

ST

ER

IOR

PR

OB

AB

ILIT

Y O

F D

ISE

AS

E

D+/B27+

D+/B27-

D+/B27+,ARTS1+,IL23+

D+/B27-/ARTS1-/IL23R-

Prevalence of B27+, ARTS1+,IL23R+ is 2.4%

Prevalence of B27-, ARTS1-, IL23R- is 19%

Ankylosing Spondylitis

(Brown & Evans, in prep)

Page 35: QTL Studies- Past, Present and Future

Using Genetics to Inform Classical Epidemiology

Page 36: QTL Studies- Past, Present and Future

Observational Studies

Obesity

Yes No

CHD 200 100

Control 50 250

Odds of obesity in cases: 200/100 = 2

Odds of obesity in controls: 50/250 = 0.2

Odds Ratio: 2/0.2 = 10

Fanciful claims often made from observational studies

In a case-control study, a group of diseased individuals are recruited (Cases); A group of individuals without disease are gathered (Controls); Both groups are then measured retrospectively on an exposure of interest; A test of association is performed

Example: Obesity (Exposure) and Coronary Heart Disease (Outcome)

Page 37: QTL Studies- Past, Present and Future

Classic limitations to “observational” science

• Confounding

• Reverse Causation

• Bias

Page 38: QTL Studies- Past, Present and Future

Randomized Control Trials

Individuals free from exposure and disease

TreatmentGroup

ControlGroup

Randomization

MeasureOutcome

MeasureOutcome

Randomization controls for confounding

Reverse causation impossible

Gold standard for assessing causality

Page 39: QTL Studies- Past, Present and Future

Mendelian Randomization

Fortunately nature has provided us with a natural randomized control trial !

RCTs not always ethical or possible

Mendel’s law of independent assortment states that inheritance of a trait is independent (randomized) with respect to other traits

Assessing the relationship between genotype, environmental risk factor and disease informs us on causality

Therefore individuals are randomly assigned to three groups based on their genotype (AA, Aa, aa) independent of outcome

Page 40: QTL Studies- Past, Present and Future

GeneticVariant

ModifiableEnvironmental

Exposure

Disease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

If obesity causes CHD then the relationship between FTO and CHD should be estimated by the product of βFTO-Obesity and βObesity-

CHD

βFTO-Obesity

βObesity-CHD

Page 41: QTL Studies- Past, Present and Future

GeneticVariant

ModifiableEnvironmental

Exposure

Disease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

If CHD causes obesity then βFTO-CHD should be zero.

βFTO-Obesity

βObesity-CHD

Page 42: QTL Studies- Past, Present and Future

GeneticVariant

ModifiableEnvironmental

Exposure

Disease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

If the relationship between Obesity and CHD is purely correlational (i.e. due to confounding) then βFTO-CHD should be 0

βFTO-Obesity

Page 43: QTL Studies- Past, Present and Future

GeneticVariant

ModifiableEnvironmental

Exposure

Disease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Mendelian Randomization

Genotype is NOT associated with confounders

Genotype is associated with the environmental exposure of interest

Genotype is only related to its outcome via its association with the modifiable environmental exposure

Page 44: QTL Studies- Past, Present and Future

Mendelian Randomization

Mendelian Randomization is a way of using a genetic variant(s) to make causal inferences about (modifiable) environmental risk factors for disease and health related outcomes

Environmental exposures (e.g. Obesity) can be modified ! Genetic factors cannot (at least for the moment…)

Still a relatively new approach that has problems (i.e. finding genetic proxies for environmental exposures- multiple instruments?)

…but a LOT of scope for development…

Page 45: QTL Studies- Past, Present and Future

GeneticVariant

ModifiableEnvironmental

Exposure

Disease

ConfoundingVariables

(FTO)

(Obesity)

(smoking, diet etc.)

(Coronary Heart Disease)

Could SEM be used to enhance MR?

GeneticVariant

(MC4R) GeneticVariant

(?)