the beta distribution approachpure.au.dk/portal/files/81151799/paulatataruoxford.pdf · allele...

27
Beta spikes The Beta distribution approach PAULA TATARU AARHUS UNIVERSITY Bioinformatics Research Centre Oxford, July 19 th 2014 Modelling allele frequency data under the Wright Fisher model of drift, mutation and selection Joint work with Asger Hobolth and Thomas Bataillon

Upload: others

Post on 29-Sep-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Beta spikes

The Beta distribution approach

PAULA TATARU

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

Oxford, July 19th 2014

Modelling allele frequency data under the Wright Fisher model of drift, mutation and selection

Joint work with Asger Hobolth and Thomas Bataillon

Page 2: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

Motivation

› Infer population parameters from DNA data

› mutation rates

› selection coefficients

› split times

› variable population size back in time

› Backward in time (coalescent)

› Forward in time (Wright Fisher)

2

Page 3: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 3

The Wright Fisher model

Page 4: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 4

The Wright Fisher model

Page 5: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 5

The Wright Fisher model

Page 6: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

› Diffusion

› Kimura 1964

› Gautier & Vitalis 2013

› Malaspinas et al. 2012

› Steinrucken et al. 2013

› Zhao et al. 2013

› Moment based

› Normal distribution

› Nicholson et al. 2002

› Prickrell & Pritchard 2012

› Beta distribution

› Balding & Nichols 1995

› Siren et al. 2011

6

Approximations to the WF

Page 7: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

› Diffusion

› Kimura 1964

› Gautier & Vitalis 2013

› Malaspinas et al. 2012

› Steinrucken et al. 2013

› Zhao et al. 2013

› Moment based

› Normal distribution

› Nicholson et al. 2002

› Prickrell & Pritchard 2012

› Beta distribution

› Balding & Nichols 1995

› Siren et al. 2011

› Beta with spikes

7

Approximations to the WF

Page 8: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 8

The Beta approximation

Page 9: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 9

The Beta approximation

Page 10: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 10

The Beta approximation

Page 11: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

The Beta with spikes approximation

› The density of Xt

› Use recursive approach to calculate

› mean and variance

› loss and fixation probabilities

› mean and variance conditional on polymorphism

11

Page 12: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 12

› Hellinger distance

› true vs approximated distributions

› between 0 and 1

› Stationary: Beta distribution

› Diffusion > Beta with spikes > Beta

Page 13: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 13

Page 14: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 14

Page 15: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 15

Page 16: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 16

The Beta with spikes: worst fit

Page 17: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 17

The Beta with spikes: worst fit

Page 18: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 18

The Beta with spikes: worst fit

Page 19: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 19

Inference of split times

› Felsenstein’s peeling algorithm

› Numerically optimized likelihood

› 5000 loci

› 100 samples in each population

› 40 data sets

Page 20: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 20

Page 21: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

Conclusions

› Beta with spikes: new approximation to the WF

› Quality of approximation

› Consistent

› Diffusion > Beta with spikes > Beta

› Inference of split times

› Beta with spikes ~ Kim Tree

› Diffusion ?

21

Page 22: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre

Future work

› Inference of

› mutation rates

› selection coefficients

› variable population size

22

Page 23: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 23

The Beta approximation

Page 24: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 24

Mean and variance

Page 25: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach

Paula Tataru [email protected]

AARHUS

UNIVERSITY

Bioinformatics

Research Centre 25

Loss and fixation probabilities

Page 26: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 26

Page 27: The Beta distribution approachpure.au.dk/portal/files/81151799/PaulaTataruOxford.pdf · Allele frequencies: the Beta distribution approachAARHUS UNIVERSITY Paula Tataru paula@birc.au.dk

Allele frequencies: the Beta distribution approach AARHUS

UNIVERSITY

Bioinformatics

Research Centre Paula Tataru [email protected] 27