canine genetics research at the aht · canine genetics research at the aht broad objective is to...

38

Upload: others

Post on 14-Mar-2020

23 views

Category:

Documents


0 download

TRANSCRIPT

Canine Genetics Research at the AHT

Broad objective is to identify genetic variants underlying inherited

disorders in dogs to :

1. enable development of DNA tests that:

a) allow breeders to reduce disease prevalence in specific breeds

b) aide differential diagnosis

2. improve understanding of pathophysiology of disease, that could lead to

improved treatment / therapies in dogs and other species

The Give a Dog a Genome project will enable us to identify

disease mutations more quickly and cost-effectively

Give a Dog a Genome

• On 25th January the Kennel Club contacted the Breed Health Co-ordinators

of all 215 registered breeds, inviting them to participate in Give a Dog a

Genome.

• In 2015 the Kennel Club Charitable Trust awarded the KCGC £50k to spend

on Whole Genome Sequencing (WGS).

• This is sufficient funding to sequence ~25 whole genomes.

• We invited each breed community to donate £1000

• We promised to match each donation with £1000 from the KCCT funding to

fund the sequencing of the whole genome of a dog of that breed.

• If successful this matched funding approach would enable us to sequence

50 whole genomes, creating largest canine genome bank in the UK

Talk Layout

• DNA – the basics – don’t panic

• whole genome sequencing – using it to identify a disease mutation

• A success story

• More about Give a Dog a Genome

Structure of DNA

G C

T A

G C

G C

G C

T A

G C

G C

T A

T A

5’

3’ 5’

3’

sugar-phosphate

chain

H

C

N C

C C

N

H

O O

CH3

T

N

C C

N C

C N

N C

H

A

H

N

H

N C

C C

N

O

H

C

C

N H

H

H

N

C C

N C

C N

N C

H

G O

H

N

H

H

TCCGTAGCTAGGCATGCATGGACTG

AGGCATCGATCCGTACGTACCTGAC

TCCGTAGCTAGGCATGCATGGACTG

Coding DNA

A A U G G G G G A U U U C C C mRNA

A T V W T Protein

A C T G A A A C C C C T T G G

A A T G G G G G A T T T C C C DNA

Transcription

Translation

Genetic Code

AUG used to both initiate protein synthesis and direct the incorporation

of methionine at internal positions of the polypeptide chain

• three special ‘stop’ codons that do not code for an amino acid but signal

the termination of translation

GCA

GCC

GCG

GCU

Ala

A

AGA

AGG

CGA

CGC

CGG

CGU

Arg

R

GAC

GAU

Asp

D

AAC

AAU

Asn

N

UGC

UGU

Cys

C

GAA

GAG

Glu

E

CAA

CAG

Gln

Q

GGA

GGC

GGG

GGU

Gly

G

CAC

CAU

His

H

AUA

AUC

AUU

Ile

I

UUA

UUG

CUA

CUC

CUG

CUU

Leu

L

AAA

AAG

Lys

K

AUG

Met

M

UUC

UUU

Phe

F

CCA

CCC

CCG

CCU

Pro

P

AGC

AGU

UCA

UCC

UCG

UCU

Ser

S

ACA

ACC

ACG

ACU

Thr

T

UGG

Trp

W

UAC

UAU

Tyr

Y

GUA

GUC

GUG

GUU

Val

V

UAA

UAG

UGA

Stop

DNA

Non-coding

Coding (genes)

AAGTC

Mutations

A mutation is a permanent change to the nucleotide sequence of an organism’s DNA

Types of mutation:

Insertion – the addition of one or more extra nucleotides into the DNA

Deletion – the loss of one or more extra nucleotides from the DNA

Substitution – the exchange of a single nucleotide

Inversion –the ‘flipping over’ of a stretch of nucleotides

Mutations can be small-scale, involving single, or small numbers of nucleotides, or large,

involving hundreds, or thousands of nucleotides.

Normal

Insertion

Deletion

Substitution

Inversion

Frame Shift Mutations

Insertions and deletions often alter the frame of the DNA (a frame shift),

which can significantly alter the gene product.

GCT GAA TGC TCG ATC GGC AAG ATC GAT GAT

A E C S I G K I D D

GCT GAA TGC TCG ATC GGC CAA GAT CGA TGA T

A E C S I G Q D R *

Substitutions

• Silent Mutations

GCT GAA TGC TCG ATC GGC AAA/G ATC GAT GAT

A E C S I G K I D D

• Missense Mutations

GCT GAA TGC TCG ATC GGC AAG ATC GAT/A GAT

A E C S I G K I D/E D

• Nonsense Mutations

GCT GAA TGC/A TCG ATC GGC AAG ATC GAT GAT

A E C/* S I G K I D D

Nomenclature

• Dictionary definition of mutation:

– a permanent change to the nucleotide sequence of an organism’s DNA

• Conventional definition of mutation:

– change to the nucleotide sequence of an organism’s DNA that is

deleterious

• We refer to positions in the DNA that can be different between individuals but

are not necessarily deleterious as variants

Variation

• There is a lot of variation within the canine genome

• Most of that variation is neutral

• Some of the variation is positive, or advantageous

• Some of the variation is deleterious (disease-causing)

• The challenge is to identify the deleterious variants within all the

background, non-deleterious variation

How much DNA is there?

• The canine genome contains 2.4 x 109 nucleotides.

• This is nearly two and a half billion ‘letters’ of DNA.

• If each one measured 1 mm the DNA in the dog would stretch from

Land’s End to John O’Groats and back again.

• There are about 20,000 genes.

Finding a disease mutation using whole

genome sequencing

Canine Reference Genome

• In 2004 the Canine Genome Sequencing Project produced a high-

quality draft sequence of a female boxer named Tasha, known as

CanFam1.0

• By comparing Tasha with many other breeds, the project also

compiled an initial set of ~2.5 million single nucleotide

polymorphisms (SNPs)

• The current version of the canine reference genome is CanFam3.1

DOG 3 CGATTATGATCAACTAGACTACGCATGACACACCGATCGATCGATTCGATCGG

DOG 1 CGATTATGATCAACTAGACCAGGCATGGCATACCGATCGATCGAATCGCTCGT

DOG 2 CGATCATGATCAACTGGACTAGGCATGACATACCGATCGATCGAATCGATCGT

CASE CGATTATGATCAACTGGACTAGGCATGACACACCGATCGGTCGAATCGCTCGT

REF CGATCATGATCAACTAGACTAGGCATGACATACCGATCGATCGAATCGATCGT

Affected Dog = CASE

CASE CGATTATGATCAACTGGACTAGGCATGACACACCGATCGGTCGAATCGCTCGT

REF CGATCATGATCAACTAGACTAGGCATGACATACCGATCGATCGAATCGATCGT

DOG 4 CGATCATGATCAACTAGACTACGCATGACATACCGATCGATCGAATCGATCGT

DOG 5 CGATCATGATCAACTGGACTAGGCATGACATACCGATCGATCGAATCGATCGT

DOG 6 CGAACATGATCAACTGGACTAGGCATGACATACGGATCGATCGAATCGATCGT

DOG 7 CGATCATGATCAACTAGACTACGCATGACATACCGATCGATCGAATCGATCGT

DOG 8 CGATCATGATCAACTGGACTAGGCATGACATACGGATCGATCGAATAGATCGT

It’s a bit more complicated than that…

Number of homozygous variants with both alleles unique to sample

14809_MS 2,827,515

17289_BGVP 3,299,219

23005_V 3,418,818

24093_BC 3,763,033

24604_WSS_AB 2,952,464

25078_WSS_AB 3,062,121

25314_SHY 4,119,944

25852_SBT 3,469,799

26042_BOT 3,185,908

26102_LR 4,387,742

26133_G 3,012,348

26569_FBD 3,559,523

7897_FCR 3,095,933

CKCS_17377 2,597,150

Dennis5runs_merged 3,555,431

FCR_25382 3,022,227

FCR_25384 3,199,004

ISP_21897 4,084,161

ST_25135 1,986,209

SV_25951 3,048,115

V_21485.fastq 3,305,723

Filtering by effect of mutation

Sequence ontology Term Effect score

start_lost 5

stop_gained 5

splice_acceptor_variant 5

splice_donor_variant 5

inframe_deletion 5

inframe_insertion 5

missense_variant 5

3_prime_UTR_truncation 4

5_prime_UTR_truncation 4

regulatory_region_ablation 4

non_coding_exon_variant 4

splice_region_variant 4

conserved_intergenic_variant 3

conserved_intron_variant 3

nc_transcript_variant 3

coding_sequence_variant 3

feature_elongation 2

feature_truncation 2

intron_variant 2

non_coding_transcript_exon_variant 2

regulatory_region_variant 2

3_prime_UTR_variant 2

5_prime_UTR_variant 2

downstream_gene_variant 1

intergenic_region 1

intragenic_variant 1

intergenic_variant 1

Filter by segregation score

Cas

e 1

Cas

e 1

Cas

e 2

Cas

e 2

Cas

e 3

Cas

e 3

Co

ntr

ol 1

Co

ntr

ol 1

Co

ntr

ol 2

Co

ntr

ol 2

Co

ntr

ol 3

Co

ntr

ol 3

Seg score 1: AA/AB/BB

G G G G G G A A A A A A 6

G G G G G G A A A A G A 5

A G G G G G A A A A A A 5

G G G G G G G G A A A A 5

G G G G G G G G A G A A 4

We expect: Cases AA, Carriers AB, Controls BB

The Genome Bank

• The genomes of two different dogs will differ in ~2-3 million places.

• The more breeds that are represented in the bank of genomes the

more variation across the genome we will capture.

• The more variation we capture the more effectively we can filter out

benign (or neutral) variants.

• The more effectively we can filter out neutral variants, the more

quickly (& cheaply) we will be able to identify disease mutations.

• Sequencing the genomes of single representatives of 50 different

breeds will capture far more variants than multiple dogs of the same

breed.

The Genome Bank

• The genome bank is a permanent resource that can be used to

facilitate inherited disease investigations in ANY BREED

• Dogs affected with a specific inherited diseases can act as controls

for studies of a different disease. e.g. the genome from a dog

affected with PRA can act as a control for epilepsy.

• The sequence of a dog of any breed will help investigations in other

breeds.

DOG 3 CGATTATGATCAACTAGACTACGCATGACACACCGATCGATCGATTCGATCGG

DOG 1 CGATTATGATCAACTAGACCAGGCATGGCATACCGATCGATCGAATCGCTCGT

DOG 2 CGATCATGATCAACTGGACTAGGCATGACATACCGATCGATCGAATCGATCGT

CASE CGATTATGATCAACTGGACTAGGCATGACACACCGATCGGTCGAATCGCTCGT

REF CGATCATGATCAACTAGACTAGGCATGACATACCGATCGATCGAATCGATCGT

Affected Dog = CASE

DOG 4 CGATCATGATCAACTAGACTACGCATGACATACCGATCGATCGAATCGATCGT

DOG 5 CGATCATGATCAACTGGACTAGGCATGACATACCGATCGATCGAATCGATCGT

DOG 6 CGAACATGATCAACTGGACTAGGCATGACATACGGATCGATCGAATCGATCGT

DOG 7 CGATCATGATCAACTAGACTACGCATGACATACCGATCGATCGAATCGATCGT

DOG 8 CGATCATGATCAACTGGACTAGGCATGACATACGGATCGATCGAATAGATCGT

Cerebellar cortical degeneration

Video provided by Joe Fenn, Royal Veterinary

College

• Two four month old cases (littermates) presented separately at • Joe Fenn, Royal Veterinary College neurology department

– Signs of ataxia at 2-3 months of age – Marked head tremor at four months

• Most forms of cerebellar ataxia display an autosomal recessive mode of inheritance

• Genetically different across breeds

• Could we genome sequence one of the cases and identify causative mutation ?

• Genome of Vizsla case compared to genomes of 13 other dogs

• different breeds, non-ataxic

Cerebellar cortical degeneration

Genome sequencing variants amongst our 14 canine genomes:

15,223,374

Variants with a high effect score: 50,655

Variants HOMOZYGOUS in the Vizsla only 290

Variants in known candidate genes: 1

• exon 26 splice donor site mutation (GT>AT) in sorting nexin 14 gene (SNX14)

• an altered SNX14 splicing pattern for a CCD case was demonstrated by RNA analysis

• no SNX14 protein could be detected in CCD case cerebellum by western blotting.

• additional case also homozygous

• 107 additional Vizslas genotyped, one carrier identified

0

100

200

300

400

500

600

700

10 12 14 16 18 20 22 24 26 28 30

Nu

mb

er

of

va

ria

nts

aft

er

se

g s

co

re a

nd

eff

ec

t s

co

re f

ilte

r (m

ax

imu

m)

Number of controls

WGS comparisons: effect of the number of controls on filtered variants

Give a Dog a Genome

• 75 breeds have signed up to GDG since its official launch

• All have donated at least £1000

• The KCGC has recently donated a further £25k to support this project…

• …taking total raised >£150k

• Process of selecting dogs to sequence is now underway.

Choosing Dogs to Sequence

• Depending on the health information we receive from the BHCs we

will decide to sequence:

a) A dog affected with a disorder that is a demonstrable health concern in

the breed, or

b) An older (> 8yo), apparently healthy dog.

• We will invite owners of suitable dogs to send swabs directly to us.

• We will extract DNA and measure quantity and quality from all

suitable candidates, including archived DNA samples.

• We will select the most suitable dog, based on robust clinical

diagnosis and good quality DNA.

• The identify of the dog we select will be kept confidential

• First dogs will be sequenced during October

Healthy Dogs

• We are requesting that healthy, older dogs have a full eye

examination, including gonioscopy, so we can use them as controls

for all our eye studies.

• The gonioscopy is required for our glaucoma studies.

• Because gonioscopy requires a considerable degree of expertise we

request that the eye examination is done by an ophthalmologist that

is a DipECVO, i.e.a recognised specialist.

• To help meet these rigorous criteria the AHT will reimburse the

costs for up to 3 dogs to have a BVA certificate, including

gonioscopy.

Health Summary

• > 80 health concerns listed by 66 breeds

• About half concerns are listed by single breeds only

Health concern Number of concerned

breeds

Idiopathic Epilepsy 15

Cataract, Hereditary 7

PRA - early & late onset 6

Hip dysplasia 6

Hypothyroidism (autoimmune thyroiditis?) 6

Lymphoma 6

Auto-immune mediated conditions 5

Cancer/ tumours 5

Dilated Cardiomyopathy 5

Patella luxation 5

Glaucoma 4

Cardiac 4

Degenerative Myelopathy (CDRM) 4

Elbow Dysplasia 4

Osteosarcoma 4

Acknowledgements

• Kennel Club Charitable Trust

• Louise Burmeister

Ellen Schofield

Mike Boursnell

Oliver Forman

Bryan McLaughlin

Kat Lawson

• Breed Health Co-Ordinators

Thank You

Nia & James’ Captured Moments