introduction to genomic medicine - university of...

33
Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13 October, 2017 Session I. Genes and Genomes Presented by Júlia Baptista PhD Clinical Scientist and Honorary Lecturer Developed with Sawsan Khuri PhD

Upload: others

Post on 09-Jun-2020

8 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Introduction to Genomic MedicineExeter Expert Series: Respiratory

12-13 October, 2017

Session I. Genes and Genomes

Presented by

Júlia Baptista PhD

Clinical Scientist and Honorary Lecturer

Developed with Sawsan Khuri PhD

Page 2: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13
Page 3: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

What will be covered

• Structure of a gene

• Genomes and the birth of genomics

• Types of genomic variants and clinical interpretation

Page 4: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

DNA

• DNA = deoxyribonucleic acid

• Stable double-stranded polymer of nucleotides: – Deoxyribose sugar, Phosphate, Base (A, T, C & G)

• Function: storage of genetic code

Page 5: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Central dogma of biology

Page 6: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

RNA

• RNA = ribonucleic acid • Unstable single-stranded polymer of nucleotides: –

Ribose sugar, Phosphate, Base (A, U, C & G)

• Different types with different functions:– Messenger RNA (mRNA): coding intermediary between DNA and proteins– Non-coding RNA (ncRNA): various roles• Protein synthesis: ribosomal (rRNA), transfer RNA (tRNA).• Gene regulation: microRNA (miRNA), small interfering or

silencing RNA(siRNA), long non-coding RNA (lncRNA)• Catalysis: ribozymes

Page 7: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Proteins

• Polymers of amino acids

• Many different functions: transport, structural, catalytic, signalling, etc.

Page 8: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

What is a gene?

• A gene is the basic physical and functional unit of heredity.

• Genes, which are made up of DNA, act as instructions to make proteins.

Page 9: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Structure of a gene

Coding Exons

Un-Translated Regions at 5’ and 3’

as well as

• sequence variations (SNPs etc)

• Regulatory elements, in promoter region and throughout

Gerstein et al, Genome Research, 2007

Non protein-coding Introns

alternative splicingone transcript has this exon the other doesn't

Page 10: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Exon 1 Exon 2

TSSTranscriptional Start Site

5’UTR

Translation initiation codonATG

Promoter region

5’ 3’

Exon 1 Exon 2

Promoters have transcription factor binding sites (regulatory elements)

The 5’ structure

Barrett et al, Cell Mol Life Sci, 2012

Page 11: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

The 3’ end structure

Exon nExon n-15’ 3’

Usually contains:• PolyA Tail addition signal• microRNA binding sites• Silencer regions• AU-rich elements& other regulatory elements

Stop codonTAG/TAA/TGA

3’UTR

Barrett et al, Cell Mol Life Sci, 2012

Page 12: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Genes, genomes, and genomics

• A genome is the total genetic content of an organism, whether it is coding or not.

• Genomic Medicine is the use of genomic information and its emerging technologies for disease diagnosis, prognosis, and intervention.

Page 13: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

The Human Genome Project

Page 14: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

ENCODE: identify all functional elements in the human genome

from the ENCODE project

Page 15: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Mapping genomic variation

Retired in 2016

Completed in 2015

UK now on 100,000 Genomes ProjectAfrican, Japan, Singapore, India... genome projects currently ongoing

Page 16: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Next generation sequencingNew technologies helped reduce the cost of sequencing human genomes

Data handling is now the bottleneck. It costs more to analyse a genome than to sequence it.

Page 17: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

– Gene panel sequencing

• Targeted sequencing that will detect changes in a selected panel of genes known to cause the phenotype of interest.

– Clinical exome sequencing

• Targeted sequencing of exons of “all” known disease genes (~6,000 genes).

– Whole exome sequencing (WES)

• Targeted sequencing of “all” exons of all known genes (~23,000 genes)

– Whole genome sequencing (WGS) (entire genome sequencing) in the NHS in 2018

Next generation sequencing: genomic medicine

Page 18: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Large scale genetic variation

• Mainly chromosomal changes. These can be changes to number or structure. Structural: deletions, duplications, translocations, inversions and insertions

• Copy Number Variants

– Insertions or deletions of >1000 nucleotides, causing gains or losses of genomic regions

Relevant in various lung cancers and a decreased risk of chronic obstructive pulmonary disease (e.g. Qiu et al., 2017; Wang and Lemos, 2017; Jamal-Hanjani et al., 2017; Chen et al., 2017)

Page 19: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Cytogenetics/Cytogenomics detects large scale variants

DNA loss DNA gainG-banded karytotypeArray-CGH

Page 20: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Small scale: Single nucleotide variants (SNVs)

Types of SNVs• Substitutions in non-coding sequences

– Splice sites, regulatory elements binding sites

• Substitutions in coding sequences

– Synonymous (do not change the amino acid), Non-synonymous (missense or nonsense)

• Insertions/deletions (indels)

– In coding sequences may lead to malformed proteins

– In non-coding sequences may delete or insert regulatory signals

Page 21: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Synonymous variants: no change to protein sequence

Normal sequence

GGTCTCCTCACGCCA

Pro-Glu-Glu-Cys-Gly

Substitution : C>T

GGTCTTCTCACGCCA

Pro-Glu-Glu-Cys-Gly

Page 22: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Missense variants: amino acid substitution

Normal sequence

AGC TAC GGG GTG GGC

Ser Tyr Gly Val Gly

Substitution

AGC TAC AGG GTG GGC

Ser Tyr Arg Val Gly

Some amino acids are similar to each otherNot all amino acids are important for protein folding/function

Page 23: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Nonsense variants

Normal sequence

GGTCTCCTCACGCCA

Pro-Glu-Glu-Cys-Gly

Often lead to premature termination of the protein.The truncated protein can be targeted for degradation by the nonsense-mediated pathway.

Substitution

GGTCTCCTCACTCCA

Pro-Glu-Glu-STOP

Page 24: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Insertions/Deletions

Normal sequence

GGTCTCCTCACGCCA

Pro-Glu-Glu-Cys-Gly

Insertion of G

GGTGCTCCTCACGCCA

Pro-Arg-Gly-Val-Arg

Insertions and deletions can be in-frame or out-of-frame

(frame shift)

Insertions or deletions of a nucleotide number that disrupt the reading frame will lead to frameshifts, often a premature stop codon is formed/

Page 25: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Sources of variation

• Exogenous factors (chemical agents, UV radiation)

• Endogenous factors

– -Replication slippage/slipped strand mis-pairing

– -Recombination (large scale variants)

– -Segregation (Aneuploidies)

Page 26: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

• Variants that are present in the germ cells (cells that give rise to the gametes) can be passed onto the next generation when that cell participates in fertilization. These variants are important in evolution.

• Variants that arise in a cell lineage other than the germline are somatic. These are postzygotic events and cannot be inherited. Somatic changes are mosaic.

Somatic and germline variation

Page 27: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

From ThinkLink.com

Page 28: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Meiosis

• Starts with replication of genetic material (where SNPs can happen)

• Chromosomes are intertwined and crossing over can occur (where CNVs can happen)

• Results in unique haploid cells called gametes which could be quite different from parent cell, each containing a single copy of the genome

• Gametes come together during mating to produce a diploid embryo

• Hence, any SNVs and CNVs are passed down to the next generation

Evolution is driven by natural selection of the variants brought about during meiosis.

Page 29: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Mitosis

• Starts with replication of genetic material (where SNPs can happen)

• Chromosomes are lined up in middle ready to split into two cells, but they are not intertwined

• Results in diploid cells with usually only slight differences from parent cell

• Repeats to heal a wound, or to renew or grow a tissue, organ, etc

• Mitotic variants are not passed down to the next generation

Evolution is not affected by variants brought about during mitosis. Genetic changes during cancer are mitotic.

Page 30: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Interpreting variants in the clinic

• ~ 3-4 million SNVs per genome

• Mostly “common variants” not connected with a disease phenotype

• SNVs associated with disease tend to be rare.

• Important evidence:• Population data

• Disease databases

• De novo data

• Segregation

• Functional data

• Computational predictions

Page 31: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Nature website: Human Genome Variation

Page 32: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13
Page 33: Introduction to Genomic Medicine - University of Exetermedicine.exeter.ac.uk/media/universityofexeter/... · Introduction to Genomic Medicine Exeter Expert Series: Respiratory 12-13

Reading a gene