Bioinformatics and Evolutionary Genomics: The tree of life / HGT, origin of eukaryotes

Upload: benny

Post on 15-Jan-2016

TRANSCRIPT

Page 1: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Bioinformatics and Evolutionary Genomics

The tree of life / HGT, origin of eukaryotes

Page 2: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

LUCA

“three kingdoms”

Page 3: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

How to root the tree of life?

1: Find paralogs that duplicated before the LUCA

6 found so far

Page 4: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

How to root the tree of life?

2: Make a tree of paralogs that duplicated before the LUCA

Gribaldo 1998 J Mol Evol
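The idea behind paralog rooting can be sketched in a few lines: if a gene duplicated before the LUCA, each paralog subtree acts as an outgroup for the other, so the deepest split inside each subtree suggests where the root of the tree of life lies. The following is only a toy illustration; the nested-tuple tree, the leaf names and the helper functions (`leaves`, `domains`, `root_split`) are assumptions made for this example and are not taken from the cited study.

```python
# Toy illustration of paralog rooting. The topology and leaf names are
# hypothetical, not data from the slides.

def leaves(tree):
    """Return the set of leaf names under a nested-tuple tree."""
    if isinstance(tree, str):
        return {tree}
    names = set()
    for child in tree:
        names |= leaves(child)
    return names


def domains(tree):
    """Leaf names look like 'Domain_gene'; return the set of domain prefixes."""
    return frozenset(name.split("_")[0] for name in leaves(tree))


def root_split(paralog_subtree):
    """Return the deepest split inside one (binary) paralog subtree as a pair of domain sets."""
    left, right = paralog_subtree
    return frozenset([domains(left), domains(right)])


# Two paralogs (copy 1 and copy 2) assumed to have duplicated before the LUCA.
copy1 = ("Bacteria_copy1", ("Archaea_copy1", "Eukarya_copy1"))
copy2 = ("Bacteria_copy2", ("Archaea_copy2", "Eukarya_copy2"))
gene_tree = (copy1, copy2)  # rooted on the duplication branch

splits = [root_split(subtree) for subtree in gene_tree]
consistent = splits[0] == splits[1]
print("Both paralog subtrees agree on the root:", consistent)
for side in sorted(splits[0], key=len):
    print(sorted(side))
```

When both subtrees show the same deepest split, the two paralogs give a consistent answer; with real data the subtrees can disagree, which is one reason several independent paralog pairs are examined.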

Page 5: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

How to make a tree of life?

Issue: Horizontal Gene Transfer (HGT)

• As opposed to normal vertical inheritance

• Inheritance from somewhere else than parents

• AKA lateral gene transfer
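A common signal used to suspect HGT is a gene tree that disagrees with the species tree. The sketch below lists the species-tree clades that a gene tree fails to recover. It is a minimal illustration under stated assumptions: both trees are rooted nested tuples, the species names A to D are hypothetical, and `clades` and `conflicting_clades` are helper names invented for this example.

```python
# Minimal sketch: compare the clades of a gene tree with those of a species tree.
# Trees are rooted nested tuples; species A-D are hypothetical.

def clades(tree):
    """Return every clade of a rooted tree as a frozenset of leaf names."""
    found = set()

    def walk(node):
        if isinstance(node, str):
            return frozenset([node])
        members = frozenset().union(*(walk(child) for child in node))
        found.add(members)
        return members

    walk(tree)
    return found


def conflicting_clades(species_tree, gene_tree):
    """Species-tree clades that are not recovered in the gene tree."""
    return clades(species_tree) - clades(gene_tree)


species_tree = ((("A", "B"), "C"), "D")
# In this toy gene tree the copy found in species B groups with D,
# as expected if B acquired the gene from the lineage of D.
gene_tree = (("A", "C"), ("B", "D"))

for clade in sorted(conflicting_clades(species_tree, gene_tree), key=len):
    print("not recovered in gene tree:", sorted(clade))
```

Incongruence alone does not prove HGT: hidden paralogy followed by gene loss, or plain tree-reconstruction error, can produce the same pattern.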

Page 6: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

HGT

[Figure: gene tree with leaves Bs1, Mg1, Ec1, Ct1, Rp1, Af1]

Page 7: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

HGT

[Figure: gene tree with leaves Bs1, Mg1, Ec1, Ct1, Rp1, Af1]

Page 8: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

HGT

[Figure: gene tree with leaves Bs2, Mg2, Bs1, Mg1, Ec1, Ct1, Rp1, Af1]

Page 9: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

HGT

[Figure: gene tree with leaves Bs2, Mg2, Ec1, Ct1, Rp1, Af1]

Page 10: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

HGT: frequently observed when many genome sequences became available

Page 11: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

HGT & Tree of Life (ToL)

Page 12: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Transition prokaryotes to eukaryotes: big transition

• The prekaryote

• No more intermediates

• How to look before the event horizon?

Page 13: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Endosymbiosis of an alpha-proteobacterium gave rise to mitochondria

• Mitochondrial DNA in the mitochondria

• Hydrogenosomes shown to be derived from mitochondria

• Many proteins active in present-day mitochondria are encoded by genes of eukaryotic invention or archaeal descent

• Many proteins of the alpha-proteobacterial ancestor are active in other parts of the cell

Page 14: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

rRNA tree

16S Ribosomal RNA

Mitochondria have their own mini genome

Page 15: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Identifying eukaryotic proteins with an alpha-proteobacterial origin based on their phylogeny

Alpha-proteobacterial proteins with the rest of the bacteria and archaea

Eukaryotic + alpha-proteobacteria in the same branch

Page 16: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Detecting eukaryotic genes of alpha-proteobacterial ancestry

[Flowchart: GENOME (6 alpha-proteobacteria, 22 500 genes) → SELECTION OF HOMOLOGS (Smith & Waterman) in GENOMES (6 alpha-proteobacteria, 9 eukaryotes, 56 Bacteria + Archaea) → ALIGNMENTS AND TREE (ClustalX, Kimura + Dayhoff) → PHYLOME → TREE SCANNING → LIST]
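The tree-scanning step can be pictured as a search, in every gene tree, for a branch that holds eukaryotic and alpha-proteobacterial sequences and nothing else. The sketch below is only a hedged illustration of that idea and is not the criterion used in the cited study: it assumes rooted nested-tuple trees (real gene trees are usually unrooted), a hypothetical `group:gene` leaf naming scheme, and the invented helpers `leaf_groups` and `has_alpha_euk_branch`.

```python
# Hedged sketch of "tree scanning": flag gene trees in which eukaryotic and
# alpha-proteobacterial sequences share a branch that excludes everything else.
# Leaf names use a hypothetical 'group:gene' format.

def leaf_groups(node):
    """Return the set of group labels ('euk', 'alpha', 'other') under a node."""
    if isinstance(node, str):
        return {node.split(":")[0]}
    groups = set()
    for child in node:
        groups |= leaf_groups(child)
    return groups


def has_alpha_euk_branch(tree):
    """True if some subtree contains only 'alpha' and 'euk' leaves, with at least one of each."""
    if isinstance(tree, str):
        return False
    if leaf_groups(tree) == {"alpha", "euk"}:
        return True
    return any(has_alpha_euk_branch(child) for child in tree)


# Toy phylome: one tree with the expected signal and one without it.
phylome = {
    "geneX": ((("euk:yeast_X", "euk:human_X"), "alpha:Rp_X"),
              ("other:Ec_X", "other:Bs_X")),
    "geneY": (("euk:yeast_Y", "other:Af_Y"),
              ("alpha:Rp_Y", "other:Ec_Y")),
}

candidates = [name for name, tree in phylome.items() if has_alpha_euk_branch(tree)]
print("candidate genes of alpha-proteobacterial ancestry:", candidates)
```

A real scan would also have to handle unrooted trees, branch support and missing taxa; the toy version only captures the "same branch" test.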

Page 17: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Proto-mitochondrial metabolism:

- Catabolism of fatty acids, glycerol and amino acids.

- Some pathways are not mitochondrial.

[Figure legend: non-mitoch.., mitochondrial, not in yeast/human]

Page 18: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

The majority of the proto-mitochondrial proteome is not mitochondrial (anymore)

Yeast mitochondrial proteome:

[Venn diagram of three protein sets: Eric Schon, Methods Cell Biol 2001 (manually curated); Huh et al., Nature 2003 (green fluorescent genomics); Gabaldon & Huynen, Science 2003 (alpha-prot.). Counts shown: 566, 527, 303, 10, 59, 35, 293]

Human mitochondrial proteome:

[Venn diagram including Eric Schon, Methods Cell Biol 2001. Counts shown: 755, 508, 113]

Page 19: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

From endosymbiont to organelle: not only loss and gain of proteins but also “retargeting”

[Figure: proteins over time (t), from ancestor to modern mitochondria: loss, gain, re-targeting]

~16% of the mitochondrial yeast proteins are of alpha-proteobacterial origin.

~65% of the alpha-proteobacteria derived set is not mitochondrial.

Gabaldon and Huynen, Science 2003
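Both percentages are simple set ratios: the share of the mitochondrial proteome that is alpha-proteobacteria derived, and the share of the alpha-proteobacteria derived set that is no longer mitochondrial. A minimal sketch of the arithmetic follows; the protein identifiers are made up and the sets are deliberately tiny, so the printed values do not reproduce the figures on the slide.

```python
# Toy arithmetic for "retained" versus "retargeted" proteins.
# All identifiers are hypothetical placeholders, not real proteome data.

def percent(part, whole):
    """Size of `part` as a percentage of the size of `whole`."""
    return 100.0 * len(part) / len(whole)


mitochondrial = {"p1", "p2", "p3", "p4", "p5"}        # modern mitochondrial proteome
alpha_derived = {"p4", "p5", "p6", "p7", "p8", "p9"}  # alpha-proteobacteria derived set

retained = mitochondrial & alpha_derived    # still mitochondrial
retargeted = alpha_derived - mitochondrial  # now active elsewhere in the cell

print(f"{percent(retained, mitochondrial):.0f}% of mitochondrial proteins are alpha-derived")
print(f"{percent(retargeted, alpha_derived):.0f}% of the alpha-derived set is not mitochondrial")
```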

Page 20: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

“When” did the mitochondria invade the eukaryotes?

• Genes of alpha-proteobacterial descent are present in the genomes of mitochondria-less organisms (cf. toni)

• All eukaryotes have or had a mitochondrion / alpha-proteobacterial symbiont

• It thus happened before the last common ancestor of all eukaryotes

• But then still “when”? (b)

Page 21: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

What about all other cellular innovations that set eukaryotes apart from prokaryotes?

Page 22: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

The prekaryote-LECA transition

Makarova NAR 2005

Page 23: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Duplication more prevalent in pre-eukaryotes than in archaea or bacteria

Makarova NAR 2005

Page 24: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Duplications: e.g. small GTPases

Page 25: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

So what was the order of all these duplications and the endosymbiosis?

• Unknown, but all before the LECA

• According to one theory, endocytosis was a late event for the prekaryote, coming after many of the eukaryotic inventions. To be tested: are genes of alpha-proteobacterial origin involved in crucial eukaryotic cellular processes (e.g. nuclear import)?

Page 26: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Eukaryotic tree of life?

• The divisions:

– Opisthokonts (animals, fungi, microsporidia)

– Amoebozoa (Dictyostelium)

– Chromalveolata (Paramecium, Plasmodium, but also diatoms)

– Archaeplastida

– Excavata

– Rhizaria

• Historically: crown-group eukaryotes vs protists

• What is a complete genome; draft genomes

Page 27: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Animals

• Most primitive: sponges

• Quite a number of genome sequences (of dubious completeness)

Page 28: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Fungi

• Many complete genomes

• Broad, Genolevures

• Microsporidium (E. cuniculi)

• Mushrooms are Basidiomycetes

• Together with animals: opisthokonts

Page 29: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Amoebozoa

• Few genomes

– Entamoeba histolytica

– Dictyostelium discoideum

Page 30: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Archaeplastida

• Second bacterial endosymbiosis event: cyanobacteria

• Green algae, red algae, plants

• ~5 genomes

Page 31: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Chromalveolates

• Secondary endosymbiosis: plastids

• Very different species: diatoms (also commonly referred to as algae), oomycetes, Paramecium, alveolates, dinoflagellates

• Quite some genomes (~10)

Page 32: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Excavata

• Weird parasites (Giardia, Trypanosoma, Leishmania)

• But also: Naegleria gruberi, an amoeboflagellate

Page 33: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

Rhizaria

• Amoeboids + amoeboflagellates

• Produce shells which make up the vast majority of protozoan fossils

• No genomes (yet)

Page 34: Bioinformatics and Evolutionary Genomics The tree of life / HGT , origin of eukaryotes

How are eukaryotes related?

• Historically: crown-group eukaryotes vs protists, but now molecular evidence

• Two hypotheses for the position of the root:

– In or just after Excavata

– In between opisthokonts/amoebozoa and the rest (unikont vs bikont), myosins

• Rhizaria?

• Phagotrophic origin of eukaryotes: an amoeba with flagella?