botrytis/sclerotinia resources: an integrated system for ... · manual curation annotation apollo...

52
A L I M E N T A T I O N A G R I C U L T U R E E N V I R O N N E M E N T Botrytis/Sclerotinia resources: an integrated system for structural and functional genome annotation BSPGW, Sept 17th, 2011 J. Amselem, N. Lapalu

Upload: others

Post on 04-Jun-2020

5 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

A L I M E N T A T I O N A G R I C U L T U R E

E N V I R O N N E M E N T

Botrytis/Sclerotinia resources: an integrated system for

structural and functional genome annotation

BSPGW, Sept 17th, 2011

J. Amselem, N. Lapalu

Page 2: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

GnpIS: a data warehouse http://urgi.versailles.inra.fr/gnpis

2

Interop.

Transcriptome

Maps

DNA Polymorphisms

Genetic collections Genetic resources

Genome annotation

Page 3: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Distributed annotation system

1

Annotation pipelines: - Gene prediction - Refinement analysis - Functional annotation - Comparative genomics

SeqFeature:Store Gbrowse fast display

1- Structural 2- Functional 3- comparative

Gbrowse

Ref=polypeptide

Ref=genome assembly Gbrowse_Syn

S. Sclerotiorum

B. cinerea T4

B. cinerea B0510

Page 4: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Chado Edition

Structural & functional annotation Manual curation

Apollo

Genome Report System

+ manual curation

Distributed annotation system

1

Annotation pipelines: - Gene prediction - Refinement analysis - Functional annotation

SeqFeature:Store Gbrowse fast display

1- Structural 2- Functional 3- comparative

Gbrowse

Ref=polypeptide

Ref=genome assembly Gbrowse_Syn

S. Sclerotiorum

B. cinerea T4

B. cinerea B0510

Annotation pipelines: - Gene prediction - Refinement analysis - Functional annotation - Comparative genomics

Page 5: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Advanced search BioMart Quick search

Lucene/Hibernate

Chado Edition

Structural & functional annotation Manual curation

Apollo

Genome Report System

+ manual curation

Distributed annotation system

1

Annotation pipelines: - Gene prediction - Refinement analysis - Functional annotation

SeqFeature:Store Gbrowse fast display

1- Structural 2- Functional 3- comparative

Gbrowse

Ref=polypeptide

Ref=genome assembly Gbrowse_Syn

S. Sclerotiorum

B. cinerea T4

B. cinerea B0510

Annotation pipelines: - Gene prediction - Refinement analysis - Functional annotation - Comparative genomics

Page 6: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Quick and advanced search http://urgi.versailles.inra.fr/gnpis

Quick search based on :

- hibernate search - Apache Lucene™ full-featured text search engine library

Biomart based advanced search

Galaxy Set of tools

for data mining

Page 7: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Quick search

Keywords Gene id

Sequence id … User guide

Page 8: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Quick search

8

Link to GBrowse

Page 9: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Quick search

9

Link to Genome Report System

Page 10: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Biomart based advanced search

Page 11: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Signal_peptide

Start=1 End=30

Iprscan_annotation

Biomart : request form

Page 12: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Biomart results

Link to GRS

Results

Page 13: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Pipelines

13

Page 14: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Structural annotation pipelines Transposable Elements: REPET package

http://urgi.versailles.inra.fr/Tools

Page 15: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Structural annotation pipelines Transposable Elements: REPET package

http://urgi.versailles.inra.fr/Tools/REPET Genes prediction: EuGene

http://eugene.toulouse.inra.fr/

Page 16: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Functional annotation pipeline

Protein domain identification

Coils, Patternscan, Profilescan, Scanregexp, seg

Predicted polypeptides

Page 17: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Functional annotation pipeline

Protein domain identification

Coils, Patternscan, Profilescan, Scanregexp, seg

Blast similarities

KOG

rpsBlast Conserved domains

Predicted polypeptides

Page 18: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Functional annotation pipeline

Protein domain identification

Coils, Patternscan, Profilescan, Scanregexp, seg

TMHMM

SignalP TargetP

Localization Targeting

Blast similarities

KOG

rpsBlast Conserved domains

Predicted polypeptides

Page 19: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Functional annotation pipeline

Protein domain identification

Coils, Patternscan, Profilescan, Scanregexp, seg

TMHMM

SignalP TargetP

Localization Targeting

Blast similarities

KOG

rpsBlast Conserved domains

Genome Report System

Gene/Protein

Gbrowse Ref=polypetide

Predicted polypeptides

Chado DB:SeqFeatureStore

GFF3

Page 20: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Functional annotation pipeline

Protein domain identification

Coils, Patternscan, Profilescan, Scanregexp, seg

TMHMM

SignalP TargetP

Localization Targeting

Blast similarities

KOG

rpsBlast Conserved domains

Genome Report System

Gene/Protein

Gbrowse Ref=polypetide

Predicted polypeptides

Chado DB:SeqFeatureStore

GFF3

Botrytis T4, B05.10 and Sclerotinia predicted genes

annotated

Page 21: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces & dataflow

21

Page 22: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces and dataflow : B. cinerea

Botrytis portal

GnpIS portal

http://urgi.versailles.inra.fr/Species/Botrytis

http://urgi.versailles.inra.fr/gnpis

2 ways to access data

Page 23: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces and dataflow : B. cinerea

Botrytis portal Sequences &

Databases

GnpIS portal

Genes curation/Validation

Transcriptomics GnpArray

Structural annotation

GnpGenome

Gbrowse_syn Botrytis T4

Botrytis B0510 Sclerotinia

Genome Report System Botrytis T4

Botrytis B0510 Sclerotinia

Page 24: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces and dataflow : B. cinerea

Botrytis portal Sequences &

Databases

GnpIS portal

Genes curation/Validation

Transcriptomics GnpArray

Structural annotation

GnpGenome

Gbrowse_syn Botrytis T4

Botrytis B0510 Sclerotinia

Genome Report System Botrytis T4

Botrytis B0510 Sclerotinia

Quick search

Advanced search

Page 25: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces and dataflow : B. cinerea

Botrytis portal Sequences &

Databases

GnpIS portal

Genes curation/Validation

Transcriptomics GnpArray

Structural annotation

GnpGenome

Gbrowse_syn Botrytis T4

Botrytis B0510 Sclerotinia

Genome Report System Botrytis T4

Botrytis B0510 Sclerotinia

Page 26: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces and dataflow : B. cinerea

Botrytis portal Sequences &

Databases

GnpIS portal

Genes curation/Validation

Transcriptomics GnpArray

Structural annotation

GnpGenome

Gbrowse_syn Botrytis T4

Botrytis B0510 Sclerotinia

Genome Report System Botrytis T4

Botrytis B0510 Sclerotinia Galaxy

to cross and mine data

Biomart Advance search

Page 27: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Interfaces and dataflow : B. cinerea

Botrytis portal Sequences &

Databases

GnpIS portal

Genes curation/Validation

Transcriptomics GnpArray

Structural annotation

GnpGenome

Gbrowse_syn Botrytis T4

Botrytis B0510 Sclerotinia

Genome Report System Botrytis T4

Botrytis B0510 Sclerotinia Galaxy

to cross and mine data

Biomart Advance search

Demo In next Nicolas' talk

Page 28: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Transcriptomics : GnpArray http://urgi.versailles.inra.fr/GnpArray/transcriptome/welcome.do

Page 29: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Transcriptomics : GnpArray http://urgi.versailles.inra.fr/GnpArray/transcriptome/welcome.do

29

Gene lists: In planta upregulated In planta down regulated Unchanged

Gene lists associated to a project 1- Gene list in "Queries" menu 2- select project & data 3- Display results

1

2

3

Page 30: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

30

4

1- Gene list in "Queries" menu 2- select project & data 3- Display results 4- Display Gene list details

Page 31: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

31

Gene card and reporters (probes)

1- Gene list in "Queries" menu 2- select project & data 3- Display results 4- Display Gene list details 5- Display Gene card

5

Page 32: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

GnpGenome

32 Reference=supercontigs

Page 33: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

GnpGenome

33 Reference=supercontigs

Page 34: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

GnpGenome

34

GnpArray: Gene card

Genome Report System

Gbrowse: functional domains

Page 35: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Gbrowse_syn: Botrytis T4 & B0510 /Sclerotinia

35

landmark Reference species Zoom Species to

align with the reference

Reference species chosen

Page 36: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

GRS : Genome Report System

36

http://urgi.versailles.inra.fr/grs

Page 37: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

37

More in the next Nicolas' talk

Page 38: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

38

Apollo editing interface for gene curation http://urgi.versailles.inra.fr/Tools/Apollo

Analysis tracks

Gene models

Selected feature information

Zoom

Page 39: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Manual annotation roundtrip

SeqFeature:Store Gbrowse fast

display Structural Annotation

Chado Edition

Structural & functional annotation

GBrowse

Read

Write Write

Read Read

Apollo webstart page Genome Report

System Botrytis T4

Botrytis B0510 Sclerotinia

Annotation Results

Page 40: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Manual annotation roundtrip

SeqFeature:Store Gbrowse fast

display Structural Annotation

Chado Edition

Structural & functional annotation

GBrowse

Read

Write Write

Read Read

Apollo webstart page

Replace the previous manual

annotation release

Genome Report System

Botrytis T4 Botrytis B0510

Sclerotinia

Page 41: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Manual annotation roundtrip

Chado Edition

Structural & functional annotation

Write Write

Read Read

Apollo webstart page

Run Functional Annotation

pipeline

Genome Report System

& Edition

Extract Genes

manually curated Update database

Page 42: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Manual annotation roundtrip

Chado Edition

Structural & functional annotation

Write Write

Read Read

Apollo webstart page

Run Functional Annotation

pipeline

Genome Report System

& Edition

Extract Genes

manually curated Update database

> 1200 genes manually validated/curated

+ Functional automated

re-annotation

Page 43: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation Workflow tutorial

43

Page 44: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 1

•  Start from fasta sequence •  Blast it

  http://urgi.versailles.inra.fr/index.php/urgi/Species/Botrytis/   Sequences & Databases Blast

  Get the list of genes IDs in Botrytis T4/B0510 or Sclerotinia genome

44

Page 45: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 1

•  Start from fasta sequence •  Blast it •  Retrieve whole BT4 annotation (structural & functional)

for each gene in Genome Report System   http://urgi.versailles.inra.fr/grs/index.html

45

Page 46: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 1

•  Start from fasta sequence •  Blast it •  Retrieve whole BT4 annotation (structural & functional)

for each gene in Genome Report System   What does the conserved domains indicate about the protein

function?   How many Botrytis ESTs match to the gene? What does it

indicate about the expression pattern?   Does the BDBH program indicate putative orthologs in B05-10

strain and in Sclerotinia?   Which T4 SuperContig contain the gene?   Select the ESTs tracks. Are they in accordance with the

predicted annotation from Eugene (and/or Fgenesh)? How many exons are detected?

46

Page 47: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 1 •  Start from fasta sequence •  Blast it •  Retrieve whole BT4 annotation (structural & functional) for each

gene in Genome Report System •  Look at the orthologs genes in the three genomes. •  Is there a synteny between Botrytis and Sclerotinia around the gene

  http://gpi.versailles.inra.fr/cgi-bin/gbrowse_syn/botrytis

47

bt4_SupSuperContig_36_28_1:20000..100000 SS1G_02407.1

BofuT4_P124220.1

BC1G_05954.1

Page 48: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 1 •  Start from fasta sequence •  Blast it •  Retrieve whole BT4 annotation (structural & functional) for each

gene in Genome Report System •  Look at the orthologs genes in the three genomes. •  Is there a synteny between Botrytis and Sclerotinia around the gene •  Get coordinates from GnpGenome then open apollo to check

curate/validate the gene   http://urgi.versailles.inra.fr/Tools/Apollo

Page 49: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 2

•  Search for a gene family. Ex: "transcription factor"   No fasta sequence provided   Search by keyword   http://urgi.versailles.inra.fr/gnpis

49

Page 50: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 2

•  Search for a gene family. Ex: "transcription factor" •  Display the Genome report for these genes

•  Get list of gene sharing the same GeneOntology ID

50

Page 51: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

Annotation workflow 2

•  Search for a gene family. Ex: "transcription factor"

•  Display the Genome report for these genes •  Get list of gene sharing the same GeneOntology

ID •  Get the orthologs in B0510 & Sclerotinia

  From GRS using Ortholog category   From GRS, get gene coordinates and look for

syntenic region using Gbrowse_syn

51

Page 52: Botrytis/Sclerotinia resources: an integrated system for ... · Manual curation annotation Apollo Genome Report System + manual curation Distributed annotation system 1 Annotation

•  Nicolas Lapalu

•  Baptiste Brault

•  Laetitia Brigitte

•  Jonathan Kreplak •  Françoise Alfama

•  Aminah Keliet

•  Erik Kimmel •  Isabelle Luyten

•  Sébastien Reboux

•  Delphine Steinbach

•  Hadi Quesneville

Thanks to …

Funding ANR GnpInteGr ANR GnpAnnot

Thanks to consortium

•  Marc-Henri Lebrun

•  Adeline Simon •  Muriel Viaud

Botrytis / Sclerotinia Genome project consortium