ontologies go
TRANSCRIPT
-
8/7/2019 Ontologies GO
1/38
Sheena Scroggins
BI 7553
Fall 2010
-
8/7/2019 Ontologies GO
2/38
Ontologies
An ontology is a shared representation
of a subject area, in which terms andhierarchical relationships among them
are defined
-
8/7/2019 Ontologies GO
3/38
Gene Ontology Project
A collaboration between many
databases to ensure consistent
descriptions of gene products.
Started with Flybase, (Drosophila),
Saccharomyces Genome Database
(SGD) and Mouse Genome Database(MGD), but now includes many more
-
8/7/2019 Ontologies GO
4/38
GO Project Goals
Develop and maintain
vocabulary of gene and gene
product characteristics Annotate genes and gene products,
curate and publish data
Provide tools to facilitate access to alldata provided by the GO Project.
-
8/7/2019 Ontologies GO
5/38
GO-Vocabulary
Used to describe 3 characteristics of
proteins:
y Cellular component
y Biological Process
y Molecular Function
Protein or gene may participate in more
than one molecular function, biologicalprocess, and cellular component.
-
8/7/2019 Ontologies GO
6/38
GO-Vocabulary
Cellular Component
y
Describes locations of genes and proteins. Anatomical structures: ie cytoplasm or
nucleus
Gene product group: ie ribosome or ubiquitin
ligase complex
-
8/7/2019 Ontologies GO
7/38
GO-Vocabulary
Biological Process
y
The overall goal of the gene product Terms for broad processes: ie mitosis or
signal transduction
Terms for specific processes: ie pyrimidine
metabolic process
-
8/7/2019 Ontologies GO
8/38
GO-Vocabulary
Molecular Function
y The function of the gene products:
Terms for broad functions: ie catalytic activity
or hydrolase activity
Terms for more specific functions: ie Toll
receptor binding or protein-arginine deiminase
activity
Represent activities of proteins anddoes not specify where or when, or in
what context, the action takes place.
-
8/7/2019 Ontologies GO
9/38
GO-Vocabulary
Biological Process Molecular Function
A process must havemore than one distinct
step
Larger picture
description
Many molecular
functions have theword activity added to
them to help
distinguish from
biological process
Smaller picture
description
-
8/7/2019 Ontologies GO
10/38
GO Project
Develop and maintain vocabulary of
gene and gene product characteristics
Annotate genes and geneproducts, curate and publish
data
Provide tools to facilitate access to alldata provided by the GO Project.
-
8/7/2019 Ontologies GO
11/38
GO Annotation
Using GO terms to describe the
activities and localization of a gene
product, providing references and
indicating what evidence supports the
annotations
-
8/7/2019 Ontologies GO
12/38
GO Annotation
Includes gene product identifier, GO
terms, and the following data:
y The reference used to make the annotation
y An evidence code denoting the type of
evidence upon which the annotation is
basedy The date and the creator of the annotation
-
8/7/2019 Ontologies GO
13/38
GO Annotation
Experimental Evidence Codesy Cited paper displayed results from a
physical characterization of a gene or gene
product that supported the association of aGO term.
Inferred from Experiment (EXP)
Inferred from Direct Assay (IDA)
Inferred from Physical Interaction (IPI)
Inferred from Mutant Phenotype (IMP)
Inferred from Genetic Interaction (IGI)
Inferred from Expression Pattern (IEP)
-
8/7/2019 Ontologies GO
14/38
GO Annotation
ComputationalAnalysis Evidence Codesy Annotation based on an in silico analysis of the
gene sequence and/or other data as describedin the cited reference.
Inferred from Sequence or structural Similarity(ISS)
Inferred from Sequence Orthology (ISO)
Inferred from Sequence (ISA)
Inferred from Sequence Model (ISM) Inferred from Genomic Context (IGC)
inferred from Reviewed Computational Analysis(RCA)
-
8/7/2019 Ontologies GO
15/38
GO Annotation
Author Statement Evidence Codes
y
Annotation made on the basis of astatement made by the author(s) in the
reference cited.
Traceable Author Statement (TAS)
Non-traceable Author Statement (NAS)
-
8/7/2019 Ontologies GO
16/38
GO Annotation
Curatorial Statement Evidence Codes
y
Annotation made on the basis of a curatorialjudgment that does not fit into one of the
other evidence code classifications.
Inferred by Curator (IC)
No biological Data available (ND) evidence
code
-
8/7/2019 Ontologies GO
17/38
GO Project
Develop and maintain vocabulary of
gene and gene product characteristics
Annotate genes and gene products,
curate and publish data
Provide tools to facilitate
access to all data provided by
the GO Project.
-
8/7/2019 Ontologies GO
18/38
GO Tools
There are lots of tools for using the data
provided by the GO Project, including
online and downloadable versions.
The GO Consortium works directly with
two tools:
-
8/7/2019 Ontologies GO
19/38
AmiGo is the application used to searchthe Gene Ontology database.
Available online and as a download
-
8/7/2019 Ontologies GO
20/38
-
8/7/2019 Ontologies GO
21/38
Search for a gene or gene product andview the GO term associations
-
8/7/2019 Ontologies GO
22/38
Gene Search in AmGO
-
8/7/2019 Ontologies GO
23/38
Perform a BLAST search and view theGO term associations for the genes or
proteins returned
-
8/7/2019 Ontologies GO
24/38
BLAST Results in AmiGO
-
8/7/2019 Ontologies GO
25/38
Other uses:
y Search for GO terms and view the genes or
gene products they are annotated to
y Browse the GO ontology and view terms
y Term enrichment tool is used to discover
what a set of genes may have in common by
examining annotations and finding
significant shared GO terms.
-
8/7/2019 Ontologies GO
26/38
An open source ontology editor developed and
maintained by the Gene Ontology Consortium. Features:
y Comprehensive search and filter interface
y Interface can be customized to user preferences.
y Reasoner tool that can infer links based on existingrelationships and their properties.
Can be used to view, search and edit anyontology, not just the Gene Ontology
-
8/7/2019 Ontologies GO
27/38
Viewing Ontologies in EDIT
OBO-Edit uses a graph-based approach for
ontology editing
-
8/7/2019 Ontologies GO
28/38
Default displays anontology in tree view.Each line in the treeview represents arelationship between
terms, with an arrowindicating the directionof the relationship.A term in the tree viewmay be expanded to
show its childrelationships
FIGURE 1-A
Day-Richter J, et al.
OBO-Edit - an ontology editor for biologists.
Bioinformatics 2007;23:2198-2200.
-
8/7/2019 Ontologies GO
29/38
The DAG Viewer
plugin displaysevery path from a
selected term to the
ontology root
FIGURE 1-A
Day-Richter J, et al.
OBO-Edit - an ontology editor for biologists.
Bioinformatics 2007;23:2198-2200.
-
8/7/2019 Ontologies GO
30/38
Editing Ontologies in
OBO-Edit presents structural ontology
editing operations, such as adding new
terms or new relationships, to the user
as graph editing operations.
All edits in OBO-Edit are tracked,
allowing users to undo any change they
make during an editing session.
-
8/7/2019 Ontologies GO
31/38
The Ontology Editor
serves as the editinginterface for adding
or removing terms
and relationships.
FIGURE 1-ADay-Richter J, et al.
OBO-Edit - an ontology editor for biologists.
Bioinformatics 2007;23:2198-2200.
-
8/7/2019 Ontologies GO
32/38
The Term Editor panel
allows users to editthe term name,
definition, synonyms
and other textual
metadata
FIGURE 1-B
Day-Richter J, et al.
OBO-Edit - an ontology editor for biologists.
Bioinformatics 2007;23:2198-2200.
-
8/7/2019 Ontologies GO
33/38
Searching & Filtering in
The search facility in OBO-Edit is built
around filters.
y Term attribute (such as name, definition)
y Comparison (equals, starts with, is less than,
etc.)
y Value.
Compound filters combine a number offilters together using the Boolean
operators AND and OR.
-
8/7/2019 Ontologies GO
34/38
Criteria
specified in the
filter interface
can be used tosearch for an
ontology.
FIGURE 1-C Day-Richter J, et al.
OBO-Edit - an ontology editor for biologists.
Bioinformatics 2007;23:2198-2200.
-
8/7/2019 Ontologies GO
35/38
GO Review
Provide vocabulary for gene and gene
product descriptions, annotations and
tools.
AmiGO- Application used to search
the GO databases
OBO-Edit- Application to view
and edit ontologies, including the GeneOntology
http://www.geneontology.org/
-
8/7/2019 Ontologies GO
36/38
What GO Is Not
GO is not a database of gene sequences, nor acatalog of gene products.
GO describes how gene products behave in acellular context.
Database lists attributes of genes and geneproducts, not the genes or products themselves.y e.g. cytochrome c is not in the ontologies, but attributes of
cytochrome c, such as oxidoreductase activity, are.
Processes, functions or components that areunique to mutants or diseases are not includedy e.g. oncogenesis is not a valid GO term because causing
cancer is not the normal function of any gene.
-
8/7/2019 Ontologies GO
37/38
What GO Is Not
Also not included:
y Attributes of sequence such as intron/exon
parameters
y Protein domains or structural features
y Protein-protein interactions.
y Environment, evolution and expression.
y
Anatomical or histological features abovethe level of cellular components, including
cell types.
-
8/7/2019 Ontologies GO
38/38
References
http://www.geneontology.org/
http://wiki.geneontology.org/index.php/AmiGO_Manual:_Overview
http://oboedit.org/docs/index.htm Pevsner, J. Bioinformatics and
Functional Genomics, 2009. Chp 10
Day-Richter J, et al.
OBO-Edit - anontology editor for biologists.Bioinformatics 2007;23:2198-2200.