biomedical ontology tutorial_atlanta_june2011_part1
TRANSCRIPT
![Page 1: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/1.jpg)
How to Build a Biomedical Ontology
Success StoriesThe Gene Ontology (GO)
SNOMED, ICD and other controlled vocabulariesOntology Design Principles
Ontology Applications
Barry Smithhttp://ontology.buffalo.edu/smith
![Page 2: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/2.jpg)
Uses of ‘ontology’ in PubMed abstracts
2
![Page 3: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/3.jpg)
3
![Page 4: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/4.jpg)
By far the most successful: GO (Gene Ontology)
4
![Page 5: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/5.jpg)
5
![Page 6: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/6.jpg)
Hierarchical view of GO representing relations between represented types
6
![Page 7: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/7.jpg)
Gene Ontology
$100 mill. invested in literature and database curation using the Gene Ontology (GO)
based on the idea of annotation
over 11 million annotations relating gene products (proteins) described in the UniProt, Ensembl and other databases to terms in the GO
multiple secondary uses – because the ontology was not built to meet one specific set of requirements
7
![Page 8: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/8.jpg)
GO provides a controlled system of terms for use in annotating (describing, tagging)
data• multi-species, multi-disciplinary, open source
• contributing to the cumulativity of scientific results obtained by distinct research communities
• compare use of kilograms, meters, seconds in formulating experimental results
8
![Page 9: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/9.jpg)
Sample Gene Array Data
9
![Page 10: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/10.jpg)
where in the cell ?
what kind of molecular function ?
semantic annotation of data
what kind of biological process?
10
![Page 11: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/11.jpg)
natural language labels
to make the data cognitively accessible to human beings
11
![Page 12: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/12.jpg)
compare: legends for mapscompare: legends for maps
12
![Page 13: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/13.jpg)
compare: legends for diagrams
13
![Page 14: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/14.jpg)
ontologies are legends for data
14
![Page 15: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/15.jpg)
compare: legends for mapscompare: legends for maps
15
![Page 16: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/16.jpg)
ontologies are legends for images
16
![Page 17: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/17.jpg)
what lesion ?
what brain function ?
17
![Page 18: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/18.jpg)
ontologies are legends for databases
MouseEcotope GlyProt
DiabetInGene
GluChem
sphingolipid transporter
activity
18
![Page 19: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/19.jpg)
annotation using common ontologies yields integration of databases
MouseEcotope GlyProt
DiabetInGene
GluChem
Holliday junction helicase complex
19
![Page 20: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/20.jpg)
annotation using common ontologies can support comparison of data
20
![Page 21: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/21.jpg)
annotation with Gene Ontology
supports reusability of data
supports search of data by humans
supports comparison of data
supports aggregation of data
supports reasoning with data by humans and machines
21
![Page 22: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/22.jpg)
22
![Page 23: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/23.jpg)
The goal: virtual science
• consistent (non-redundant) annotation
• cumulative (additive) annotation
yielding, by incremental steps, a virtual map of the entirety of reality that is accessible to computational reasoning
23
![Page 24: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/24.jpg)
This goal is realizable if we have a common ontology framework
data is retrievable
data is comparable
data is integratable
only to the degree that it is annotated using a common controlled vocabulary
– compare the role of seconds, meters, kilograms … in unifying science
24
![Page 25: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/25.jpg)
To achieve this end we have to engage in something like philosophy (?)
is this the right way to organize the top level of this portion of the GO?how does the top level of this ontology relate to the top levels of other, neighboring ontologies? 25
![Page 26: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/26.jpg)
Strategy for doing this
see the world as organized via types/universals/categories which are hierarchically organized
and in relation to which statements can be formulated which are universally true of all instances:
cell membrane part_of cell26
![Page 27: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/27.jpg)
Pleural Cavity
Pleural Cavity
Interlobar recess
Interlobar recess
Mesothelium of Pleura
Mesothelium of Pleura
Pleura(Wall of Sac)
Pleura(Wall of Sac)
VisceralPleura
VisceralPleura
Pleural SacPleural Sac
Parietal Pleura
Parietal Pleura
Anatomical SpaceAnatomical Space
OrganCavityOrganCavity
Serous SacCavity
Serous SacCavity
AnatomicalStructure
AnatomicalStructure
OrganOrgan
Serous SacSerous Sac
MediastinalPleura
MediastinalPleura
TissueTissue
Organ PartOrgan Part
Organ Subdivision
Organ Subdivision
Organ Component
Organ Component
Organ CavitySubdivision
Organ CavitySubdivision
Serous SacCavity
Subdivision
Serous SacCavity
Subdivision
part
_of
is_a
Foundational Model of Anatomy Ontology27
![Page 28: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/28.jpg)
siamese
mammal
cat
organism
substancespecies, genera
animal
instances
frog
28
![Page 29: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/29.jpg)
29
![Page 30: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/30.jpg)
with thanks to http://dbmotion.com 30
the problem of continuity of care: patients move around
![Page 31: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/31.jpg)
31
f
f
f
ff
synchronic and diachronic problems of semantic interoperability
(across space and across time)
f
![Page 32: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/32.jpg)
32
f
f
f
ff
how can we link EHR 1 to EHR 2 in a reliable, trustworthy, useful way, which
both systems can understand ?
f
EHR 1 EHR 2
![Page 33: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/33.jpg)
33
f
f
f
ff
the ideal solution: WHO International Classification of
Diseases
fICD
EHR 1 EHR 2
![Page 34: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/34.jpg)
ICDICDPRO: De facto US billing standardMultilanguageCON: De facto US billing standard (corrupts data)No definitions of terms, and so difficult to
judge accuracy of hierarchy and of codingInconsistent hierarchiesHard to reason with resultsHence few secondary uses e.g. for research
34
![Page 35: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/35.jpg)
ICD 11ICD 11The (ontology-based) planmultiple views including
◦billing ◦public health statistics◦research
◦SNOMED compatibility
35
![Page 36: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/36.jpg)
36
f
f
f
ff
the ideal solution: a single universal clinical vocabulary
fSNOMED-CT
EHR 1 EHR 2
![Page 37: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/37.jpg)
SNOMED CT: SNOMED CT: Systematized Nomenclature of Systematized Nomenclature of Medicine-Clinical TermsMedicine-Clinical Terms
PRO:International standard (sort of)Huge resourceFree for member countriesMulti-language (including Spanish)
37
![Page 38: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/38.jpg)
SNOMED CTSNOMED CTCONHuge (but redundant ... and gappy)Contains many examples of false synonymyStill in need of work
◦ No consistent interpretation of relations◦ Many erroneous relation assertions◦ Many idiosyncratic relations◦ Mixes ontology with epistemology◦ It contains numerous compound terms (e.g., test for
X) without the constituent terms (here: X), even where the latter are of obvious salience
(38
![Page 39: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/39.jpg)
Coding with SNOMED-CT is unreliable and inconsistent
Multi-stage multi-committee process for adding terms that follows intuitive rules and not formal principles
Does there exist a strategy for evolutionary improvement?
39
SNOMED CT
![Page 40: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/40.jpg)
40
f
f
f
fanf
above all: SNOMED CT cannot solve the problem of continuity of care because it has
too much redundancy
f
EHR 1 EHR 2
SNOMED-CT
![Page 41: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/41.jpg)
41
f
f
f
fanf
AND because it is used only in certain countries
f
EHR 1 EHR 2
SNOMED-CT
![Page 42: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/42.jpg)
42
f
f
f
ff
link EHR 1 to EHR 2 through a snapshot of the patient’s condition which both systems
can understand
f Unified Medical Language System
(UMLS)
EHR 1 EHR 2
![Page 43: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/43.jpg)
Unified Medical Language Unified Medical Language System (UMLS)System (UMLS)
UMLS is not unified, not a language, not a system (and not only medical); it is an aggregation If we use something like UMLS as reference terminology, we will not solve the translation problem
43
EN
DE
![Page 44: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/44.jpg)
New York State Center of Excellence in Bioinformatics & Life Sciences
R T U New York State Center of Excellence in Bioinformatics & Life Sciences
R T U
UMLS approach to countering silo formation– By ‘linking between different clinical or biomedical
vocabularies’
– However: ‘… the Metathesaurus does not represent a comprehensive NLM-authored ontology of biomedicine or a single consistent view of the world. The Metathesaurus preserves the many views of the world present in its source vocabularies because these different views may be useful for different tasks.’
http://www.nlm.nih.gov/pubs/factsheets/umlsmeta.html
![Page 45: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/45.jpg)
New York State Center of Excellence in Bioinformatics & Life Sciences
R T U New York State Center of Excellence in Bioinformatics & Life Sciences
R T U
![Page 46: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/46.jpg)
Prospective standardization is a good thing
Prospective standardization is the only thing which will work in mission critical domains
Prospective standardization means that certain limits to tolerance must be imposed,
Need for top-down governance to ensure common architecture and resolution of border disputes in areas of overlap between domains
46
![Page 47: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/47.jpg)
Principles of Best Practice in Ontology Development
47
![Page 48: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/48.jpg)
Problem of ensuring sensible cooperation in a massively interdisciplinary community
Consider multiple uses of technical terms such as
− type− concept− instance− model− representation− data
48
![Page 49: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/49.jpg)
Three Levels
L3. Words, models (published representations, ontologies, databases ...)
L2. Ideas (concepts, thoughts, memories, ...)
L1. Things (cells, planets, processes of cell division ...)
49
![Page 50: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/50.jpg)
Entity =def
anything which exists, including things and processes, functions and qualities, beliefs and actions, documents and software
(entities on levels 1, 2 and 3)
50
![Page 51: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/51.jpg)
First basic distinction among entities
type vs. instance
(science text vs. diary)
(human being vs. Tom Cruise)
51
![Page 52: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/52.jpg)
For ontologies
it is generalizations that are important = types, universals,
kinds, species
52
![Page 53: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/53.jpg)
A 515287 DC3300 Dust Collector Fan
B 521683 Gilmer Belt
C 521682 Motor Drive Belt
Catalog vs. inventory
53
![Page 54: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/54.jpg)
An ontology is a representation of types
We learn about types in reality from looking at the results of scientific experiments in the form of scientific theories
experiments relate to what is particular science describes what is general
54
![Page 55: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/55.jpg)
Ontology =def.
a representational artifact whose representational units (which may be drawn from a natural or from some formalized language) are intended to represent
1. types in reality
2. those relations between these types which obtain universally (= for all instances)
lung is_a anatomical structure
lobe of lung part_of lung
in accordance with our best current established science
55
![Page 56: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/56.jpg)
siamese
mammal
cat
organism
objecttypes
animal
frog
instances56
![Page 57: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/57.jpg)
Domain =def
a portion of reality that forms the subject-matter of a single science or technology or mode of study or administrative practice:
proteomics
epidemiology
C2
M&S
57
![Page 58: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/58.jpg)
Representation =def
an image, idea, map, picture, name or description ... of some entity or entities.
58
![Page 59: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/59.jpg)
Ontologies are representational artifacts
comparable to science textsand subject to the same sorts of
constraints (including need for update)
59
![Page 60: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/60.jpg)
Representational units =def
terms, icons, alphanumeric identifiers ... which refer, or are intended to refer, to entities
and which are minimal (atoms)
60
![Page 61: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/61.jpg)
Composite representation =def
representation
(1) built out of representational units
which
(2) form a structure that mirrors, or is intended to mirror, the entities in some domain
61
![Page 62: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/62.jpg)
Periodic Table
The Periodic Table
62
![Page 63: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/63.jpg)
Ontologies are here
63
![Page 64: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/64.jpg)
or here
64
![Page 65: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/65.jpg)
Ontologies represent general structures in reality (leg)
65
![Page 66: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/66.jpg)
Ontologies do not represent concepts in people’s heads
66
![Page 67: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/67.jpg)
They represent types in reality
67
![Page 68: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/68.jpg)
How do we know which general terms designate types?
Types are repeatables:
cell, electron, weapon, F16 ...
Instances are one-off:
Bill Clinton, this laptop, this handwave
68
![Page 69: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/69.jpg)
Problem
The same general term can be used to refer both to types and to collections of particulars. Consider:
HIV is an infectious retrovirus
HIV is spreading very rapidly through Asia
69
![Page 70: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/70.jpg)
Class =def
a maximal collection of particulars determined by a general term (‘cell’, ‘electron’ but also: ‘ ‘restaurant in Palo Alto’, ‘Italian’)
the class A = the collection of all particulars x for which ‘x is A’ is true
70
![Page 71: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/71.jpg)
types vs. their extensions
types
{a,b,c,...} collections of particulars
71
![Page 72: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/72.jpg)
Extension
=def The extension of a type is the class of its instances
72
![Page 73: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/73.jpg)
types vs. classes
types
{c,d,e,...} classes
73
![Page 74: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/74.jpg)
types vs. classes
compare: ‘natural kinds’
types
extensions other sorts of classes
74
![Page 75: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/75.jpg)
types vs. classes
types
populations, ...
the class of all diabetic patients in Leipzig on 4 June 1952
75
![Page 76: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/76.jpg)
OWL is a good representation of classes
• F16s
• sibling of Finnish spy
• member of Abba aged > 50 years
76
![Page 77: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/77.jpg)
types, classes, concepts
types
classes
‘concepts’ ?
77
![Page 78: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/78.jpg)
types < classes < ‘concepts’ ?
Cases of ‘concepts’ which, some people say, do not correspond to classes:
‘Cancelled oophorectomy’‘Absent nipple’‘Unlocalized ligand’
A cancelled oophorectomy is not a special kind of conceptual oophorectory
Use: Information Artifact Ontology (IAO)
78
![Page 79: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/79.jpg)
Principle of Low Hanging Fruit
Include even absolutely trivial assertions (assertions you know to be universally true)
pneumococcal virus is_a virus
Computers need to be led by the hand
79
![Page 80: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/80.jpg)
Example: MeSH
MeSH Descriptors Index Medicus Descriptor Anthropology, Education, Sociology and Social Phenomena (MeSH Category) Social Sciences Political Systems National Socialism
National Socialism is_a Political SystemsNational Socialism is_a Anthropology ...
80
![Page 81: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/81.jpg)
Principle of Singular Nouns
Terms in ontologies represent types
Goal: Each term in an ontology should represent exactly one type
Thus every term should be a singular noun
81
![Page 82: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/82.jpg)
Principle: do not commit the use-mention confusion
mouse =def. common name for the species mus musculus
swimming is healthy and has eight letters
82
![Page 83: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/83.jpg)
Principle: do not commit the use-mention confusion
Avoid confusing between words and things
Avoid confusing between concepts in our minds and entities in reality
Recommendation: avoid the word ‘concept’ entirely
83
![Page 84: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/84.jpg)
Trialbank
‘information’ = def. ‘a written or spoken designation of a concept’
84
![Page 85: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/85.jpg)
‘Heparin therapy’ is an instance of ‘written or spoken designation of a concept’
What are the problems here?
1. misuse of quotation marks
2. confusion of instances and types
3. confusion of concept and reality
Trialbank
85
![Page 86: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/86.jpg)
Principle: beware of terminological baggage
For the sake of interoperability with other ontologies, do not give special meanings to terms with established general meanings
(Don’t use ‘cell’ when you mean ‘plant cell’)
86
![Page 87: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/87.jpg)
ICNP: International Classification of Nursing Procedures (old version)
water =def. a type of Nursing Phenomenon of Physical Environment with the specific characteristics: clear liquid compound of hydrogen and oxygen that is essential for most plant and animal life influencing life and development of human beings.
87
![Page 88: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/88.jpg)
Principle of definitions
Supply definitions for every term
1.human-understandable natural language definition
2.an equivalent formal definition
88
![Page 89: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/89.jpg)
Principle: definitions must be unique
Each term should have exactly one definition
it may have both natural-language and formal versions
(issue with ontologies which exist with different levels of expressivity)
89
![Page 90: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/90.jpg)
The Problem of Circularity
A Person =def. A person with an identity document
Hemolysis =def. The causes of hemolysis
90
![Page 91: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/91.jpg)
Principle of non-circularity
The term defined should not appear in its own definition
91
![Page 92: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/92.jpg)
Example: HL7
‘stopping a medication’ = def.
change of state in the record of a Substance Administration Act from Active to Aborted
92
![Page 93: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/93.jpg)
Principle of Increase in Understandability
A definition should use only terms which are easier to understand than the term defined
Definitions should not make simple things more difficult than they are
93
![Page 94: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/94.jpg)
Generalized Tarski principle (a good, general constraint on a
theory of meaning)
For each linguistic expression ‘E’
‘E’ means E
‘snow’ means: snow
‘pneumonia’ means: pneumonia
94
![Page 95: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/95.jpg)
HL7 Reference Information Model
‘medication’ does not mean: medication
rather it means:
the record of medication in an information system
‘disease’ does not mean: disease
rather it means:
the observation of a disease
95
![Page 96: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/96.jpg)
Principle of Acknowledging Primitives
In every ontology some terms and some relations are primitive = they cannot be defined (on pain of infinite regress)
Examples of primitive relations:
identity
instance_of
96
![Page 97: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/97.jpg)
Principle of Aristotelian Definitions
Use Aristotelian definitions
An A is a B which C’s.
A human being is an animal which is rational
97
![Page 98: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/98.jpg)
Rules for Formulating Terms
Avoid abbreviations even when it is clear in context what they mean (‘breast’ for ‘breast tumor’)
Avoid acronymsAvoid mass terms (‘tissue’, ‘brain mapping’,
‘clinical research’ ...)Treat each term ‘A’ in an ontology is
shorthand for a term of the form ‘the type A’
98
![Page 99: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/99.jpg)
Univocity Terms should have the same meanings on
every occasion of use.
(= They should refer to the same types)
Basic ontological relations such as is_a and part_of should be used in the same way by all ontologies
99
![Page 100: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/100.jpg)
Universality
Ontologies are made of relational assertions
They should include only those which hold universally
100
![Page 101: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/101.jpg)
Universality
Often, order will matter:
We can assert
adult transformation_of child
but not
child transforms_into adult
101
![Page 102: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/102.jpg)
Universality
viral pneumonia caused by virus
but not
virus causes pneumonia
pneumococcal virus causes pneumonia
102
![Page 103: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/103.jpg)
Principle of Universality
results analysis later_than protocol-design
but not
protocol-design earlier_than results analysis
103
![Page 104: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/104.jpg)
Principle of PositivityComplements of types are not themselves types.
Terms such as
non-mammal non-membrane other metalworker in New Zealand
do not designate types in reality
104
![Page 105: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/105.jpg)
Generalized Anti-Boolean Principle
There are no conjunctive and disjunctive types:
anatomic structure, system, or substance
musculoskeletal and connective tissue disorder
105
![Page 106: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/106.jpg)
Objectivity
Which types exist in reality is not a function of our knowledge.
Terms such as
unknown
unclassified
unlocalized
arthropathies not otherwise specified
do not designate types in reality.106
![Page 107: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/107.jpg)
Keep Epistemology Separate from Ontology
If you want to say that
We do not know where A’s are located
do not invent a new class of
A’s with unknown locations
(A well-constructed ontology should grow linearly; it should not need to delete classes or relations because of increases in knowledge)
107
![Page 108: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/108.jpg)
If you want to say
I surmise that this is a case of pneumonia
do not invent a new class of surmised pneumonias
Confusion of ‘findings’ in medical terminologies
Keep Sentences Separate from Terms
108
![Page 109: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/109.jpg)
Single Inheritance
No kind in a classificatory hierarchy should be asserted to have more than one is_a parent on the immediate higher level
109
![Page 110: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/110.jpg)
Multiple Inheritance
thing
carblue thing
blue car
is_a is_a
110
![Page 111: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/111.jpg)
Multiple Inheritance
is a source of errors
encourages laziness
serves as obstacle to integration with neighboring ontologies
hampers use of Aristotelian methodology for defining terms
hampers use of statistical search tools
111
![Page 112: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/112.jpg)
Multiple Inheritance
thing
carblue thing
blue car
is_a1 is_a2
112
![Page 113: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/113.jpg)
Principle of asserted single inheritance
Each reference ontology module should be built as an asserted monohierarchy (a hierarchy in which each term has at most one parent)
Asserted hierarchy vs. inferred hierarchy
113
![Page 114: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/114.jpg)
Principle of normalization
Polyhierarchies should be decomposable into homogeneous disjoint monohierarchies
114
![Page 115: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/115.jpg)
Principle of instantiability
A term should be included in an ontology only if there is evidence that instances to which that term refers exist or have existed or can exist in reality.
Fist
Crowd
115
![Page 116: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/116.jpg)
Avoid mass nouns
Count nouns = an organism, a planet, a handshake
Mass nouns = tissue, information, discourse
Mass nouns almost always go hand in hand with ontological confusion
116
![Page 117: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/117.jpg)
is_a Overloading
The success of ontology alignment demands that ontological relations (is_a, part_of, ...) have the same meanings in the different ontologies to be aligned.
117
![Page 118: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/118.jpg)
Multiple Inheritance
thing
carblue thing
blue car
is_a1 is_a2
118
![Page 119: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/119.jpg)
How to solve this problem
Create two ontologies:
of cars
of colors
Link the two together via cross-products
(= factoring, normalization, modularization)
119
![Page 120: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/120.jpg)
Compositionality
The meanings of compound terms should be determined
1. by the meanings of component terms
together with
2. the rules governing syntax
120
![Page 121: Biomedical ontology tutorial_atlanta_june2011_part1](https://reader035.vdocuments.mx/reader035/viewer/2022062706/557c117bd8b42af2418b4923/html5/thumbnails/121.jpg)
User feedback principle
An ontology should evolve on the basis of feedback derived from those who are using the ontology for example for purposes in annotation.
121