biomedical ontologies what are they (for) ? stefan schulz medical informatics research group...

97
Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Upload: anthony-reynolds

Post on 25-Dec-2015

225 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Biomedical Ontologies

What are they (for) ?

Stefan Schulz

Medical Informatics

Research Group

UniversityMedical Center

Freiburg, Germany

Page 2: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Understanding / Semantic Interoperability

HealthCare

PublicHealth

BiomedicalResearch

Consumers

Enables understanding

between human and computational

agents

Common language: Ontologies and Terminology Systems

data

data data

datadata

Page 3: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Ontologies and Terminology Systems

aka Knowledge Organization Systems: Systems that support semantic interoperability by communicating and processing information: In a structured form Well-defined Unambiguous Processable by machines Understandable by humans

Life Sciences: major focus for the development of ontologies and terminological systems

Page 4: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

0

500

1000

1500

2000

2500

Terminolog*

Ontolog*

Literature on Biomedical Terminologies and Ontologies

Page 5: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Purpose of this Talk

What are Ontologies

What are they for ?

Formal

Page 6: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 7: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 8: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

A cruise through the archipelago of systems for biomedical knowledge organization

GO

ChEBI

MA

FBcv

FMA

WordNet

ICD

GALEN

SNOMED FAO

GENIA

NCI

MeSHTA

BRENDA

GROCL MedDRA

Page 9: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

MeSH: Medical Subject Headings

MeSHMedical Subject Headings

Page 10: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 11: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchical principle: broader term / narrower

term(not a taxonomy)

Page 12: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 13: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 14: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

MeSH: Medical Subject Headings

GOGene Ontology

Page 15: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 16: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 17: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Part of

(partonomy)

Is a

(taxonomy)

Page 18: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

MeSH: Medical Subject HeadingsICD

International Classification of Diseases

Page 19: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 20: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 21: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Class / subclassRelation

(is_a)

Page 22: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

MeSH: Medical Subject HeadingsSNOMED

Clinical Terms

Page 23: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany
Page 24: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

SNOMED CT Facts (I)

SNOMED CT is a terminology consisting of terms used in health & health care, attached to concept codes with multiple terms per

code structured according to logic-based representation

of meanings increasingly guided by ontological principles Current size:

283,000 Concepts 732,000 Terms 923,000 Concept – Concept Relations

Page 25: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

SNOMED CT Facts (II)

Since 2007: Maintained by IHTSDO (International Health Terminology standards development organization)

Members: Australia, Canada, Denmark, Lithuania, The Netherlands, New Zealand, Sweden, UK, USA.

Annual budget ~ 5 M€

Page 26: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Different Purposes – Heterogeneous Approaches

Page 27: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Different Purposes – Heterogeneous Approaches

MeSH [Medical Subject Headings]: Hierarchy (broader / narrower) of descriptors, used for indexing biomedical publications for literature retrieval support

GO [Gene Ontology]:Hierarchy (is_a / part_of) of controlled terms for describing gene an gene product properties

ICD [International Classification of Diseases]:Strict Hierarchy of non-overlapping classes for classifying statistically relevant health conditions

SNOMED CT [Systematized Nomenclature of Medicine – Clinical Terms ]:Hierarchical system of concepts with (partially) logic-based concept definitions

Page 28: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Other Biomedical Knowledge Organization Systems: Medicine

AI/RHEUMAlcohol and Other Drug ThesaurusAlternative Billing ConceptsBeth Israel VocabularyCanonical Clinical Problem Statement SystemClinical Classifications SoftwareClinical Terms Version 3 (CTV3) (Read Codes)Common Terminology Criteria for Adverse EventsCOSTARCOSTARTCRISP ThesaurusCurrent Dental Terminology 2005 (CDT-5)Current Procedural TerminologyDiseases DatabaseDSM-III-RDSM-IVDXplainGene OntologyHCPCS Version of Current Dental Terminology 2005 (CDT-5)HCPCS Version of Current Procedural Terminology (CPT)Healthcare Common Procedure Coding SystemHL7 Vocabulary Version 2.5HL7 Vocabulary Version 3.0Home Health Care ClassificationHUGO Gene NomenclatureICD10ICD-9-CMICPCICPC2 - ICD10 ThesaurusICPC2-ICD10 Thesaurus

International Classification of Primary CareInternational Classification of Primary Care 2nd EditionInternational Statistical Classification of Diseases and Related Health ProblemsJAMAS Japanese Medical Thesaurus (JJMT)Library of Congress Subject HeadingsLOINC 2.15Master Drug Data BaseMcMaster University Epidemiology TermsMedical Dictionary for Regulatory Activities Terminology (MedDRA)Medical Entities DictionaryMedical Subject HeadingsMEDLINE (1996-2000)MEDLINE (2001-2006)MedlinePlus Health Topics_2004_08_14Micromedex DRUGDEXMultum MediSource LexiconNANDA nursing diagnoses: definitions & classificationNational Drug Data File Plus Source VocabularyNational Drug File - Reference TerminologyNational Library of Medicine Medline DataNCBI Taxonomy

NCI SEER ICD Neoplasm Code MappingsNCI ThesaurusNeuronames Brain HierarchyNursing Interventions ClassificationNursing Outcomes ClassificationOmaha SystemOnline Congenital Multiple Anomaly/Mental Retardation SyndromesOnline Mendelian Inheritance in ManPatient Care Data SetPerioperative Nursing Data SetPharmacy Practice Activity ClassificationPhysician Data QueryPhysicians' Current Procedural TerminologyQuick Medical Reference (QMR)Read thesaurusRead thesaurus Americanized Synthesized TermsRXNORM ProjectSNOMED-2SNOMED Clinical TermsSNOMED InternationalStandard Product NomenclatureThesaurus of Psychological Index TermsThe Universal Medical Device Nomenclature System (UMDNS)UltraSTARUMLS MetathesaurusUniversity of Washington Digital AnatomistUSP Model GuidelinesVeterans Health Administration National Drug FileWHO Adverse Reaction TerminologyWHOART

Source: UMLS

Page 29: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Other Biomedical Knowledge Organization Systems: Biology (OBO)

Page 30: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 31: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Unresolved Terminological Confusion…

Knowledge Organization Systems: artifacts for ordering domain entities, relating word meanings or providing semantic reference:

Vocabularies

Terminologies

Thesauri

Concept Systems

Classifications

(Formal) Ontologies

Page 32: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Different scientific traditions:

Biology, Medicine, Philosophy, Logic,

Linguistics, Library and Information

Science, Computer Science, Cognitive

Science, International Terminology norms

Different philosophical schools of thinking:

Platonism, Aristotelian Realism,

Conceptualism, Relativism, Idealism,

Postmodernism, Constructivism,

Nominalism, Tropism,…

Unresolved Terminological Confusion…

Page 33: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Components of Knowledge Organization Systems

Hierarchically orderedNodes and Links

Formal or informal Definitions

domain or region of DNA [GENIA]: A substructure of DNA molecule which is supposed to have a particular function, such as a gene, e.g., c-jun gene, promoter region, Sp1 site, CA repeat. This class also includes a base sequence that has a particular function.

Peptides [MeSH]: Members of the class of compounds composed of AMINO ACIDS joined together by peptide bonds between adjacent amino acids into linear, branched or cyclical structures. OLIGOPEPTIDES are composed of approximately 2-12 amino acids. Polypeptides are composed of approximately 13 or more amino acids. PROTEINS are linear polypeptides that are normally synthesized on RIBOSOMES.

19429009|chronic ulcer of skin|116680003|is a|=64572001|disease| {116676008|associated morphology|= 405719001|chronic ulcer| 363698007|finding site|= 39937001|skin structure|}

Dictionaries of Natural language Terms

• Benign neoplasm of heart• Benign tumor of heart• Benign tumour of heart• Benign cardiac neoplasm• Gutartiger Herzumor• Gutartige Neubildung am

Herzen• Gutartige Neubildung:

Herz• Gutartige Neoplasie des

Herzens• Tumeur bénigne

cardiaque• Tumeur bénigne du cœur• Neoplasia cardíaca

benigna• Neoplasia benigna do

coração• Neoplasia benigna del

corazón• Tumor benigno do

corazón

Page 34: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

What do the nodes in Formal Ontologies / Terminological Systems stand for?

conceptsconcepts

classesclasses

entitiesentities

categoriescategories

termsterms

setssets

synsetssynsets

universalsuniversals

sortssorts

propertiesproperties

typestypes

descriptorsdescriptors

descriptorsdescriptors

namesnames

Page 35: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Ontology: Gradient or crisp boundary ?

Terminology Ontology

Information Model

Page 36: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Ontology: Gradient or crisp boundary ?

TerminologyFormal

Ontology

Information Model

Page 37: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Organizing the world

TerminologyFormal

Ontology

Set of terms

representing the system

of concepts of a

particular subject field. 

(ISO 1087)

Ontology is the study of what there is. Formal

ontologies are theories that attempt to give precise

mathematical formulations of the properties and

relations of certain entities. (Stanford Encyclopedia of

Philosophy)

bla bla bla

Page 38: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 39: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Terminologies start with human language

TerminologyFormal

Ontology

Set of terms

representing the system

of concepts of a

particular subject field. 

(ISO 1087)

Ontology is the study of what there is. Formal

ontologies are theories that attempt to give precise

mathematical formulations of the properties and

relations of certain entities. (Stanford Encyclopedia of

Philosophy)

bla bla bla

Page 40: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Semantic Reference

Entities of Language(Terms)

„benign neoplasm of heart“ „gutartige Neubildung des Herzmuskels”“neoplasia cardíaca benigna”

Shared / Meanings / Entities of Thought (Concepts)

Page 41: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Example: UMLS (mrconso table)

C0153957|ENG|P|L0180790|PF|S1084242|Y|A1141630||||MTH|PN|U001287|benign neoplasm of heart|0|N||

C0153957|ENG|P|L0180790|VC|S0245316|N|A0270815||||ICD9CM|PT| 212.7|Benign neoplasm of heart|0|N||

C0153957|ENG|P|L0180790|VC|S0245316|N|A0270817||||RCD|SY|B727.| Benign neoplasm of heart|3|N||

C0153957|ENG|P|L0180790|VO|S1446737|Y|A1406658||||SNMI|PT| D3-F0100|Benign neoplasm of heart, NOS|3|N||

C0153957|ENG|S|L0524277|PF|S0599118|N|A0654589||||RCDAE|PT|B727.|Benign tumor of heart|3|N||

C0153957|ENG|S|L0524277|VO|S0599510|N|A0654975||||RCD|PT|B727.| Benign tumour of heart|3|N||

C0153957|ENG|S|L0018787|PF|S0047194|Y|A0066366||||ICD10|PS|D15.1|Heart|3|Y||

C0153957|ENG|S|L0018787|VO|S0900815|Y|A0957792||||MTH|MM|U003158|Heart <3>|0|Y||

C0153957|ENG|S|L1371329|PF|S1624801|N|A1583056|||10004245|MDR|LT|10004245|Benign cardiac neoplasm|3|N||

C0153957|GER|P|L1258174|PF|S1500120|Y|A1450314||||DMDICD10|PT| D15.1|Gutartige Neubildung: Herz|1|N||

C0153957|SPA|P|L2354284|PF|S2790139|N|A2809706||||MDRSPA|LT| 10004245|Neoplasia cardiaca benigna|3|N||

Entities of Language(Terms)

Shared / Meanings / Entities of Thought

Unified Medical Language System, Bethesda, MD: National Library of Medicine, 2007: http://umlsinfo.nlm.nih.gov/

Page 42: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Semantic relations

Example: UMLS (mrrel table)

C0153957|A0066366|AUI|PAR|C0348423|A0876682|AUI | |R06101405||ICD10|ICD10|||N||

C0153957|A0066366|AUI|RQ |C0153957|A0270815|AUI |default_mapped_ from|R03575929||NCISEER|NCISEER|||N||

C0153957|A0066366|AUI|SY |C0153957|A0270815|AUI |uniquely_mapped_ to |R03581228||NCISEER|NCISEER|||N||

C0153957|A0270815|AUI|RQ |C0810249|A1739601|AUI |classifies | R00860638||CCS|CCS|||N||

C0153957|A0270815|AUI|SIB|C0347243|A0654158|AUI | |R06390094 || ICD9CM|ICD9CM||N|N||

C0153957|A0270815|CODE|RN|C0685118|A3807697|SCUI |mapped_to | R15864842||SNOMEDCT|SNOMEDCT||Y|N||

C0153957|A1406658|AUI|RL |C0153957|A0270815|AUI |mapped_from | R04145423||SNMI|SNMI|||N||

C0153957|A1406658|AUI|RO |C0018787|A0357988|AUI |location_of | R04309461||SNMI|SNMI|||N||

C0153957|A2891769|SCUI|CHD|C0151241|A2890143|SCUI|isa |R19841220|47189027|SNOMEDCT|SNOMEDCT|0|Y|N||

Shared / Meanings / Entities of Thought

Shared / Meanings / Entities of Thought

Page 43: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Example: UMLS

C0153957|A0066366|AUI|PAR|C0348423|A0876682|AUI | |R06101405||ICD10|ICD10|||N||

C0153957|A0066366|AUI|RQ |C0153957|A0270815|AUI |default_mapped_ from|R03575929||NCISEER|NCISEER|||N||

C0153957|A0066366|AUI|SY |C0153957|A0270815|AUI |uniquely_mapped_ to |R03581228||NCISEER|NCISEER|||N||

C0153957|A0270815|AUI|RQ |C0810249|A1739601|AUI |classifies | R00860638||CCS|CCS|||N||

C0153957|A0270815|AUI|SIB|C0347243|A0654158|AUI | |R06390094 || ICD9CM|ICD9CM||N|N||

C0153957|A0270815|CODE|RN|C0685118|A3807697|SCUI |mapped_to | R15864842||SNOMEDCT|SNOMEDCT||Y|N||

C0153957|A1406658|AUI|RL |C0153957|A0270815|AUI |mapped_from | R04145423||SNMI|SNMI|||N||

C0153957|A1406658|AUI|RO |C0018787|A0357988|AUI |location_of | R04309461||SNMI|SNMI|||N||

C0153957|A2891769|SCUI|CHD|C0151241|A2890143|SCUI|isa |R19841220|47189027|SNOMEDCT|SNOMEDCT|0|Y|N||

Shared / Meanings / Entities of Thought

Shared / Meanings / Entities of Thought

Semantic relationsINFORMAL

Page 44: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Formal Ontology represents the world

TerminologyFormal

Ontology

Set of terms

representing the system

of concepts of a

particular subject field. 

(ISO 1087)

Ontology is the study of what there is (Quine). Formal ontologies are theories that attempt to give precise mathematical formulations of the properties and relations of certain entities. (Stanford Encyclopedia of Philosophy)

bla bla bla

Page 45: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Organizing Entities

Entity Types

The type “benign neoplasm of heart”

Entities of the World

My benign neoplasm of heart

Page 46: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Organizing Entities

Entity Types

Entities of the World

The benign neoplasm of my heart

The type “benign neoplasm of heart”

Instance_of

Universals, classes, (Concepts)

abstract

concrete

Particulars, instances

Page 47: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Organizing Entities

Entities of Language

Entity Types

Entities of the World

The string „benign neoplasm of heart“

The benign neoplasm of my heart

The type “benign neoplasm of heart”

Instance_of

Universals, classes, (Concepts)

abstract

concrete

Particulars, instances

Terms, names

represents

represents

Page 48: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Organizing Entities

represents

(die Komplikation

meines) Gutartigen

Herztumors

(the complication of my)

benign heart tumor

Page 49: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Organizing Entities

Terms, names

represents

(die Komplikation

meines) Gutartigen

Herztumors

(the) benign heart tumor

(is congenital)

Page 50: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Entities of Language …are stored in dictionaries

and represented by terminologies

Page 51: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Entities of the World

Database systems / information models store references to…

Page 52: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Entity Types

… are organized in formal ontologies

Page 53: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchical framework for entity types

Taxonomy: relates types and subtypes: Tumor of Heart is_a Tumor equivalent to:All instances of Tumor of Heart are instances of Tumor

(without exceptions)

Relations: instance_of relates instances with types, all others relate

instances (e.g. part_of) or are derived from them (e.g. is_a)

Definitions: describe what is always true for all instances of a type

Tumor of Heart has_location Heart : All instances of Tumor of Heart are located in some Heart

Page 54: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Type / Subtype Hierarchy

Tumorof Heart

Benign Tumor of Heart

Malignant Tumor of Heart

BenignTumor

Is_a Is_a Is_a

Page 55: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

A classification view on Formal Ontologies

World

Page 56: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

World

Page 57: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

World

Page 58: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

World

Type 1

Page 59: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

Formal Ontology

World

Is_a Is_a Is_a

Type 1

Subtype 1.2

Subtype 1.1

Subtype 1.3

Page 60: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

Formal OntologyInflammatory

Disease

Page 61: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

Formal Ontology

Is_aIs_a

Gastritis PacreatitisHepatitis

InflammatoryDisease

Is_a

Page 62: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

Formal Ontology

Is_aIs_a

Gastritis PacreatitisHepatitis

InflammatoryDisease

Is_a

Page 63: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Hierarchies, Types, Classes, Individuals

Formal Ontology

Is_aIs_a

Gastritis PacreatitisHepatitis

InflammatoryDisease

Is_a

Page 64: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Relations and Definitions

Formal Ontology

Liver

hasLocationHepatitis

InflammatoryDisease

Is_a

Page 65: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Relations and Definitions

Formal Ontology

Liver

hasLocationHepatitis

InflammatoryDisease

Is_a

Page 66: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Relations and Definitions

Formal Ontology

Liver

hasLocationHepatitis

InflammatoryDisease

Is_a

Viral HepatitisPopulation of Virus

Population

Is_a causedby

Page 67: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Languages for formal ontologies

x: instanceOf(x, Hepatitis) instanceOf(x, Inflammation) y: instanceOf(y, Liver) hasLocation(x,y)

“Every hepatitis is an inflammatory disease that is located in some liver” “Every inflammatory disease that is located in some liver is an hepatitis”

Natural Language

Logic

Logic is computable: it supports machine inferences but…

it only scales up if it has a very limited expressivity

Page 68: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 69: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Terminologies vs. Formal Ontologies

Describe: Meaning of human language units

“Concepts”: aggregate (quasi)-synonymous terms

Relations: informal, elastic Associations between Concepts ……..

Description pattern:Concept1 Relation Concept2

Describe: entities of reality as they generically are – independent of human language

“Types”: represent the generic properties of world entities

Relations: rigid, exactly defined, quantified relationships between particulars

Description pattern:for all instance of Type1 : there is some…

Terminologies Formal Ontologies

Page 70: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Example Hepatitis - Liver

Concept Hepatitis: {Hepatitis (D), Leberentzündung (D), hepatitis (E), hépatite (F)}

Concept Liver: {Leber (D), liver (E), foie (F)}

Relations: Hepatitis – hasLocation – Liver Hepatitis – isA - Inflammation

Type: Hepatitis:

Description:

”Every hepatitis is an inflammatory

disease that is located in some liver”

“Every inflammatory disease that is

located in some liver is an hepatitis”

Terminologies Formal Ontologies

Page 71: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Concept Hand: {Hand (D), hand (E), main (F)}

Concept Thumb: {Daumen (D), thumb (E), pouce (F)}

Relations: Hand – hasPart – Thumb Thumb – partOf – Hand

Type: Thumb:

Description:

?

Terminologies Formal Ontologies

Example Hand - Thumb

”Every thumb is part of some hand”

“Every hand has some thumb as

part”

Page 72: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Example Aspirin - Headache

Concept Aspirin: {Aspirin (D,E), Acetylsalicylsäure (D), ASS (D), acetylsalicylic acid (E), Acide acétylsalicylique(F)}

Concept Headache: {Kopfschmerz (D), headache (E), céphalée(F)}

Relation: Aspirin – treats – Headache

Type: Aspirin:

Description:

fuzzy complicated !

Terminologies Formal Ontologies

”For every portion of aspirin there is

some disposition for treating

headache”

Page 73: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Strengths of Formal Ontologies

Exact, logic-based descriptions of entity types that are instantiated by real-world objects, processes, states

Representation of stable, context-independent accounts of reality

Use of formal reasoning methods using tools and approaches from the AI / Semantic Web community

Page 74: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Deficit of Nomenclatures / Terminologies

D5-46210 Acute appendicitis, NOS

D5-46100 Appendicitis, NOSG-A231 Acute

M-41000 Acute inflammation, NOSG-C006 InT-59200 Appendix, NOS

G-A231 AcuteM-40000 InflammationG-C006 InT-59200 Appendix, NOS

SNOMED INTERNATIONAL

Page 75: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Formal-ontological descriptions: Advantages

Different description of the same thing can be automatically mapped to a canonic description by a logic-based reasoning device

Meaning of defined classes can be unambiguously expressed

Page 76: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Formal Ontologies: Limitations (I)

Only suitable to represent shared, uncontroversial meaning of a domain vocabulary

Supports universal statements about instances of a type: All Xs are Ys For all Xs there is some Y

Properties of types are properties of all entities that instantiate these types (strict inheritance)

Page 77: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Representation of context dependent knowledge „Allergic Rhinitis is a common disorder (in Europe)“

Representation of probabilistic knowledge „95% of people infected with viral hepatitis recover “ “Smoking is a cardiovascular risk factor”

Default / canonic knowledge „Adult humans have 32 teeth“

Dispositions: „Oxazepam is indicated for anxiety disorders” „Aspirin affects the gastric mucosa”

Ontology Knowledge Representation

Formal Ontologies: Limitations (II)

Page 78: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Continuum of knowledge

Universally accepted assertions

Consolidated but context-dependent facts

Hypotheses, beliefs, statistical associations

Domain Knowledge

Page 79: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Consolidated but context-dependent facts

Hypotheses, beliefs, statistical associations

Formal Ontology !

Domain Knowledge

Universally accepted assertions

Page 80: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 81: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Practice of Good Ontology

Learning good ontology practice from bad

ontologies…

Page 82: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Don’t mix up universals (Concepts, Classes) with individuals (Instances)

subclass-of (Motor Neuron, Neuron) (FMA, OpenGALEN)

Is_a (Motor Neuron, Neuron) instance-of (Motor Neuron, Neuron) (FlyBase)

But: instance-of (my Hand, Hand) instance-of (this amount of insulin, Insulin) instance-of (Germany, Country) not: instance of (Heart, Organ) not: instance of (Insulin, Protein)

Is_a = subclass_of:

Taxonomic Subsumption

Instance_of Class Membership

Page 83: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Don’t use superclasses to express roles

Is_a (Fish, Animal) Is_a (Fish, Food) ??

Is_a (Acetylsalicylic Acid, Salicylate) Is_a (Acetylsalicylic Acid, Analgetic Drug) ??

Be aware of the “rigidity” of entity types

Page 84: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Endurant (Continuant)Physical

Amount of matterPhysical objectFeature

Non-PhysicalMental objectSocial object

…Perdurant (Occurrent)

StaticStateProcess

DynamicAchievementAccomplishment

QualityPhysical Qualities

Spatial location…

Temporal QualitiesTemporal location…

Abstract Qualities…

AbstractQuality region

Time regionSpace regionColor region

Source: S. Borgo ISTC-CNR

Example: DOLCE’s Upper Ontology

Partition the ontology by principled upper level categories

Page 85: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Limit to a parsimonious set of semantically precise Basic Relations

Barry Smith, Werner Ceusters, Bert Klagges, Jacob Köhler,Anand Kumar, Jane Lomax, Chris Mungall, Fabian Neuhaus,Alan L Rector and Cornelius Rosse. Relations in biomedical ontologies. Genome Biology, 6(5), 2005.

Page 86: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Avoid idiosyncratic categorization

Physical object (8)DeviceDomestic, office and garden

artefactFastening(…)

Procedure (23)Administrative procedureCommunity health procedure(…)

Qualifier value (52)ActionAdditional dosage instructions(…)

Record artifactRecord organizerRecord type

Situation with explicit context (17)A/N risk factorsCritical incident factors (…)

Social context (10)CommunityFamilyGroup

(…)Special concept

Namespace conceptNavigational conceptNon-current concept

Specimen (45)Biopsy sampleBody substance sampleCardiovascular sample(…)

Staging and scales (6)Assessment scalesEndometriosis classification of

American Fertility Society

(…)Substance (11)

Allergen classBiological substanceBody substance(…)

Body structure (10)Acquired body structureAnatomical organizational pattern(…)

Clinical finding (22)Administrative statusesAdverse incident outcome categories(…)

Environment or geographical locationEnvironmentGeogr. and/or political region of the world

Event (19)AbuseAccidental eventBioterrorism related event(…)

Linkage conceptAttributeLink assertion

Observable entityAge AND/OR growth periodBody product observable(…)

Clin. history / examination observable (21)Device observableDrug therapy observableFeature of Entity

(…)Organism (11)

AnimalChromistaInfectious agent(…)

Pharmaceutical / biologic product (58)Alcohol productsAlopecia preparationAlternative medicines(…)

Physical force (21)AltitudeElectricity(…)

Page 87: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Jorge Luis Borges

"On those remote pages it is written that animals are divided into:

a. those that belong to the Emperor

b. embalmed ones c. those that are trained d. suckling pigse. mermaids f. fabulous ones

g. stray dogs h. those that are included

in this classificationi. those that tremble as

if they were mad j. innumerable ones k. those drawn with a

very fine camel's hair brush

l. others m. those that have just

broken a flower vase n. those that resemble

flies from a distance"

The Celestial Emporium of Benevolent Knowledge

Page 88: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Be aware of ambiguities

“Institution” may refer to1. (abstract) institutional rules2. (concrete) things instituted3. act of instituting sth.

“Tumor”1. evolution of a tumor as a disease process2. having a tumor as a pathological state3. tumor as a physical object

“Gene”1. a (physical) sequence of nucleotides on a DNA

chain2. a collection of (1)3. A piece of information conveyed by (1)

Page 89: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Don‘t mix up ontology with epistemiology

Is_a (Infection of unknown origin, Infection) Is_a (Newly diagnosed diabetes, Diabetes) Is_a (Family history of diabetes, Diabetes)

„what is“

„what sth. knows about “

Page 90: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

• Glycerin Kinase• Glycerokinase• GK•Glyzerinkinase

Don‘t mix up Ontology IDs with Terms

Page 91: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

„what is“

„what sth. knows about “

„how it is expressed in human language“

Page 92: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Don’t underestimate Ontology Maintenance

Formal Ontologies must always be maintained consistent (free of logic contradiction): prerequisite for

machine reasoning adequate (correctly describe the domain) prerequisite to

prevent erroneous deductions

Maintenance load is much higher than with terminologies.

Ontology maintenance is mainly task of domain experts. IT staff has supportive function

Typical design and maintenance errors

Page 93: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Structure of this talk

Introduction - Current Systems

Terminological Clarification

What do Formal Ontologies Represent ?

Terminologies vs. Formal Ontologies

Practice of Good Ontology

Outlook

Page 94: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Outlook

Ontology often used a buzzword for nontologies but… Formal ontological principles increasingly govern the

construction of Life Science Knowledge Organization Systems Users / domain expert must be heavily involved into ontology

engineering and maintenance Insufficient evidence:

Which use cases require formal ontologies In which cases informal terminology systems are sufficient? Which cases require both ? Can existing terminologies be ontologized? Can terminologies and ontologies co-exist ?

The outcome of the existing legacy systems’ move toward principled ontologies is still open

Page 95: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

SNOMED CT

One huge system Impressive domain coverage Considerable investments, important stakeholders Increasing number of (clinical) users Other use cases mainly unexplored (basic research, clinical trials) Increasing mappings to existing terminologies Legacy: hybrid of terminology, ontology, information model Overall structure idiosyncratic, disorganized Some major architectural weaknesses Unreflected use of logic, unintended entailments Major redesign necessary: formal foundations, editing guidelines,

quality control procedures Risk: uncontrollable proliferation, loss of expressiveness, Chances: Positive input by user groups

Page 96: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Open Biomedical Ontologies (OBO)

Many focused ontologies Increasing number of annotated sources Broad range: organisms – anatomies (plants,

animals) – pathways – biomedical investigations – cells – development – protein – sequence …

Convergence to standardized syntax and semantics

Increasingly using formal ontology principles Public

Page 97: Biomedical Ontologies What are they (for) ? Stefan Schulz Medical Informatics Research Group University Medical Center Freiburg, Germany

Thank you!Contact:

[email protected]

Stefan Schulz

Medical Informatics

Research Group

UniversityMedical Center

Freiburg, Germany