manchester medical informatics group opengalen 1 linking formal ontologies: scale, granularity and...
TRANSCRIPT
1
Manchester Medical Informatics Group OpenGALEN
Linking Formal Ontologies: Scale, Granularity and Context
Alan Rector
Medical Informatics Group, University of Manchesterwww.cs.man.ac.uk/mig
www.opengalen.orgimg.cs.man.ac.uk
2
Manchester Medical Informatics Group OpenGALEN
Why use Logic-based Ontologies?
because
Knowledge is Fractal!&
Changeable!
3
Manchester Medical Informatics Group OpenGALEN
Four Roles of Terminology/Ontologies
⢠Content of Databases and Patient Records â Structural linkage within EPR/EHR & messagesâ Content of EPR/EHR & messages
⢠Capturing information - the user interface
⢠Linkage between domainsLinkage between domainsâ Health and Bio Sciences Health and Bio Sciences â Macro, Micro, and Molecular scalesMacro, Micro, and Molecular scalesâ Contexts: Normal / abnormal; species; stage of developmentContexts: Normal / abnormal; species; stage of developmentâ Healthcare delivery and Clinical researchâ Patient Records and Decision Support
⢠Indexing Informationâ Metadata and the semantic web
⢠www.semanticweb.org www.w3c.org
4
Manchester Medical Informatics Group OpenGALEN
Logic based ontologies
⢠The descendants of frame systems and object hierarchies via KL-ONE
⢠âis-kind-ofâ = âimpliesâ â âDog is a kind of wolfâ
meansâAll dogs are wolvesâ
â Therefore logically computable
⢠Modern examples: OIL, DAML+OIL (âOWLâ?)â Underpinned by the FaCT family of Description Logic Reasoners
⢠Others LOOM, CLASSIC, BACK, GRAIL,...
⢠www.ontoknowledge.org/oil www.semanticweb.org
5
Manchester Medical Informatics Group OpenGALEN
Logic-based Ontologies: Conceptual Lego
hand
extremity
body
acute
chronic
abnormal
normalischaemic
deletion
bacterial
polymorphism
cell
protein
gene
infection
inflammation
Lung
expression
6
Manchester Medical Informatics Group OpenGALEN
Logic-based Ontologies: Conceptual Lego
âSNPolymorphism of CFTRGene causing Defect in MembraneTransport of ChlorideIon causing Increase in Viscosity of Mucus in CysticFibrosisâŚâ
âHand which isanatomicallynormalâ
7
Manchester Medical Informatics Group OpenGALEN
Whatâs in a âLogic based ontologyâ?
⢠Primitive concepts - in a hierarchyâ Described but not defined
⢠Properties - relations between conceptsâ Also in a hierarchy
⢠Descriptors - property-concept pairs â qualified by âsomeâ, âonlyâ, âat leastâ, âat mostâ
⢠Defined conceptsâ Made from primitive concepts and descriptors
⢠Axiomsâ disjointness, further description of defined concepts
⢠A Reasonerâ to organise it for you
8
Manchester Medical Informatics Group OpenGALEN
Encrustation
+ involves: MitralValve
Thing
+ feature: pathological
Structure
+ feature: pathological
+ involves: Heart
Logic Based Ontologies: A crash course
Thing
Structure
Heart MitralValve EncrustationMitralValve* ALWAYS partOf: Heart
Encrustation* ALWAYS feature: pathological
Feature
pathological red
+ (feature: pathological)
red
+ partOf: Heart
red
+ partOf: Heart
9
Manchester Medical Informatics Group OpenGALEN
Bridging Bio and Health Informatics
⢠Define concepts with âpiecesâ from different scales and disciplinesâ âPolymorphism which causes defect which causes diseaseâ
⢠Define concepts which make context explicitâ â âHand which is anatomically normalâ
has five fingersâ
⢠Separate properties for different contexts/views â âAbnormalities of clinical parts of the heartâ
⢠includes pericardium
10
Manchester Medical Informatics Group OpenGALEN
Bridging Scales and
context with Ontologies
GenesSpecies
Protein
Function
Disease
Protein coded bygene in species
Function ofProtein coded bygene in species
Disease caused by abnormality inFunction ofProtein coded bygene in species
Gene in Species
11
Manchester Medical Informatics Group OpenGALEN
Representing context and views by variant properties
Organ
HeartPericardium
OrganPart
CardiacValve
Disease of (is_part_of) Heart
Disease of Pericardium
is_part_of
is_structurally_part_ofis_clinically_part_of
12
Manchester Medical Informatics Group OpenGALEN
The cost: Ontologies are not Thesauri
organ } kind heart } part heart valve } kind aortic valve } part aortic valve cusp
A Mixed Hierarchy
Works for navigation by humans
Works for âDisease ofâŚâ and âProcedure onâŚâ
Fails for âSurface ofâŚâ
How can the computer know the difference?
13
Manchester Medical Informatics Group OpenGALEN
From a thesaurus to a logic-based ontology
disorder of organ
disorder of heart
disorder of valve in heart
disorder of aortic valve in heart
disorder of cusp in aortic valve in heart
A logic-based is-kind-of (subsumption) hierarchy
Untangle part-whole and is-kind-of in anatomic ontology
Link Clinical Ontology with Anatomical ontology
Add rule that âDisorder of part disorder of wholeâ
Reasoner can then create automatically:
14
Manchester Medical Informatics Group OpenGALEN
Examples common in Bio Ontologies
Is part ofGolgi membrane Integral protein
Is part ofPlasma membrane Apical plasma membrane
15
Manchester Medical Informatics Group OpenGALEN
The Cost: Normalising (untangling) Ontologies
StructureFunction
Part-wholeStructure Function
Part-w
hole
16
Manchester Medical Informatics Group OpenGALEN
The Cost: Normalising (untangling) Ontologies
Making each meaning explicit and separate
⌠ActionRole PhysiologicRole HormoneRole CatalystRole âŚ
⌠Substance BodySubstance Protein Steroid âŚ
PhysSubstance Protein ProteinHormone Insulin Enzyme Steroid SteroidHormone Hormone ProteinHormone^ Insulin^ SteroidHormone^ Catalyst Enzyme^
Hormone = Substance & playsRole-HormoneRoleProteinHormone = Protein & playsRole-HormoneRoleSteroidHormone = Steroid & playsRole-HormoneRoleCatalyst = Substance & playsRole CatalystRole
...and helping keep argument rational and meetings short!
Enzyme ?=? Protein & playsRole-CatalystRole
PhysSubstance Protein â ProteinHormoneâ Insulin âEnzymeâ Steroid âSteroidHormoneâ âHormoneâ âProteinHormoneâ Insulin^ âSteroidHormoneâ âCatalystâ âEnzymeâ
17
Manchester Medical Informatics Group OpenGALEN
The Cost
⢠You canât say everything you want toâ Expressiveness costs computational complexity
⢠More inference takes more timeâ Scaling for complex tasks still being investigated
⢠Many other kinds of reasoning needed
It doesnât make the! Coffee!
18
Manchester Medical Informatics Group OpenGALEN
Other benefits⢠Limit combinatorial explosions
From âphrase bookâ to âdictionary + grammarâ Avoid the âexploding bicycleâ
â 1980 - ICD-9 (E826) 8 â 1990 - READ-2 (T30..) 81â 1995 - READ-3 87â 1996 - ICD-10 (V10-19) 587
⢠V31.22 Occupant of three-wheeled motor vehicle injured in collision with pedal cycle, person on outside of vehicle, nontraffic accident, while working for income
â and meanwhile elsewhere in ICD-10
⢠W65.40 Drowning and submersion while in bath-tub, street and highway, while engaged in sports activity
⢠X35.44 Victim of volcanic eruption, street and highway, while resting, sleeping, eating or engaging in other vital activities
19
Manchester Medical Informatics Group OpenGALEN
Study a phase 2
Other benefits
Hypertension
Idiopathic Hypertension
In our companyâs studies
Study a
Phase 2
Hypertension
Idiopathic Hypertension`
In our companyâs studies
Phase 2
⢠Index and assemble information
20
Manchester Medical Informatics Group OpenGALEN
Summary: Logic based ontologies because
Knowledge is Fractal⢠Link âConceptual Legoâ
â at all levels⢠indefinitely
â Spanning scales, genotype, phenotype, etc.
⢠Model context and viewsâ Express differences explicitly
⢠Manage combinatorial explosion
⢠Index information efficiently
Next step: Larger scale demonstrations in Genotype to Phenotype