
Page 1: Distributed Common Ground System Army (DCGS-A)ncor.buffalo.edu/OI2/slides/Big Military Data - Smith.pdf · Distributed Common Ground System – Army (DCGS-A) ... in support of US

Distributed Common Ground System – Army (DCGS-A)

Barry Smith
Director, National Center for Ontological Research

The Role of Ontology in the Era of Big (Military) Data

Page 2:

Distributed Development of a Shared Semantic Resource (SSR) in support of US Army’s Distributed Common Ground System Standard Cloud (DSC) initiative

with thanks to: Tanya Malyuta, Ron Rudnicki

Background materials: http://x.co/yYxN

Page 4:

Making data (re-)usable through common controlled vocabularies

• Allow multiple databases to be treated as if they were a single data source by eliminating terminological redundancy in the ways data are described
– not ‘Person’, and ‘Human’, and ‘Human Being’, and ‘Pn’, and ‘HB’, but simply: person

• Allow development and use of common tools and techniques, common training, and single validation of data, focused around
– semantic technology
– coordinated ontology development and use
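
The redundancy-elimination idea on this slide can be sketched in a few lines. This is only an illustration: the synonym table, the function names, and the two sample databases are assumptions made for the example, not part of the DCGS-A work.

```python
# Hypothetical synonym table: every local label maps to one preferred term.
PREFERRED_TERM = {
    "person": "person",
    "human": "person",
    "human being": "person",
    "pn": "person",
    "hb": "person",
}


def normalize_label(label: str) -> str:
    """Return the controlled-vocabulary term for a local label."""
    key = label.strip().lower()
    if key not in PREFERRED_TERM:
        raise KeyError(f"no controlled term for label {label!r}")
    return PREFERRED_TERM[key]


def merge_sources(*sources: dict[str, list[str]]) -> dict[str, list[str]]:
    """Merge records from several databases under their controlled terms."""
    merged: dict[str, list[str]] = {}
    for source in sources:
        for label, records in source.items():
            merged.setdefault(normalize_label(label), []).extend(records)
    return merged


# Two made-up databases that describe the same kind of thing differently.
db_a = {"Human": ["a1", "a2"]}
db_b = {"HB": ["b1"], "Pn": ["b2"]}
combined = merge_sources(db_a, db_b)
```

Once both sources are keyed by the single term person, they can be queried as if they were one data source.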

Page 5:

Ontology =def.

• controlled vocabulary organized as a graph
• nodes in the graph are terms representing types in reality
• each node is associated with a definition and synonyms
• edges in the graph represent well-defined relations between these types
• the graph is structured hierarchically via subtype relations
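
A minimal sketch of this definition as a data structure, assuming a simple in-memory representation. The class names, the relation name is_a, and the sample synonym are illustrative choices; the vehicle terms and definitions are taken from a later slide of this deck.

```python
from dataclasses import dataclass, field


@dataclass
class Term:
    """A node: a term representing a type, with a definition and synonyms."""
    name: str
    definition: str
    synonyms: list[str] = field(default_factory=list)


class Ontology:
    def __init__(self) -> None:
        self.terms: dict[str, Term] = {}
        # Edges are stored as (subject, relation, object) triples.
        self.edges: set[tuple[str, str, str]] = set()

    def add_term(self, term: Term) -> None:
        self.terms[term.name] = term

    def relate(self, subject: str, relation: str, obj: str) -> None:
        if subject not in self.terms or obj not in self.terms:
            raise KeyError("both ends of an edge must be known terms")
        self.edges.add((subject, relation, obj))

    def supertypes(self, name: str) -> list[str]:
        """Follow is_a edges upward, nearest supertype first."""
        found: list[str] = []
        frontier = [name]
        while frontier:
            current = frontier.pop(0)
            for subject, relation, obj in sorted(self.edges):
                if subject == current and relation == "is_a" and obj not in found:
                    found.append(obj)
                    frontier.append(obj)
        return found


onto = Ontology()
onto.add_term(Term("vehicle", "an object used for transporting people or goods"))
onto.add_term(Term("tractor", "a vehicle that is used for towing", ["tow vehicle"]))
onto.add_term(Term("wheeled tractor", "a tractor that has a wheeled platform"))
onto.relate("tractor", "is_a", "vehicle")
onto.relate("wheeled tractor", "is_a", "tractor")
```

Calling onto.supertypes("wheeled tractor") walks the subtype hierarchy and returns tractor followed by vehicle.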

Page 6:

Ontologies

• computer-tractable representations of types in specific areas of reality
• divided into more and less general
– upper = organizing ontologies, provide common architecture and thus promote interoperability
– lower = domain ontologies, provide grounding in reality
• reflecting top-down and bottom-up strategy

Page 7:

Success story in biomedicine

Goal: integration of biological and clinical data
– across different species
– across levels of granularity (organ, organism, cell, molecule)
– across different perspectives (physical, biological, clinical)
– within and across domains (growth, aging, environment, genetic disease, toxicity …)

Page 8:

The Open Biomedical Ontologies (OBO) Foundry

Ontologies arranged by RELATION TO TIME (CONTINUANT, divided into INDEPENDENT and DEPENDENT; OCCURRENT) and by GRANULARITY:

• ORGAN AND ORGANISM
– independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– occurrent: Biological Process (GO)
• CELL AND CELLULAR COMPONENT
– independent continuant: Cell (CL); Cellular Component (FMA, GO)
– dependent continuant: Cellular Function (GO)
• MOLECULE
– independent continuant: Molecule (ChEBI, SO, RnaO, PrO)
– dependent continuant: Molecular Function (GO)
– occurrent: Molecular Process (GO)

Page 9:

Population-level ontologies

Ontologies arranged by RELATION TO TIME (CONTINUANT, divided into INDEPENDENT and DEPENDENT; OCCURRENT) and by GRANULARITY:

• COMPLEX OF ORGANISMS
– independent continuant: Family, Community, Population
– dependent continuant: Population Phenotype
– occurrent: Population Process
• ORGAN AND ORGANISM
– independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– occurrent: Biological Process (GO)
• CELL AND CELLULAR COMPONENT
– independent continuant: Cell (CL); Cellular Component (FMA, GO)
– dependent continuant: Cellular Function (GO)
• MOLECULE
– independent continuant: Molecule (ChEBI, SO, RnaO, PrO)
– dependent continuant: Molecular Function (GO)
– occurrent: Molecular Process (GO)

Page 10:

Environment Ontology

Ontologies arranged by RELATION TO TIME (CONTINUANT, divided into INDEPENDENT and DEPENDENT; OCCURRENT) and by GRANULARITY, with the Environment Ontology added alongside:

• ORGAN AND ORGANISM
– independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– occurrent: Biological Process (GO)
• CELL AND CELLULAR COMPONENT
– independent continuant: Cell (CL); Cellular Component (FMA, GO)
– dependent continuant: Cellular Function (GO)
• MOLECULE
– independent continuant: Molecule (ChEBI, SO, RnaO, PrO)
– dependent continuant: Molecular Function (GO)
– occurrent: Molecular Process (GO)

Page 11:

rationale of OBO Foundry coverage

Ontologies arranged by RELATION TO TIME (CONTINUANT, divided into INDEPENDENT and DEPENDENT; OCCURRENT) and by GRANULARITY:

• ORGAN AND ORGANISM
– independent continuant: Organism (NCBI Taxonomy); Anatomical Entity (FMA, CARO)
– dependent continuant: Organ Function (FMP, CPRO); Phenotypic Quality (PaTO)
– occurrent: Organism-Level Process (GO)
• CELL AND CELLULAR COMPONENT
– independent continuant: Cell (CL); Cellular Component (FMA, GO)
– dependent continuant: Cellular Function (GO)
– occurrent: Cellular Process (GO)
• MOLECULE
– independent continuant: Molecule (ChEBI, SO, RNAO, PRO)
– dependent continuant: Molecular Function (GO)
– occurrent: Molecular Process (GO)

Page 12:

OBO Foundry approach extended into other domains

• NIF Standard: Neuroscience Information Framework
• ISF Ontologies: Integrated Semantic Framework
• OGMS and Extensions: Ontology for General Medical Science
• IDO Consortium: Infectious Disease Ontology
• cROP: Common Reference Ontologies for Plants

Page 13:

Modular organization + Extension strategy

top level
• Basic Formal Ontology (BFO)

domain level
• Anatomy Ontology (FMA*, CARO)
• Environment Ontology (EnvO)
• Infectious Disease Ontology (IDO*)
• Biological Process Ontology (GO*)
• Cell Ontology (CL)
• Cellular Component Ontology (FMA*, GO*)
• Phenotypic Quality Ontology (PaTO)
• Subcellular Anatomy Ontology (SAO)
• Sequence Ontology (SO*)
• Molecular Function (GO*)
• Protein Ontology (PRO*)

Page 14:

~100 ontologies using BFO

• US Army Biometrics Ontology
• Brucella Ontology (IDO-BRU)
• eagle-i and VIVO (NCRR)
• Financial Report Ontology (to support SEC through XBRL)
• IDO Infectious Disease Ontology (NIAID)
• Malaria Ontology (IDO-MAL)
• Nanoparticle Ontology (NPO)
• Ontology for Risks Against Patient Safety (RAPS/REMINE)
• Parasite Experiment Ontology (PEO)
• Subcellular Anatomy Ontology (SAO)
• Vaccine Ontology (VO)
• …

Page 15:

Basic Formal Ontology

Thursday, April 18, 2013

[Figure: BFO class hierarchy showing BFO:Entity, BFO:Continuant, BFO:Occurrent, BFO:Independent Continuant, BFO:Dependent Continuant, BFO:Disposition, and BFO:Process]

Page 16:

Basic Formal Ontology and Mental Functioning Ontology (MFO)

[Figure: class hierarchy with a BFO / MFO legend. BFO classes: BFO:Entity, BFO:Continuant, BFO:Occurrent, BFO:Independent Continuant, BFO:Dependent Continuant, BFO:Quality, BFO:Disposition, BFO:Process. Other classes shown: Organism, Mental Functioning Related Anatomical Structure, Behaviour inducing state, Cognitive Representation, Affective Representation, Mental Process, Bodily Process]

Page 17:

Emotion Ontology extends MFO

[Figure: class hierarchy with a BFO / MFO / MFO-EM legend. BFO classes: BFO:Entity, BFO:Continuant, BFO:Occurrent, BFO:Independent Continuant, BFO:Dependent Continuant, BFO:Disposition, BFO:Process. Other classes shown: Organism, Cognitive Representation, Affective Representation, Mental Process, Bodily Process, Emotion Occurrent, Emotional Action Tendencies, Appraisal, Subjective Emotional Feeling, Physiological Response to Emotion Process, Emotional Behavioural Process, Appraisal Process. Relations shown: inheres_in, is_output_of, has_part, agent_of]

Page 18:

Sample from Emotion Ontology: Types of Feeling

Page 19:

The problem of joint / coalition operations

[Figure labels: Fire Support; Logistics; Air Operations; Intelligence; Civil-Military Operations; Targeting; Maneuver & Blue Force Tracking]

Page 20:

US DoD Civil Affairs strategy for non-classified information sharing

Page 21:

Ontologies / semantic technology can help to solve this problem

[Figure labels: Fire Support; Logistics; Air Operations; Intelligence; Civil-Military Operations; Targeting; Maneuver & Blue Force Tracking]

Page 22:

But if each community produces its own ontology, this will merely create new, semantic siloes

[Figure labels: Fire Support; Logistics; Air Operations; Intelligence; Civil-Military Operations; Targeting; Maneuver & Blue Force Tracking]

Page 23:

What we are doing to avoid the problem of semantic siloes

Distributed Development of a Shared Semantic Resource

Pilot testing to demonstrate feasibility

Page 24:

creating the analog of this in the military domain

top level
• Basic Formal Ontology (BFO)

domain level
• Anatomy Ontology (FMA*, CARO)
• Environment Ontology (EnvO)
• Infectious Disease Ontology (IDO*)
• Biological Process Ontology (GO*)
• Cell Ontology (CL)
• Cellular Component Ontology (FMA*, GO*)
• Phenotypic Quality Ontology (PaTO)
• Subcellular Anatomy Ontology (SAO)
• Sequence Ontology (SO*)
• Molecular Function (GO*)
• Protein Ontology (PRO*)

Page 25:

Semantic Enhancement

Annotation (tagging) of source data models using terms from coordinated ontologies
– data remain in their original state (are treated at arm's length)
– tagged using interoperable ontologies created in tandem
– can be as complete as needed, lossless, long-lasting because flexible and responsive
– big bang for buck: measurable benefit even from first small investments

Coordination through shared governance and training
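
One way to picture arm's-length annotation is a separate tagging layer that sits beside untouched source records. The sketch below is an assumption-laden illustration: the field names Pn and VehTyp, the sample rows, and the helper function are invented for the example and do not describe the actual DSC data models.

```python
# Source records are left exactly as a (hypothetical) source system delivers them.
SOURCE_ROWS = [
    {"Pn": "J. Doe", "VehTyp": "T33"},
    {"Pn": "A. Roe", "VehTyp": "T33"},
]

# Annotations live beside the data: source field name -> ontology term.
FIELD_ANNOTATIONS = {
    "Pn": "person",
    "VehTyp": "vehicle",
}


def values_for_term(rows, annotations, term):
    """Collect the values of every source field tagged with the given term."""
    fields = [name for name, tagged in annotations.items() if tagged == term]
    return [row[name] for row in rows for name in fields if name in row]


people = values_for_term(SOURCE_ROWS, FIELD_ANNOTATIONS, "person")
```

Queries go through the ontology term, while the rows themselves are never rewritten, so an annotation can be corrected or extended later without touching the source.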

Page 26:

Main challenge: Will it scale?

The problem of scalability turns on
• the ability to accommodate ever increasing volumes and types of data and numbers of users
• whether we can preserve coordination (consistency, non-redundancy) as ever more domains become involved
• whether we can respond in agile fashion to ever changing bodies of source data

Page 27:

Strategy for agile ontology creation

• Identify or create carefully validated general purpose plug-and-play reference ontology modules for principal domains
• Develop a method whereby these reference ontologies can be extended very easily to cope with specific, local data through creation of application ontologies

Page 28:

vehicle =def. an object used for transporting people or goods
tractor =def. a vehicle that is used for towing
crane =def. a vehicle that is used for lifting and moving heavy objects
vehicle platform =def. means of providing mobility to a vehicle
wheeled platform =def. a vehicle platform that provides mobility through the use of wheels
tracked platform =def. a vehicle platform that provides mobility through the use of continuous tracks
artillery vehicle =def. vehicle designed for the transport of one or more artillery weapons
wheeled tractor =def. a tractor that has a wheeled platform
Russian wheeled tractor type T33 =def. a wheeled tractor of type T33 manufactured in Russia
Ukrainian wheeled tractor type T33 =def. a wheeled tractor of type T33 manufactured in Ukraine

Reference Ontology Application Ontology
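
Each definition above names a parent term (the genus) and what distinguishes the child from it. A rough sketch of how an application term can be traced back to a reference term follows. The split between reference and application terms in REFERENCE_TERMS is an assumption, since the slide's legend does not survive in the transcript, and the function names are illustrative.

```python
# term -> (genus, differentia), transcribed from the definitions on the slide.
DEFINITIONS = {
    "vehicle": ("object", "used for transporting people or goods"),
    "tractor": ("vehicle", "used for towing"),
    "artillery vehicle": ("vehicle", "designed for the transport of one or more artillery weapons"),
    "wheeled tractor": ("tractor", "has a wheeled platform"),
    "Russian wheeled tractor type T33": ("wheeled tractor", "of type T33 manufactured in Russia"),
    "Ukrainian wheeled tractor type T33": ("wheeled tractor", "of type T33 manufactured in Ukraine"),
}

# Assumption for illustration: which terms belong to the reference ontology.
REFERENCE_TERMS = {"vehicle", "tractor"}


def genus_chain(term: str) -> list[str]:
    """List the parents of a term, nearest first, by following each genus."""
    chain: list[str] = []
    while term in DEFINITIONS:
        term = DEFINITIONS[term][0]
        chain.append(term)
    return chain


def extends_reference(term: str) -> bool:
    """True if some ancestor of the term is a reference-ontology term."""
    return any(parent in REFERENCE_TERMS for parent in genus_chain(term))


chain = genus_chain("Russian wheeled tractor type T33")
```

A local term that fails extends_reference would be a sign that it has been defined in isolation rather than as an extension of the shared reference ontology.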

Page 30:

Basic Formal Ontology (BFO)

• Extended Relation Ontology
• Time Ontology
• Quality Ontology
• Information Entity Ontology
• Geospatial Ontology
• Event Ontology
• Artifact Ontology
• Agent Ontology

Page 34:

http://milportal.org

Page 38:

An example of agile application ontology development: The Bioweapons Ontology (BWO)

Page 39:

Kinds of chemical and biological weapons

Chemical
• Nerve agents (sarin gas)
• Blister agents (mustard gas)
• Blood agents (cyanide gas)

Biological
• Infectious agents – BWO(I)
• Toxic agents (botulinum toxin, ricin) – BWO(T)

Page 40:

We focus here on BWO(I)

Infectious agents
– Bacterial (anthrax, bubonic plague, tularemia, brucellosis, cholera …)
– Viral (Ebola, Marburg …)

Page 41:

Examples of ontology terms (BFO / IDO / StaphIDO)

• Independent Continuant / Infectious disorder / Staph. aureus disorder
• Dependent Continuant / Infectious disease / MRSA
• Dependent Continuant / Protective resistance / Methicillin resistance
• Occurrent / Infectious disease course / MRSA course

Page 42:

Infectious Disease Ontology (IDO)

IDO Core (Reference Ontology)
• General terms in the ID domain.

IDO Extensions (Application Ontologies)
• Disease-, host-, pathogen-specific.
• Developed by subject matter experts.

The hub-and-spokes strategy ensures that logical content of IDO Core is automatically inherited by the IDO Extensions

• with thanks to Lindsay Cowell (University of Texas SW Medical Center) and Albert Goldfain (Blue Highway, Inc.)
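
The inheritance claim can be illustrated with a toy import graph. This is a sketch under stated assumptions: the module names come from a later slide of this deck, but the import links and the terms assigned to each module are guesses made for the example, not the published structure of IDO.

```python
# Hypothetical import links: each extension imports its hub.
IMPORTS = {
    "IDOCore": [],
    "IDOSa": ["IDOCore"],
    "IDOHumanSa": ["IDOSa"],
}

# Terms declared directly in each module (illustrative assignment).
OWN_TERMS = {
    "IDOCore": {"pathogen", "infection"},
    "IDOSa": {"Staph. aureus disorder"},
    "IDOHumanSa": set(),
}


def effective_terms(module: str) -> set[str]:
    """Terms visible in a module: its own plus everything it imports."""
    terms = set(OWN_TERMS[module])
    for parent in IMPORTS[module]:
        terms |= effective_terms(parent)
    return terms


before = effective_terms("IDOHumanSa")
# A term added to the hub becomes visible in every spoke without further work.
OWN_TERMS["IDOCore"].add("colonization")
after = effective_terms("IDOHumanSa")
```

The difference between before and after is exactly the term added to the core, which is the sense in which the spokes inherit from the hub automatically. The sketch assumes the import graph has no cycles.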

Page 43:

IDO Core

• Contains general terms in the ID domain:
– E.g., ‘colonization’, ‘pathogen’, ‘infection’
• A contract between IDO extension ontologies and the datasets that use them.
• Intended to represent information along several dimensions:
– biological scale (gene, cell, organ, organism, population)
– discipline (clinical, immunological, microbiological)
– organisms involved (host, pathogen, and vector types)

Page 45:

IDO Extensions

• IDO – Brucellosis
• IDO – Dengue Fever
• IDO – Influenza
• IDO – Malaria
• IDO – Staphylococcus Aureus Bacteremia
• IDO – Vector Surveillance and Management
• IDO – Plant
• VO – Vaccine Ontology
• BWO(I) – Bioweapons Ontology (Infectious Agents)

Page 46:

How IDO evolves: the case of Staph. aureus

[Figure: IDO modules: IDOCore, IDOSa, IDOHumanSa, IDORatSa, IDOStrep, IDORatStrep, IDOHumanStrep, IDOMRSa, IDOHumanBacterial, IDOAntibioticResistant, IDOMAL, IDOHIV, IDOFLU]

HUB and SPOKES: Domain ontologies

SEMI-LATTICE: By subject matter experts in different communities of interest.

Page 50:

BWO:disease by infectious agent =def. a disease that is the consequence of the presence of pathogenic microbial agents, including pathogenic viruses, pathogenic bacteria, fungi, protozoa, multicellular parasites, and aberrant proteins known as prions

Page 51:

Strategy used to build BWO(I)

with thanks to Lindsay Cowell and Oliver He (Michigan)

1. Start with a glossary such as: http://www.emedicinehealth.com/biological_warfare/
2. Select the corresponding terms needed to describe bioweapons from IDO Core and related ontologies such as the ChEBI chemistry ontology
3. All ontology terms keep their original definitions and IDs
4. The result is a spreadsheet

Page 52:

5. Where glossary terms have no ontology equivalent, create BWO ontology terms and definitions as needed

[Figure callout: no corresponding ontology term]

Page 53:

6. Use the Ontofox tool to create the first version of the BWO(I) application ontology (http://ontofox.hegroup.org/)
7. Use BWO(I) in annotations, and where gaps are identified create extension terms, for instance
– weaponized brucella
– aerosol anthrax
– smallpox incubation period

This establishes a virtuous cycle between ontology development and use in annotations
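
Steps 1 to 5 of the strategy can be sketched as a small matching routine that produces spreadsheet rows. Everything specific here is hypothetical: the lookup table, the EX: and BWO:NEW identifiers, and the placeholder definitions are invented for illustration and are not real IDO, ChEBI, or BWO entries. The sketch also stops before step 6 and makes no attempt to call Ontofox.

```python
import csv
import io

# Hypothetical lookup of existing ontology terms: label -> (ID, definition).
# The identifiers are placeholders, not real ontology IDs.
EXISTING_TERMS = {
    "pathogen": ("EX:0001", "placeholder definition for pathogen"),
    "infection": ("EX:0002", "placeholder definition for infection"),
}

COLUMNS = ["label", "id", "definition", "source"]


def build_rows(glossary: list[str]) -> list[dict[str, str]]:
    """Reuse existing terms with their IDs; mint placeholder terms for gaps."""
    rows: list[dict[str, str]] = []
    new_count = 0
    for label in glossary:
        if label in EXISTING_TERMS:
            term_id, definition = EXISTING_TERMS[label]
            rows.append({"label": label, "id": term_id,
                         "definition": definition, "source": "reused"})
        else:
            new_count += 1
            # The definition is left empty for a subject matter expert to write.
            rows.append({"label": label, "id": f"BWO:NEW{new_count:03d}",
                         "definition": "", "source": "new"})
    return rows


def to_csv(rows: list[dict[str, str]]) -> str:
    """Render the rows as CSV text, the spreadsheet of step 4."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


rows = build_rows(["pathogen", "aerosol anthrax"])
sheet = to_csv(rows)
```

Reused terms keep the ID and definition they already had, while the rows marked new show where BWO terms still need to be created and defined.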

Page 54:

Potential uses of BWO

– semantic enhancement of bioweapons intelligence data
– results will be automatically interoperable with relevant bioinformatics and public health IT tools for dealing with infections, epidemics, vaccines, forensics, …
– to annotate research literature and research data on bioweapons
– to create computable definitions to substitute for definitions in free text glossaries

Page 55:

Why do people think they need lexicons

• Training
• Compiling lessons learned
• Compiling results of testing, e.g. of proposed new doctrine
• Collective inferencing
• Official reporting
• Doctrinal development
• Standard operating procedures
• Sharing of data
• People need to (ensure that they) understand each other