workflow-driven ontologies for the geosciences

32
GEON Cyberinfrastructure Workshop Beijing, China, July 21-23, 2006 Workflow-Driven Ontologies for the Geosciences Leonardo Salayandía The University of Texas at El Paso

Upload: hayley

Post on 12-Jan-2016

41 views

Category:

Documents


0 download

DESCRIPTION

Workflow-Driven Ontologies for the Geosciences. Leonardo Salayandía The University of Texas at El Paso. Overview. Background Cyberinfrastructure Ontologies Workflows Purpose of this talk The Workflow-Driven Ontology approach Knowledge capture Workflow creation from WDOs - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Workflow-Driven Ontologies for the Geosciences

Leonardo Salayandía

The University of Texas at El Paso

Page 2: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Overview

• Background– Cyberinfrastructure– Ontologies– Workflows

• Purpose of this talk• The Workflow-Driven Ontology approach

– Knowledge capture– Workflow creation from WDOs– Benefits of WDOs

• Status• Summary

Page 3: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

CyberinfrastructureS-wave tomography models

GPS plate motion vectors

Global Strain Rate Map

GEON IDV(Integrated Data Viewer)

[http://geon.unavco.org]

Page 4: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

CyberinfrastructureS-wave tomography models

GPS plate motion vectors

Global Strain Rate Map

GEON IDV

Distributed sources of informationInformation in different formats

Distributed tools and applications

Page 5: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Cyberinfrastructure

• People and resources connected through the web

• Enhanced collaboration over distance, time, and disciplines– Interoperate across institutions and disciplines– Preserve and maintain availability of software

and data

Page 6: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Cyberinfrastructure

• People and resources connected through the web

• Enhanced collaboration over distance, time, and disciplines– Interoperate across institutions and disciplines– Preserve and maintain availability of software

and data

Page 7: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Ontologies

• A specification of a conceptualization

• Concepts (or classes of objects)– Concept1: S-wave tomography model (TM)– Concept2: Geospatial representation

• Relationships between concepts– S-wave TM HAS Geospatial Representation

Page 8: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Workflows

• Recipes for accomplishing some complex task• Composition of service modules (CI services)• Automate tedious and time-consuming tasks • Useful for experiment replication• Example:

Page 9: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Workflows

• Recipes for accomplishing some complex task• Composition of service modules (CI services)• Automate tedious and time-consuming tasks • Useful for experiment recreation• Example:

S-wave tomography data

Create ModelS-wave

tomography model

Service to get the data

Service to transform data

Transformed data outcome

Page 10: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Cyberinfrastructure

[B. Ludäescher, 2006]

Page 11: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Cyberinfrastructure

[B. Ludäescher, 2006]

Workflows

Ontologies

Page 12: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Purpose of talk

• Show an approach for scientists to capture knowledge in a way that can be leveraged towards CI– Create ontology specifications– Generate workflows from ontologies

Page 13: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Purpose of talk

• Show an approach for scientists to capture knowledge in a way that can be leveraged towards CI– Create ontology specifications– Generate workflows from ontologies

Workflow-Driven Ontologies(WDOs)

Page 14: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Example: Gravity WDO

Geoscientist

I use geophysical data to elucidate the tectonic

development of the North American craton

I want to produce a gravity data contour map. These are the

steps that I go through to do it:

Contour Map

Grid

Gravity Data

Get the data

Create a grid of uniformly

distributed points from this

data

Use the grid as input to render

the map

Dr. Randy Keller

Page 15: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Capture Knowledge

Contour Map

Grid

Gravity Data

Different types of Information

Page 16: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Capture Knowledge

Contour Map

Grid

Gravity Data

Information

Raw Data

Processed Data

Product

How is the information transformed?

Is converted to

Is rendered into

Page 17: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Capture Knowledge

Contour Map

Grid

Gravity Data

Information

Raw Data

Processed Data

Product

Contouring Algorithm

Gridding Algorithm

Methods

Is input into

Is input into

Outputs

Outputs

Is converted to

Is rendered into

Page 18: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Class Hierarchy for WDOsRoot

Information Methods

Data Product

Raw Data Processed Data

Gravity Data Grid Contour Map

Gridding

Contouring

Common classes for all WDOs

Classes specific to the Gravity WDO

Page 19: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Workflow specification generated from Gravity WDO

Root

Information Methods

Data Product

Raw Data Processed Data

Gravity Data Grid

Gridding

Is input into

Outputs

CI Service1:Gravity Data

Extraction

CI Service2:Gridding

Result

Mapping between WDO classes and CI services

Page 20: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

From workflow specification to workflow implementation

• Workflow engines:– Kepler scientific workflows (GEON et al.)– OWL-S (Semantic Web)– Many others…

• Workflow specifications produced from WDOs can potentially be “realized” in any service-oriented workflow engine

Page 21: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Benefits of WDOs

• Scientific products drive the creation of the WDO– Incremental development

• WDO serves as roadmap for future CI service development– Identify missing services for potentially useful

workflows

• Generated workflows serve as a gauge for the usefulness of an ontology

Page 22: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Status

• Gravity WDO prototype– Workflows in the process of being

implemented in the Kepler Scientific Workflow Engine

• WDO Assistant and API software

Page 23: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

The Gravity WDO• First WDO prototype (Flor Salcedo, Randy Keller,

and Ann Gates)

Page 24: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Status

• Gravity WDO prototype– Workflows in the process of being

implemented in the Kepler Scientific Workflow Engine

• WDO Assistant and API software

Page 25: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

WDO Assistant and API

• Prototype built on top of the Jena API – Java programming language

• Three modes of operation– Brainstorming– Elicitation– Workflow Generation

Page 26: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

WDO Assistant and API

• Brainstorming mode– Scientists define concepts that relate to CI

information and methods

Page 27: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

WDO Assistant and API

• Elicitation mode– Scientists define relationships between

concepts

Page 28: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

WDO Assistant and API

• Workflow Generation mode– Scientists choose information concept for

which to generate a workflow, as well as target workflow engine

Page 29: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Future Work

• CI-Miner– Provenance information– Trust information– Preferences

Page 30: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

OWL onts.

GenericCI Portal

WDOs

CompositeOWL-SService

WFGen

AtomicOWL-S Service

PSW

A Service

Answer/provenancevisualization

CI-Trust

CI Miner

PML

TrustNet

CI-Base(IWBase)

Serviceexecution

OWL-SAPI

CI-Browser

ontologiescalls

uses

Legend

creates

TrustRecommendation

CI-Browser

WDO API

JENA

CI BackgroundTools

WDOAssistant

Knowledgecapture

Protégé, SWOOP

Page 31: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Summary

• In order to realize the goals of CI there is a need to– Capture domain knowledge– Use the domain knowledge to “glue” resources

together

• The WDO approach– Allows scientists (not computer programmers) to

incrementally capture knowledge as needed– Facilitates communication between scientists and

computer programmers to produce CI resources that “stick” to other resources

Page 32: Workflow-Driven Ontologies for the Geosciences

GEON Cyberinfrastructure WorkshopBeijing, China, July 21-23, 2006

Thank you