introduction to ogsa-dai neil chue hong ogsa-dai project manager 14 th february 2006 ggf16, athens

12
Introduction to OGSA- DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

Upload: dorothy-patrick

Post on 29-Jan-2016

225 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

Introduction to OGSA-DAI

Neil Chue HongOGSA-DAI Project Manager

14th February 2006GGF16, Athens

Page 2: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

2

Data Services: challenges Scale

Many sites, large collections, many uses Longevity

Research requirements outlive technical decisions Diversity

No “one size fits all” solutions will work Primary Data, Data Products, Meta Data, Administrative data, …

Many Data Resources Independently owned & managed Geographically distributed

and I haven’t even mentioned security yet!

Page 3: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

3

Use Cases for Data Services Data Filtering:

Single source producing large amounts of data distributed to many sites downstream

Data Discovery: many sources, many query entry points in a linked system

Data Translation: source to sink, conversion of data model / structure

Data Federation: many sources, linked to provide view as a single source

Data Replication full or partial copies to improve throughput

Data Integration (model aggregation) e.g. integration of time variant data, streams, files

Data Integration (knowledge expansion) forming links between databases to increase knowledge

Page 4: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

5

Requirements on Data Services? Common Data Model e.g. RowSet

Common Query Language(s) e.g. XQuery, SQL

Standard access to data resource schema information physical data resource information for optimisation purposes data resource descriptive information for discovery / integration

Single, seamless security model

Dynamic publication and discovery

Multiple, efficient delivery methods

Move computation towards data

Data aggregation functionality

Replication information

Page 5: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

6

OGSA-DAI In One Slide An engineered extensible framework

for data access and integration. Expose heterogeneous data

resources to a grid through web services.

Interact with data resources: Queries and updates. Data transformation / compression Data delivery.

Customise for your project using Additional Activities Client Toolkit APIs Data Resource handlers

A base for higher-level services federation, mining, visualisation,…

Page 6: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

7

MySQL

OGSA-DAI service

Engine

SQLQuery

JDBCData

Resources

Activities

DB2

GZip GridFTPXPath

XMLDB

XIndice

readFile

File

SWISSPROT

XSLT

SQLServer

Data-bases

ApplicationApplicationClient ToolkitClient Toolkit

Page 7: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

8

MySQL

OGSA-DAI service

Engine

SQLQuery

JDBC

SQL

JDBC

SQL

JDBC

SQL

JDBC

SQL

JDBC

MultipleSQL GDS

SQLQuery

Page 8: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

9

Distributed Query Processing Higher level services

building on OGSA-DAI Queries mapped to

algebraic expressions for evaluation

Parallelism represented by partitioning queries Use exchange operators

table_scan(protein)

table_scantermID=S92(proteinTerm)

reduce

reduce

hash_join(proteinId)

op_call(Blast)

reduce

exchange

exchange

3,4

1 2

Page 9: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

10

DQP architecture

Co-ordinator

Evaluator Evaluator Evaluator

OGSA-DAI

OGSA-DAI

OGSA-DAI

OGSA-DAI

Query SQL & OQL

OGSA-DAI activity

WS-I only

Using client toolkit

All interfaces that aresupported by toolkit

Page 10: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

11

Contributing to OGSA-DAI

Additional functionality: Provide activities which implement specific

functionality Provide extra client functionality Provide different security mechanisms Provide higher level components and applications

Different levels of contributions Based on OGSA-DAI? Works with OGSA-DAI? Part of OGSA-DAI?

Page 11: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

12

Future plans A new version of the OGSA-DAI Engine

better support for concurrency, sessions, monitoring and notification

Implementing new DAIS specifications Key things that we will be addressing:

Performance (particularly format representation and transport) Security Model which can be applied across platforms Transactions provision More data integration facilities

Integration with other components registries (e.g. GRIMOIRES) workflow editors (e.g. Taverna)

Working with new projects e.g. CancerGrid, iSpider, GEODE

Page 12: Introduction to OGSA-DAI Neil Chue Hong OGSA-DAI Project Manager 14 th February 2006 GGF16, Athens

GGF16, Feb 2006 ©

13

Further information The OGSA-DAI Project Site:

http://www.ogsadai.org.uk The DAIS-WG site:

http://forge.gridforum.org/projects/dais-wg/ OGSA-DAI Users Mailing list

[email protected] General discussion on grid DAI matters

Formal support for OGSA-DAI releases http://bugs.ogsadai.org.uk

OGSA-DAI training courses