http://forge.gridforum.org/projects/dfdl-wg/

DFDL WG Session 1
Summary of Status

WG Co-Chairs:
  Mike Beckerle, Ascential Software
  Alan Chappell, PNNL
  Martin Westhead

(Two note takers please?)

Tuesday, 2004-09-21 17h30+02:00 Brussels (BE.CEST) in Room D005.

11h30 New York (US.EDT, UTC-4)

08h30 San Francisco (US.PDT, UTC-7)

Abstract

Progress and Status Review - This session will review the substantial progress made since GGF11 and provide enough context for those who have not been following along but are interested in getting involved.

ADF BoF

Sep 22, 11h00-12h30
ACCESS TO DATA IN FILES-WG CHARTER DISCUSSION BOF (DATA)
Calendar: QB (200p)
Details:
  Title: ADF - Access to Data in Files
  Session Leaders: Neil Chue Hong, Brian M Collins
  Agenda:
    - Summary of General Focus, Milestones and Objectives
    - Discussion of Relationships specifically with DFDL and DAIS as well as other GGF Groups
    - Charter Update

DFDL WG Session 2

Room D008

Thursday, 2004-09-23 11h00+02:00 Brussels (BE.CEST)

05h00 New York (US.EDT, UTC-4)

02h00 San Francisco (US.PDT, UTC-7)

Since GGF11 (June 2004)

Latest working examples
  document/jdm20040720.zip

DFDL Representation Properties: Basic Set
  document/ggf-dfdl-rep-properties-proposal-basic-001.doc

DFDL-WG Session 1 Summary of Status

Agenda
  Review of purposes/goals
  XML / XSD impact: Data Model
  Examples w/latest updates
    • as evolved now: different from primer document
  General discussion (40 mins)
  Overview of other sessions (5 mins)

Data Interchange Formats

Prescriptive: Put your data in this format!
  XML – textual
  Binary – ASN.1, XDR, NetCDF, HDF, EBML,…

Descriptive: What format is your data in?
  Commercial products
  ASN.1 Encoding Control Notation
    • ITU-T X.692
  DFDL

Why Descriptive?

Allows us to achieve two goals simultaneously:

1. Interoperability
   Modern and Legacy data formats

2. Performance!
   Density
     • Fewest bytes to represent data without resorting to compression
   Optimized I/O
     • Seekable random access
     • Memory mapped, aligned
     • Without sacrificing general access
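
As a rough illustration of the density and seekable-access points, the sketch below packs a hypothetical fixed-width record (two 32-bit integers, a 64-bit float and a 32-bit float, big-endian) and compares its size with an element-per-field XML rendering of the same values. The record layout, the sample rows, and the helper names `to_xml` and `read_record` are assumptions made for illustration; they are not taken from any DFDL proposal.

```python
import struct

# Hypothetical fixed-width record: int32, int32, float64, float32, big-endian.
# With an explicit byte order, struct adds no padding, so the size is 20 bytes.
RECORD = struct.Struct(">iidf")


def to_xml(w, x, y, z):
    """Render one record as element-per-field XML text (illustrative only)."""
    return f"<w>{w}</w><x>{x}</x><y>{y!r}</y><z>{z!r}</z>"


def read_record(buf, index):
    """Random access: record `index` starts at byte offset index * RECORD.size."""
    return RECORD.unpack_from(buf, index * RECORD.size)


# 100 sample rows, stored once as packed binary and once as XML text.
rows = [(i, i * 1000, i * 0.5, float(i)) for i in range(100)]
binary = b"".join(RECORD.pack(*row) for row in rows)
xml = "".join(to_xml(*row) for row in rows).encode("utf-8")

print("bytes per record:", RECORD.size)
print("binary size:", len(binary), "XML size:", len(xml))
print("record 42:", read_record(binary, 42))
```

Because every record has the same width, fetching record 42 needs only an offset computation, whereas the XML form has to be scanned from the start.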

Why the GGF for DFDL?

Grids are about big-data and big-computation problems

Simplistic solutions like “use XML” won’t cut it!

Grids are about universal data interchange

General Features

Basic ASCII/Binary Read capabilities
Inclusion of static info, e.g. “units”
Validated input (from XML Schema)
Defaulted input for missing values
Reference – use of a previously read value in subsequent expressions
Choice – use of a previously read value to select among format variations
Multi-layer – description of an intermediate representation not exposed in the final result
Multiple input streams
Basic Math – in DFDL expressions
Basic Math – to generate new representations/values
“Push-back” capability
New type/transform specification

Desires

Support very efficient parsers
  Hints about lengths so data can be accessed randomly
  Avoid requiring data copies
  But allow caching (e.g. based on hints)
  Allow ‘black-box’ transforms (e.g. unzip)

Support round-tripping
  Link transforms and inverses
  Infer inverses for lossless transforms

Keep simple cases simple
  Default transforms and parameters
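
One way to picture "link transforms and inverses" is a record that pairs each black-box transform with the function that undoes it, so a writer can replay a reader's chain in reverse. The sketch below is only an assumption about how that linkage could look: the `Transform` class and the `read` and `write` helpers are hypothetical, and zlib stands in for the slide's "unzip" example.

```python
import zlib
from dataclasses import dataclass
from typing import Callable, Sequence


@dataclass(frozen=True)
class Transform:
    """A black-box transform linked to its inverse (hypothetical structure)."""
    name: str
    forward: Callable[[bytes], bytes]  # applied when reading stored data
    inverse: Callable[[bytes], bytes]  # applied when writing data back out


# zlib is used here as a stand-in for the "unzip" transform named on the slide.
UNZIP = Transform("unzip", forward=zlib.decompress, inverse=zlib.compress)


def read(stored: bytes, chain: Sequence[Transform]) -> bytes:
    """Apply each transform in order to get from stored bytes to logical bytes."""
    for transform in chain:
        stored = transform.forward(stored)
    return stored


def write(logical: bytes, chain: Sequence[Transform]) -> bytes:
    """Apply the inverses in reverse order to get back to the stored form."""
    for transform in reversed(chain):
        logical = transform.inverse(logical)
    return logical


payload = b"5, 7839372, 8.6E-200, -7.1E8"
stored = write(payload, [UNZIP])
print(read(stored, [UNZIP]) == payload)
```

Keeping the pair together is what makes round-tripping mechanical: the writer needs no knowledge of what a transform does, only that an inverse is registered for it.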

Related Standards Efforts

Prescriptive systems:
  W3C binary XML (http://www.w3.org/XML/Binary/)
    • Formed, but discussion group has no items.
    • CORRECTION - there is activity now.

Descriptive systems:
  ASN.1 Encoding Control Notation
    • ITU-T X.692

XML Synergy

Use XSD to describe the logical data

Use annotations within the XSD to describe the representation of it.

XSD Types

Elements
  A.k.a. fields

Sequence groups, All groups
  All = unordered group

Choice
  A.k.a. union, redefine,

Vectors
  Use element with minOccurs, maxOccurs.

Nillability
  A.k.a. Nullable values

XML/XSD – basic types

anySimpleType
  string
    normalizedString
      token
        language
        Name
          NCName
            ID
            IDREF
              IDREFS
            ENTITY
              ENTITIES
        NMTOKEN
          NMTOKENS
  QName
  NOTATION
  float
  double
  decimal
    integer
      long
        int
          short
            byte
      nonPositiveInteger
        negativeInteger
      nonNegativeInteger
        positiveInteger
        unsignedLong
          unsignedInt
            unsignedShort
              unsignedByte
  boolean
  base64Binary
  hexBinary
  anyURI
  date
  time
  dateTime
  gYear
  gYearMonth
  gMonth
  gMonthDay
  gDay
  duration

Example 1: XML

<w>5</w>
<x>7839372</x>
<y>8.6E-200</y>
<z>-7.1E8</z>

Example 1: XSD

<xs:sequence>
  <xs:element name="w" type="int"/>
  <xs:element name="x" type="int"/>
  <xs:element name="y" type="double"/>
  <xs:element name="z" type="float"/>
</xs:sequence>

Example 1 DFDL - binary

0000 0005 0077 9e8c

169a 54dd 0a1b 4a3f

ce29 46f6

Example 1 DFDL - binary

<xs:complexType name="example1">
  <xs:annotation>
    <xs:appinfo>
      <repType>binary</repType>
      <byteOrder>bigEndian</byteOrder>
    </xs:appinfo>
  </xs:annotation>
  <xs:sequence>
    <xs:element name="w" type="int"/>
    <xs:element name="x" type="int"/>
    <xs:element name="y" type="double"/>
    <xs:element name="z" type="float"/>
  </xs:sequence>
</xs:complexType>
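
To see how such an annotation could drive a reader, the sketch below parses the schema with Python's standard library, turns `repType` and `byteOrder` plus the element types into a `struct` layout, and applies it to the 20 bytes from the binary slide. This is a toy interpreter written for illustration, not a DFDL implementation: the `xmlns:xs` declaration is added so the fragment parses on its own, and the type-to-size mapping (32-bit `int`, IEEE 754 `float` and `double`, no padding) is an assumption that happens to match the 20-byte dump.

```python
import struct
import xml.etree.ElementTree as ET

XS = "{http://www.w3.org/2001/XMLSchema}"

# The slide's schema, with a namespace declaration added so it parses standalone.
SCHEMA = """
<xs:complexType name="example1" xmlns:xs="http://www.w3.org/2001/XMLSchema">
  <xs:annotation>
    <xs:appinfo>
      <repType>binary</repType>
      <byteOrder>bigEndian</byteOrder>
    </xs:appinfo>
  </xs:annotation>
  <xs:sequence>
    <xs:element name="w" type="int"/>
    <xs:element name="x" type="int"/>
    <xs:element name="y" type="double"/>
    <xs:element name="z" type="float"/>
  </xs:sequence>
</xs:complexType>
"""

# Assumed mapping from XSD type names to struct codes (int32, float64, float32).
TYPE_CODES = {"int": "i", "double": "d", "float": "f"}
# Only the byte order that appears on the slide is handled.
ORDER_CODES = {"bigEndian": ">"}


def build_layout(schema_text):
    """Derive field names and a struct layout from the annotated schema."""
    root = ET.fromstring(schema_text)
    appinfo = root.find(f"{XS}annotation/{XS}appinfo")
    if appinfo.findtext("repType") != "binary":
        raise ValueError("this sketch only handles repType=binary")
    order = ORDER_CODES[appinfo.findtext("byteOrder")]
    elements = root.findall(f"{XS}sequence/{XS}element")
    names = [element.get("name") for element in elements]
    codes = "".join(TYPE_CODES[element.get("type")] for element in elements)
    return names, struct.Struct(order + codes)


def parse(schema_text, data):
    """Decode one record into a dict keyed by element name."""
    names, layout = build_layout(schema_text)
    return dict(zip(names, layout.unpack(data)))


# Bytes copied from the binary slide (bytes.fromhex skips the spaces).
SLIDE_BYTES = bytes.fromhex("0000 0005 0077 9e8c 169a 54dd 0a1b 4a3f ce29 46f6")
print(parse(SCHEMA, SLIDE_BYTES))
```

The first eight bytes decode to w = 5 and x = 7839372, and the last four to z = -7.1E8, matching the XML form of Example 1.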

Example 1 DFDL - textual

“5, 7839372, 8.6E-200, -7.1E8”

Example 1 DFDL - textual

<xs:complexType name="example1">
  <xs:annotation>
    <xs:appinfo>
      <repType>text</repType>
      <charset>UTF-8</charset>
      <decimalSeparator>.</decimalSeparator>
      <separator>,</separator>
    </xs:appinfo>
  </xs:annotation>
  <xs:sequence>
    <xs:element name="w" type="int"/>
    <xs:element name="x" type="int"/>
    <xs:element name="y" type="double"/>
    <xs:element name="z" type="float"/>
  </xs:sequence>
</xs:complexType>
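
The textual properties can be read the same way. In the sketch below the property values are copied by hand from the annotation above rather than parsed from the schema, and `parse_text` is a hypothetical helper. Trimming the whitespace that follows each separator is an assumption made so that the sample line from the textual slide parses; the slides do not say how such whitespace should be treated.

```python
# Property values copied from the textual annotation.
PROPS = {"charset": "UTF-8", "decimalSeparator": ".", "separator": ","}

# Element names and Python conversions for int, int, double, float.
FIELDS = [("w", int), ("x", int), ("y", float), ("z", float)]


def parse_text(raw, props=PROPS, fields=FIELDS):
    """Decode one separated text record into a dict keyed by element name."""
    text = raw.decode(props["charset"])
    # Assumption: whitespace around each token is not significant.
    tokens = [token.strip() for token in text.split(props["separator"])]
    if len(tokens) != len(fields):
        raise ValueError(f"expected {len(fields)} fields, got {len(tokens)}")
    record = {}
    for (name, convert), token in zip(fields, tokens):
        if convert is float:
            # Normalize the declared decimal separator to the "." Python expects.
            token = token.replace(props["decimalSeparator"], ".")
        record[name] = convert(token)
    return record


print(parse_text(b"5, 7839372, 8.6E-200, -7.1E8"))
```

Changing only the properties, for example a `;` separator with a `,` decimal separator, lets the same function read `1;2;3,5;4,25` without touching the logical field list, which is the separation between logical data and representation that the annotations aim for.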

More Examples

Currently working on a set of examples which illustrate each of the DFDL constructs

Examples (so far)

BasicMathInExpressions.xsd
BasicMathInRepresentations.xsd
Choice.xsd
DefaultValuePropagation.xsd
DefaultedInput.xsd
IncludeTransform.xsd
IncludeType.xsd & NewDFDLType.xsd
MultiLayer.xsd
MultiStreams.xsd
NewDFDLTransform.xsd
NewExternalDFDLTransform.xsd
NewMixedDFDLTransform.xsd
Reference.xsd
StaticInformation.xsd
ValidatedInput.xsd

Current Status

Fairly clear idea of required features and data types

Large library of simple, real world, and pathological examples

Converging on an annotated XML Schema syntax for DFDL

Status

We think we have:
  A good handle on requirements/use cases
  Common idea of functionality/overall design
  A not quite complete syntax based on XML Schema
  A set of “unit-test” examples to help in finalizing the syntax

Opportunities

Good time to help with syntax

Once that exists:
  Standardized language bindings/APIs
  DFDL implementations
  DFDL-based ADF Data Virtualization Services
  Extensions for specific domains/uses…

Next Steps

Session 2:
  Charter-revisions discussion
  Review of specific examples
  IBM Websphere Business Integrator DFDL presentation

END