semantic types webinar slides

expressor Thought leadership webinar Semantic Types: Making ETL data mapping simpler and easier to maintain Bill Kehoe, Chief Architect Wiqar Chaudry, Product Manager www.expressor-software.com

Upload: expressor-software

Post on 08-Apr-2018

TRANSCRIPT

Page 1: Semantic Types Webinar Slides

8/7/2019 Semantic Types Webinar Slides

http://slidepdf.com/reader/full/semantic-types-webinar-slides 1/21

expressor Thought leadership webinar

Semantic Types: Making ETL data mapping simpler and easier to maintain

• Bill Kehoe, Chief Architect

• Wiqar Chaudry, Product Manager

www.expressor-software.com

Page 2: Semantic Types Webinar Slides

copyright 2011 expressor software corporation

Bill Kehoe

Bill Kehoe is a founding engineer at expressor and has been a key developer since the original 1.0 version of the product. He is now a chief architect providing technical leadership on all aspects of product engineering. Previously, Bill was an architect at Blue Agave Software, where he was the lead developer for the data sub-system of a supply chain management product. Bill also held a senior architect role at Versata and was a senior developer and program manager at Sybase. At Sybase, he architected and led the development of SQL Debug, a client-server application for debugging SYBASE Transact-SQL stored procedures.

Bill graduated magna cum laude from Tufts University in Civil Engineering and is a member of the Tau Beta Pi engineering honor society. He has also done postgraduate work at Harvard University.

Page 3: Semantic Types Webinar Slides

Today’s Agenda 

expressor Studio 3.1 product overview

Semantic Types

– What are they?

– Sample Application

Studio Demo

– Multiple data source formats mapped to a common semantic type

2011 product roadmap

Q&A

Page 4: Semantic Types Webinar Slides

expressor Studio 3.1

Free & downloadable

Integrated visual development studio for ETL applications

Built-in productivity tools and wizards

– Automatically capture DB connectivity details

– Automatically capture metadata

– Access to standard business rules, e.g. date format conversions

– Automatic type conversion

Just-in-time error notification and help

Graphical library system for reusing design assets

Fast data processing engine

Semantic Types

– Field name and type standardization capabilities

– More to come in upcoming 3.x releases

www.expressorStudio.com

Page 5: Semantic Types Webinar Slides

expressor

 

Semantic Types  

Page 6: Semantic Types Webinar Slides

Traditional Point-to-point vs. Canonical Mapping

Point-to-point Mapping (traditional ETL)

Semantic Types improve time-to-value through greater reuse and simplified data mappings

Canonical Mapping (expressor)

Semantic Type
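As a rough illustration of the reuse argument on this slide, the sketch below counts the mappings each approach needs. It assumes every source must reach every target, which the slide does not state, and the function names are hypothetical.

```python
def point_to_point_mappings(sources: int, targets: int) -> int:
    """Each source is mapped directly to each target."""
    return sources * targets


def canonical_mappings(sources: int, targets: int) -> int:
    """Each source and each target is mapped once to the shared semantic type."""
    return sources + targets


if __name__ == "__main__":
    # Compare the two counts for a few hypothetical (sources, targets) sizes.
    for n_sources, n_targets in [(3, 1), (3, 3), (12, 4)]:
        print(
            n_sources,
            n_targets,
            point_to_point_mappings(n_sources, n_targets),
            canonical_mappings(n_sources, n_targets),
        )
```

With a single target the mapping count alone does not favor the canonical approach. The gain in that case comes from writing rules once against the shared type rather than once per source.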

Page 7: Semantic Types Webinar Slides

What is a Semantic Type?

An abstract data interface independent of:

Physical formatting details such as delimiters, character encoding, field formats like date, time, currency, etc.

Defines the logical structure of data, free of positional constraints (i.e. field order independent)

Enables rule expression (constraints, data quality, transformations and derivations) independent of external interface “baggage”

Enables rapid application assembly

Eases Data Governance / Data Lineage Tracking
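To make "independent of physical formatting details" concrete, here is a minimal Python sketch in which two sources write the same date differently yet yield the same logical value. The format strings, the `PHYSICAL_DATE_FORMATS` table, and `to_book_date` are assumptions for illustration; the slides do not say which date formats the sources use, and this is not expressor's API.

```python
from datetime import date, datetime

# Hypothetical physical schemas: each records how its source writes dates.
PHYSICAL_DATE_FORMATS = {
    "Contractor1": "%m/%d/%Y",  # assumed, e.g. 03/15/2011
    "Contractor2": "%d.%m.%Y",  # assumed, e.g. 15.03.2011
}


def to_book_date(raw: str, schema: str) -> date:
    """Convert a source-specific date string into the logical date value."""
    return datetime.strptime(raw, PHYSICAL_DATE_FORMATS[schema]).date()


if __name__ == "__main__":
    a = to_book_date("03/15/2011", "Contractor1")
    b = to_book_date("15.03.2011", "Contractor2")
    print(a == b)
```

Once both values are plain dates, any downstream rule can be written against the logical value without knowing which source it came from.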

 

 

Page 8: Semantic Types Webinar Slides

Physical vs. Semantic

Physical metadata | Semantic metadata
Field, Column, [Leaf] Element | [Attribute of] Atomic Semantic Type
CSV Record, Database Table, Array | Composite Semantic Type
[Non-Leaf] Element, Nested Table | Nested Composite Type (i.e. Composite Attribute)

Page 9: Semantic Types Webinar Slides

Recurring Data Integration challenge

One logical target schema

Apply common set of rules to ensure logical data integrity, but …

Input data is messy!

Dozens of external formats to consume

Data quality varies depending on data source

Goal: Easily assemble a single, maintainable application that can consume all sources of data and supports extensions over time

Page 10: Semantic Types Webinar Slides

Sample application: Exploration expenses

Data warehouse for analyzing energy exploration expenses

Sub-contractors used for site work

One target data warehouse table

One set of validation rules

Multiple, contractor-specific expense data formats

 

 

EXPENSE (target table)

Contract_ID : integer
Item_ID : varchar(20)
Item_Description : varchar(1024)
ExpType : varchar(10)
Amount : decimal(10,2)
StartDepth : integer
EndDepth : integer
ExpenseDate : date

Contractor1, Contractor2, Contractor3 → Validation Rules → EXPENSE
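The slide does not list the validation rules themselves. As a hedged sketch of what "one set of validation rules" could look like, the Python below checks a record shaped like the EXPENSE table. The amount and depth rules are invented for illustration; only the 20-character limit comes from the `varchar(20)` column definition.

```python
from decimal import Decimal
from typing import List


def validate_expense(expense: dict) -> List[str]:
    """Return a list of rule violations; an empty list means the record passes."""
    errors = []
    # Hypothetical rule: expense amounts are not negative.
    if expense["Amount"] < Decimal("0"):
        errors.append("Amount must not be negative")
    # Hypothetical rule: a depth interval ends at or below where it starts.
    if expense["EndDepth"] < expense["StartDepth"]:
        errors.append("EndDepth must not be less than StartDepth")
    # From the target column definition Item_ID : varchar(20).
    if len(expense["Item_ID"]) > 20:
        errors.append("Item_ID exceeds 20 characters")
    return errors
```

Because the function sees only the logical record, the same rules apply to every contractor's data once it has been read into this shape.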

Page 11: Semantic Types Webinar Slides

Semantic Type example: ContractExpense

Atomic semantic types:

ContractIdentifier : String
ItemIdentifier : String
ItemDescription : String
ExpenseType : String
MonetaryAmount : Decimal
BookDate : Datetime
Depth : Integer

ContractExpense (composite semantic type) attributes:

ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType
Amount
ExpenseBookDate
EndDepth
StartDepth
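One way to picture the relationship between the atomic types and the composite ContractExpense type is the Python sketch below. It is an analogy, not expressor's representation. It also assumes that Amount uses MonetaryAmount, ExpenseBookDate uses BookDate, and both depth attributes use Depth, which the names suggest but the slide does not state.

```python
from dataclasses import dataclass
from datetime import datetime
from decimal import Decimal
from typing import NewType

# Atomic semantic types: a name plus a base type, as listed on the slide.
ContractIdentifier = NewType("ContractIdentifier", str)
ItemIdentifier = NewType("ItemIdentifier", str)
ItemDescription = NewType("ItemDescription", str)
ExpenseType = NewType("ExpenseType", str)
MonetaryAmount = NewType("MonetaryAmount", Decimal)
BookDate = NewType("BookDate", datetime)
Depth = NewType("Depth", int)


@dataclass(frozen=True)
class ContractExpense:
    """Composite semantic type; attributes are addressed by name, not position."""
    contract_identifier: ContractIdentifier
    item_identifier: ItemIdentifier
    item_description: ItemDescription
    expense_type: ExpenseType
    amount: MonetaryAmount            # assumed to use MonetaryAmount
    expense_book_date: BookDate       # assumed to use BookDate
    start_depth: Depth                # assumed to use Depth
    end_depth: Depth                  # assumed to use Depth
```

Reusing `Depth` for both depth attributes mirrors the slide, where seven atomic types back eight attributes.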

Page 12: Semantic Types Webinar Slides

Semantic Type example: ContractExpense

Contractor 1 Data Format

ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth

Schema: Contractor1

   

Page 14: Semantic Types Webinar Slides

Semantic Type example: ContractExpense

Contractor 1 Data Format

ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth

Contractor 2 Data Format

Contract Date ItemId Amount ItemDescription ExpType StartDepth EndDepth

Schema: Contractor1

Schema: Contractor2

Page 15: Semantic Types Webinar Slides

Semantic Type example: ContractExpense

Contractor 1 Data Format

ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth

Contractor 2 Data Format

Contract Date ItemId Amount ItemDescription ExpType StartDepth EndDepth

Schema: Contractor1

Schema: Contractor2

Different field orders, date and numeric value formats
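The field-order difference between the two contractor formats can be sketched in Python as follows. The field lists are copied from the slide. The delimiters, the name-mapping table `TO_SEMANTIC`, and the function `read_expenses` are assumptions made for illustration and are not expressor's API.

```python
import csv
import io

# Field order per contractor, copied from the slides.
CONTRACTOR1_FIELDS = ["ContractId", "ItemId", "ItemDescription", "ExpType",
                      "Amount", "Date", "StartDepth", "EndDepth"]
CONTRACTOR2_FIELDS = ["Contract", "Date", "ItemId", "Amount",
                      "ItemDescription", "ExpType", "StartDepth", "EndDepth"]

# Physical field name -> ContractExpense attribute name.
TO_SEMANTIC = {
    "ContractId": "ContractIdentifier",
    "Contract": "ContractIdentifier",
    "ItemId": "ItemIdentifier",
    "ItemDescription": "ItemDescription",
    "ExpType": "ExpenseType",
    "Amount": "Amount",
    "Date": "ExpenseBookDate",
    "StartDepth": "StartDepth",
    "EndDepth": "EndDepth",
}


def read_expenses(text, fields, delimiter):
    """Read delimited rows and key each value by its semantic attribute name."""
    reader = csv.DictReader(io.StringIO(text), fieldnames=fields, delimiter=delimiter)
    return [{TO_SEMANTIC[name]: value for name, value in row.items()}
            for row in reader]
```

After this step both sources produce records with the same attribute names regardless of column position. The values are still raw strings, so date and numeric formats have to be normalized separately.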

Page 16: Semantic Types Webinar Slides

Schema: Contractor2

Semantic Type example: ContractExpense

Contractor 1 Data Format

ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth

Contractor 2 Data Format

Contract Date ItemId Amount ItemDescription ExpType StartDepth EndDepth

9.999,99 
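A value written as 9.999,99 uses a period for thousands and a comma for decimals. A minimal Python sketch of normalizing such a value is shown below. The function name and the choice to pass the separators explicitly are assumptions, and the sketch does not handle currency symbols or signs.

```python
from decimal import Decimal


def parse_amount(raw: str, thousands_sep: str, decimal_sep: str) -> Decimal:
    """Strip the thousands separator, then switch the decimal separator to '.'."""
    normalized = raw.strip().replace(thousands_sep, "").replace(decimal_sep, ".")
    return Decimal(normalized)


if __name__ == "__main__":
    print(parse_amount("9.999,99", thousands_sep=".", decimal_sep=","))
    print(parse_amount("9,999.99", thousands_sep=",", decimal_sep="."))
```

Using `Decimal` rather than `float` keeps the two decimal places exact, which matches a target column such as `decimal(10,2)`.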

Page 18: Semantic Types Webinar Slides

expressor Studio – Multiple data source formats mapped to a common Semantic Type

Demo

On demand Webinar Link: http://bit.ly/fW27F7   

 

Page 19: Semantic Types Webinar Slides

Download our Semantic Types white paper

www.expressor-software.com/semantic-types

Page 20: Semantic Types Webinar Slides

Thank You!

 

Questions?  

Page 21: Semantic Types Webinar Slides

[email protected]

Thank You!

www.expressor-software.com