semantic types webinar slides
TRANSCRIPT
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 1/21
expressor Thought leadership webinarSemantic Types: Making ETL data mapping simpler and easier to maintain
• Bill Kehoe, Chief Architect
• Wiqar Chaudry, Product Manager
www.expressor-software.com
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 2/21
copyright 2011 expressor software corporation2
Bill Kehoe
Bill Kehoe is a founding engineer at expressor and has been a keydeveloper since the original 1.0 version of the product. He is now achief architect providing technical leadership on all aspects of productengineering. Previously Bill was an architect at Blue Agave Software where he wasthe lead developer for the data sub-system of a supply chainmanagement product. Bill also held a senior architect role at Versataand a senior developer and program manager at Sybase. At Sybase,he architected and led the development for SQL Debug, a clientserver application for debugging SYBASE Transact-SQL storedprocedures.
Bill graduated Magna Cum Laude from Tufts University in CivilEngineering and is a member of the Tau Beta Pi Engineering honorsociety. He has also done post graduate work at Harvard University.
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 3/21
copyright 2011 expressor software corporation3
Today’s Agenda
expressor Studio 3.1 product overview Semantic Types
What are they?
Sample Application
Studio DemoMultiple data source formats mapped to a
common semantic type
2011 product roadmap Q&A
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 4/21
copyright 2011 expressor software corporation4
expressor Studio 3.1
Free & downloadable
Integrated visual development studio for ETLapplications
Built-in productivity tools and wizards– Automatically capture DB connectivity details
– Automatically capture metadata
– Access to standard business rules, e.g. date formatconversions
– Automatic type conversion
Just-in-time error notification and help
Graphical library system for reusing design
assets Fast data processing engine
Semantic Types– Field name and type standardization capabilities
– More to come in upcoming 3.x releases
www.expressorStudio.com
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 5/21
copyright 2011 expressor software corporation5
expressor
Semantic Types
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 6/21
copyright 2011 expressor software corporation6
Traditional Point-to-point vs. Canonical Mapping
Point-to-point Mapping(traditional ETL)
Semantic Types improvetime-to-value through
greater reuse andsimplified data mappings
Canonical Mapping(expressor)
SemanticType
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 7/21copyright 2011 expressor software corporation7
What is a Semantic Type?
An abstract data interface independent of:
Physical formatting details such as delimiters,character encoding, field formats like date, time,currency, etc.
Defines the logical structure of data, free ofpositional constraints (i.e. field order independent)
Enables rule expression (constraints, data quality,transformations and derivations) independent of
external interface “baggage” Enables rapid application assembly
Eases Data Governance / Data Lineage Tracking
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 8/21copyright 2011 expressor software corporation8
Physical vs. Semantic
Physical metadata Semantic metadata
FieldColumn
[Leaf] Element
[Attribute of]
Atomic Semantic Type
CSV RecordDatabase Table
Array
Composite Semantic Type
[Non-Leaf] Element
Nested Table
Nested Composite Type(i.e. Composite Attribute)
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 9/21copyright 2011 expressor software corporation9
Recurring Data Integration challenge
One logical target schema
Apply common set of rules to ensure logical dataintegrity, but …
Input data is messy!
Dozens of external formats to consume
Data quality varies depending on data source
Goal: Easily assemble a single, maintainableapplication that can consume all sources of dataand supports extensions over time
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 10/21copyright 2011 expressor software corporation10
Sample application: Exploration expenses
Data warehouse for analyzing energyexploration expenses
Sub-contractors used for site work
One target data warehouse table
One set of validation rules
Multiple, contractor-specific expense dataformats
Contract_ID : integer
Item_ID: varchar(20)Item_Description : varchar(1024)
ExpType : varchar(10)
Amount : decimal(10,2)
StartDepth : Integer
EndDepth : integer
ExpenseDate : date
EXPENSE
Contractor1
Contractor2
Contractor3
Validation
Rules
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 11/21copyright 2011 expressor software corporation11
ContractExpense
Semantic Type example: ContractExpense
ContractIdentifierString
ItemIdentifierString
ItemDescriptionString
ExpenseTypeString
MonetaryAmountDecimal
BookDateDatetime
DepthInteger
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType Amount
ExpenseBookDate
EndDepth
StartDepth
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 12/21copyright 2011 expressor software corporation12
Semantic Type example: ContractExpense
Contractor 1 Data Format
ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth
Schema: Contractor1
ContractExpense
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType
MonetaryAmount
BookDate
Depth
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType Amount
ExpenseBookDate
EndDepth
StartDepth
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 13/21copyright 2011 expressor software corporation13
ContractExpense
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType
MonetaryAmount
BookDate
Depth
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType Amount
ExpenseBookDate
EndDepth
StartDepth
Schema: Contractor1
Semantic Type example: ContractExpense
Contractor 1 Data Format
ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 14/21
copyright 2011 expressor software corporation14
ContractExpense
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType
MonetaryAmount
BookDate
Depth
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType Amount
ExpenseBookDate
EndDepth
StartDepth
Semantic Type example: ContractExpense
Contractor 1 Data Format
ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth
Contractor 2 Data Format
Contract Date ItemId Amount ItemDescription ExpType StartDepth EndDepth
Schema: Contractor1
Schema: Contractor2
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 15/21
copyright 2011 expressor software corporation15
ContractExpense
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType
MonetaryAmount
BookDate
Depth
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType Amount
ExpenseBookDate
EndDepth
StartDepth
Semantic Type example: ContractExpense
Contractor 1 Data Format
ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth
Contractor 2 Data Format
Contract Date ItemId Amount ItemDescription ExpType StartDepth EndDepth
Schema: Contractor1
Schema: Contractor2
Different fieldorders, date andnumeric valueformats
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 16/21
copyright 2011 expressor software corporation16
ContractExpense
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType
MonetaryAmount
BookDate
Depth
ContractIdentifier
ItemIdentifier
ItemDescription
ExpenseType Amount
ExpenseBookDate
EndDepth
StartDepth
Schema: Contractor2
Semantic Type example: ContractExpense
Contractor 1 Data Format
ContractId ItemId ItemDescription ExpType Amount Date StartDepth EndDepth
Contractor 2 Data Format
Contract Date ItemId Amount ItemDescription ExpType StartDepth EndDepth
9.999,99
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 17/21
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 18/21
copyright 2011 expressor software corporation18
expressor Studio – Multiple data source formats
mapped to a common Semantic Type
Demo
On demand Webinar Link: http://bit.ly/fW27F7
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 19/21
copyright 2011 expressor software corporation19
Download our
Semantic TypesWhite paper
www.expressor-
software.com/semantic-types
Semantic Types white paper
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 20/21
copyright 2011 expressor software corporation20
Thank You!
Questions?
8/7/2019 Semantic Types Webinar Slides
http://slidepdf.com/reader/full/semantic-types-webinar-slides 21/21
Thank You!
www.expressor-software.com