proper architecture considerations for data warehouse ... · pdf filewarehouse methodology...

29
Copyright © 2002, SAS Institute Inc. All rights reserved. Proper Architecture Considerations For Data Warehouse Implementation Mark Nilan Data Warehouse Product Manager SAS Asia Pacific

Upload: vankien

Post on 31-Jan-2018

215 views

Category:

Documents


0 download

TRANSCRIPT

Copyright © 2002, SAS Institute Inc. All rights reserved.

Proper Architecture Considerations ForData Warehouse Implementation

Mark NilanData Warehouse Product ManagerSAS Asia Pacific

Copyright © 2002, SAS Institute Inc. All rights reserved.

2

Why Do we Need This? How Do We Do

This?

Explaining The Intelligence Layer

Copyright © 2002, SAS Institute Inc. All rights reserved.

3

Agenda

How Do We Architect A Solution Properly?! The Packaged DW Solutions That SAS Provides! The Building Products For The Solution! Architectural Blueprints For Data Quality & Transformation! Layers Of ConstructionWhy Do We Need An Architectural Approach?! Risk Reduction! Simplicity & Efficiency! Approach To Project Deliverables! Metadata Management

Copyright © 2002, SAS Institute Inc. All rights reserved.

4

Explaining The Intelligence Layer

Why Do We Need This? How Do We Do

This?

Copyright © 2002, SAS Institute Inc. All rights reserved.

5

Packaged Architecture

Intelligence Value Chain

DATA

INTELLIGENCE

Copyright © 2002, SAS Institute Inc. All rights reserved.

6

Intelligence Value Chain

DATA

INTELLIGENCE

Copyright © 2002, SAS Institute Inc. All rights reserved.

7

Intelligence Value Chain

Architecture ComponentsRapid

Warehouse Methodology& TIS, BIS, IIS

BASE SAS Or DWA Generated

Code & SAS Cleanse

SAS MVA Server, MEA DBMS SPD Server

Enterprise GuideWEB AF/WEB EIS

SAS IDP

Enterprise Miner,ETS, OR, Insight

& SAS Stat

DATA

INTELLIGENCE

Copyright © 2002, SAS Institute Inc. All rights reserved.

8

Where Do We Start?Use SAS Rapid Warehousing Methodology.

! Created from:• The collected experience of

hundreds of SAS projects• Industry ‘best practice’.

! Successful because:• Business focused• Breaks project down into small,

defined, manageable pieces rather than large chunks.

• Iterative, flexible and dynamic• Uses Joint Application

Development and Rapid Application Development delivery techniques

Methodology - Seven Phases

On-going maintenance and Administration

Assessment

Requirements

Design

Deployment

Review

Final Test

Construction

Copyright © 2002, SAS Institute Inc. All rights reserved.

9

Copyright © 2002, SAS Institute Inc. All rights reserved.

10

There are currently three available:-!SAS® Telecommunications Intelligence Solution!SAS® Banking Intelligence Solution!SAS® Insurance Intelligence Solution

Intelligence Solutions

Copyright © 2002, SAS Institute Inc. All rights reserved.

11

!Industry specific solutions - customizable to fit customer’s business environment

!Analytical models!Dimensional models!Business rules

! Data Models - Logical and physical data models ! Methodologies - best practice guidelines !System Integration Services - experienced consulting and implementation services

Assemble To Order

Copyright © 2002, SAS Institute Inc. All rights reserved.

12

Rapid Warehouse

Methodology

BASE SAS Or DWA Generated

Code

SAS MVA Server, MEA DBMS

SPDS

Enterprise GuideWEB AF/WEB EIS

SAS Intrnet

Enterprise Miner,ETS, OR, Insight

& SAS Stat

DATA

INTELLIGENCE

Architecture Components

Copyright © 2002, SAS Institute Inc. All rights reserved.

13

Architecture Components

Copyright © 2002, SAS Institute Inc. All rights reserved.

14

Ope

ratio

nal D

ata

Defin

ition

sO

pera

tiona

l Dat

a De

finiti

ons

ODDODD

Cha

nged

Dat

a C

aptu

reC

hang

ed D

ata

Cap

ture

CDCCDC

Valid

ated

and

Cle

anse

dVa

lidat

ed a

nd C

lean

sed

StagingStagingLayerLayer

Inte

grat

ed, D

eriv

ed a

nd T

rans

form

edIn

tegr

ated

, Der

ived

and

Tra

nsfo

rmed

Stag

edSt

aged

EnterpriseEnterpriseLayerLayer

Det

ail D

ata

Det

ail D

ata

Foun

datio

n D

ata

Foun

datio

n D

ata

Sum

mar

y D

ata

Sum

mar

y D

ata

Exce

ptio

nsEx

cept

ions

ErrorError

Mar

t Dat

aM

art D

ata

MartMartLayerLayer

RawRaw

Extr

acte

dEx

trac

ted

Copyright © 2002, SAS Institute Inc. All rights reserved.

15

RawODDCDC

ErrorStaging

Enterprise

Mart

Copyright © 2002, SAS Institute Inc. All rights reserved.

16

Raw ODDCDCError

Staging

Enterprise

Mart

Data is extracted from the source systems using the appropriate extraction adaptor. Some data will be operational system dumps and will reside in a RAW data area.

The ODD Area contains SAS views of RBDMS tables or SAS tablesCDC contains SAS tables of the previous loads data for comparison in change data capture. Or EAI queues, log captures or whatever method…

Exceptions area will contain all erroneous records in related tables from any of the validation processes

STAGING Area contains Tables that are waiting for other Tables to allow a consistent slice for warehouse loading. Further processing of this data may happen after all the data required for the slice is present.

Enterprise Area contains the Fact and Dimension tables of the Historic Normal Form Schema. The tables at this point are at source systems granularity (i.e. stores lowest available level of granularity). The SUMMARY Mart area stores system summaries of Enterprise layer tables. It is the key part of the Intelligence Architecture.

Copyright © 2002, SAS Institute Inc. All rights reserved.

17

So a real life Architecture may be:

Copyright © 2002, SAS Institute Inc. All rights reserved.

But Why Use An Architected Approach?

Why Do we Need This?

How Do We Do This?

Copyright © 2002, SAS Institute Inc. All rights reserved.

19

!The SAS Intelligence Architecture is a low-risk framework that integrates our industry-leading capabilities in data warehousing, advanced analytics, and business intelligence to provide…

an intelligence framework that can grow and adjust with yourenterprise.

Lower Risk

Copyright © 2002, SAS Institute Inc. All rights reserved.

20

Independent Marts = Higher Risk

Inventory

Product

Orders

Contacts

Billing

Marketing

Finance

Sales

Personal datamarts

External data

Metadata

WEB

WAP telephone

PDA

Workstation

WWW browser

Operational datastore

An Information Architecture based on Data MartsWithout an architectural approach

Metadata

?

Metadata

Operational Systems Departmental data

data marts

Copyright © 2002, SAS Institute Inc. All rights reserved.

21

Proper Intelligence ArchitectureWHY Because it is the foundation –solidarity is required in its nature….

! A gateway to all the functionality of the intelligence layers above it

! The foundation for quality! Repeatable, “assembled to order”

yet unique! ROI measurement as a base is

easier to compare to other packaged DW and Datamarts

Copyright © 2002, SAS Institute Inc. All rights reserved.

22

Aspects of Warehousing Projects

Architecture

Tools

Methodologies

People/Skills

Copyright © 2002, SAS Institute Inc. All rights reserved.

23

Aspects of Warehousing Projects

Architecture

Tools

Methodologies

People/Skills

Copyright © 2002, SAS Institute Inc. All rights reserved.

24

Metadata Management Architecture

Copyright © 2002, SAS Institute Inc. All rights reserved.

25

Metadata Management Architecture

Metadata – Is it the DW Life Saver?

Copyright © 2002, SAS Institute Inc. All rights reserved.

26

The Metadata Lifesaver?! Common metadata

architecture is also of paramount importance…

! SAS V8 Hub & Spoke is common across the Blueprint at all levels

! SAS V9 supports CWM standard XML based metadata

! Without common metadata, the successful data warehouse cause is lost!

Copyright © 2002, SAS Institute Inc. All rights reserved.

Conclusion

Two questions for consideration…How Do I Properly Architect A Solution?! Use SAS. SAS is moving from a ‘build to order’ mentality to an assemble to order architectural approach! SAS offers proven rapid DW methodology & Intelligence Solutions in Banking, Telco, Insurance. Other industries coming! Architecture includes not just design, it is an instruction list of how to construct & extract source data, assemble changed data, staged data, detail data, summary data and data marts. ! The design & instruction includes the ‘plumbing’ too. That is Enterprise Application Integration, like MQ series, DCOM & CORBA agent technologies and traditional TCP

Copyright © 2002, SAS Institute Inc. All rights reserved.

Why Do I Need An Architectural Approach?! Given that the ‘HOW’ consisted of a blueprint design and process instructions to deliver, what benefit does this give me?! Lower risk – follow the instructions in the methodology and industry solutions. This is an ‘on-ramp’ to success.! Complex data warehousing is assembling pieces like design, processes, people & tools in an iterative fashion. Done once – done many times. This eases project implementation pains. ! In a sea of diversity, the metadata manager is a life preserver for the warehouse architect – it provides centralised clarity and control to provide management of the architectural approach.

Conclusion

Copyright © 2002, SAS Institute Inc. All rights reserved.

29

Assemble To Order

Intelligence SolutionsBlueprint Design