granularity in the data warehouse - building the data warehouse

20
Building The Data Building The Data Warehouse Warehouse by Inmon by Inmon Chapter 4: Granularity in the Data Warehouse http://it-slideshares.blogspot.com/

Upload: bondaigia

Post on 12-Apr-2015

107 views

Category:

Documents


8 download

DESCRIPTION

Chapter 4 Granularity in the Data WarehouseBuilding The Data Warehouse Course Description Course IntroductionData Warehouse - the modern way to build systems is to separate the operational from the informational or analytical processing and data. Today we know with certainty the following:• Data warehouses are built under a different development methodology than applications. Not keeping this in mind is a recipe for disaster.• Data warehouses are fundamentally different from data marts. The two do not mix—they are like oil and water.• Data warehouses deliver on their promise, unlike many overhyped technologies that simply faded away.• Data warehouses attract huge amounts of data, to the point that entirely new approaches to the management of large amounts of data are required.This course is discussions of specific technologies, about the analytical (the decision support systems (DSS)) environment and the structuring of data in that environment, and issues to a guideline for the designer and the developer.

TRANSCRIPT

Page 1: Granularity in the Data Warehouse - Building the Data Warehouse

Building The Data Building The Data WarehouseWarehouseby Inmonby Inmon

Chapter 4: Granularity in the Data Warehouse

http://it-slideshares.blogspot.com/

Page 2: Granularity in the Data Warehouse - Building the Data Warehouse

4.04.0 Introduce - Granularity in the Data Introduce - Granularity in the Data WarehouseWarehouse

Determining the proper level of granularity of the data that will reside in the data warehouse.

Granularity is important to the warehouse architect because it affects all the environments that depend on the warehouse for data.

Page 3: Granularity in the Data Warehouse - Building the Data Warehouse

4.1 Raw Estimates4.1 Raw Estimates

The raw estimate of the number of rows of data that will reside in the data warehouse tells the architect a great deal.

Page 4: Granularity in the Data Warehouse - Building the Data Warehouse

4.2 Input to the Planning Process4.2 Input to the Planning Process

The estimate of rows and DASD then serves as input to the planning process

Page 5: Granularity in the Data Warehouse - Building the Data Warehouse

4.3 Data in Overflow4.3 Data in Overflow

Compare the total number of rows in the warehouse environment:

Page 6: Granularity in the Data Warehouse - Building the Data Warehouse

4.3 Data in Overflow (ct)4.3 Data in Overflow (ct)

There will be more expertise available in managing the data warehouse volumes of data.

Hardware costs will have dropped to some extent.

More powerful software tools will be available.

The end user will be more sophisticated.

Page 7: Granularity in the Data Warehouse - Building the Data Warehouse

4.3.1 Overflow Storage 4.3.1 Overflow Storage

Page 8: Granularity in the Data Warehouse - Building the Data Warehouse

4.3.1 Overflow Storage (ct)4.3.1 Overflow Storage (ct)

Page 9: Granularity in the Data Warehouse - Building the Data Warehouse

4.4 What the Levels of Granularity 4.4 What the Levels of Granularity Will BeWill Be

Page 10: Granularity in the Data Warehouse - Building the Data Warehouse

4.5 Some Feedback Loop Techniques4.5 Some Feedback Loop TechniquesFollowing are techniques to make the

feedback loop harmonious:Build the first parts of the data

warehouse in very small, very fast steps, and carefully listen to the end users’ comments at the end of each step of development. Be prepared to make adjustments quickly.

If available, use prototyping and allow the feedback loop to function using observations gleaned from the prototype.

Page 11: Granularity in the Data Warehouse - Building the Data Warehouse

4.5 Some Feedback Loop Techniques 4.5 Some Feedback Loop Techniques (ct)(ct)

Look at how other people have built their levels of granularity and learn from their experience.

Go through the feedback process with an experienced user who is aware of the process occurring. Under no circumstances should you keep your users in the dark as to the dynamics of the feedback loop.

Look at whatever the organization has now that appears to be working, and use those functional requirements as a guideline.

Execute joint application design (JAD) sessions and simulate the output to achieve the desired feedback.

Page 12: Granularity in the Data Warehouse - Building the Data Warehouse

4.5 Some Feedback Loop Techniques 4.5 Some Feedback Loop Techniques (ct)(ct)

Granularity of data can be raised in many ways, such as the following:

Summarize data from the source as it goes into the target.

Average or otherwise calculate data as it goes into the target.

Push highest and/or lowest set values into the target.

Push only data that is obviously needed into the target.

Use conditional logic to select only a subset of records to go into the target.

Page 13: Granularity in the Data Warehouse - Building the Data Warehouse

4.6 Levels of Granularity—Banking 4.6 Levels of Granularity—Banking EnvironmentEnvironment

Page 14: Granularity in the Data Warehouse - Building the Data Warehouse

4.6 Levels of Granularity—Banking Environment 4.6 Levels of Granularity—Banking Environment (ct)(ct)

Page 15: Granularity in the Data Warehouse - Building the Data Warehouse

4.6 Levels of Granularity—Banking Environment 4.6 Levels of Granularity—Banking Environment (ct)(ct)

Page 16: Granularity in the Data Warehouse - Building the Data Warehouse

4.6 Levels of Granularity—Banking Environment 4.6 Levels of Granularity—Banking Environment (ct)(ct)

Page 17: Granularity in the Data Warehouse - Building the Data Warehouse

4.6 Levels of Granularity—Banking Environment 4.6 Levels of Granularity—Banking Environment (ct)(ct)

Page 18: Granularity in the Data Warehouse - Building the Data Warehouse

4.6 Levels of Granularity—Banking 4.6 Levels of Granularity—Banking Environment (ct)Environment (ct)

Page 19: Granularity in the Data Warehouse - Building the Data Warehouse

4.7 Feeding the Data Marts4.7 Feeding the Data Marts

Specification level of granularity the data marts will need. The data that resides in the data warehouse must be at the lowest level of granularity needed by any of the data marts.

Page 20: Granularity in the Data Warehouse - Building the Data Warehouse

4.8 Summary4.8 Summary

Choosing the proper levels of granularity for the architected environment is vital to success.

The worst stance that can be taken is to design all the levels of granularity a priori, and then build the data warehouse.

The process of granularity design begins with a raw estimate of how large the warehouse will be on the one-year and the five-year horizon.

There is an important feedback loop for the data warehouse environment.

Another important consideration is the levels of granularity needed by the different architectural components that will be fed from the data warehouse.

http://it-slideshares.blogspot.com/