datawarehouse & bi introduction

25
Dataware Housing & Business Intelligence An Overview By Shivmohan Purohit

Upload: shivmohan-purohit

Post on 07-Nov-2014

1.321 views

Category:

Technology


6 download

DESCRIPTION

Data warehouse and Business Intelligence Introduction

TRANSCRIPT

Page 1: Datawarehouse & bi introduction

Dataware Housing &

Business IntelligenceAn Overview

By Shivmohan Purohit

Page 2: Datawarehouse & bi introduction

Agenda

• Introduction• Data Warehousing• Online Analytical Processing• Data Mining• Q & A

2

Page 3: Datawarehouse & bi introduction

What a firm/ Organization want to know….

3

Which are our lowest/highest margin

customers ?

Which are our lowest/highest margin

customers ?

Who are my customers and what products are they buying?

Who are my customers and what products are they buying?

Which customers are most likely to go to the competition ?

Which customers are most likely to go to the competition ?

What impact will new products/services

have on revenue and margins?

What impact will new products/services

have on revenue and margins?

What product prom--otions have the biggest

impact on revenue?

What product prom--otions have the biggest

impact on revenue?

What is the most effective distribution

channel?

What is the most effective distribution

channel?

Page 4: Datawarehouse & bi introduction

What is a Data Warehouse?

A single, complete and consistent store of data obtained from a variety of different sources made available to end users in a what they can understand and use in a business context.

4

Page 5: Datawarehouse & bi introduction

What is Data Warehousing?

A process of transforming data into information and making it available to users in a timely enough manner to make a difference

5

Data

Information

Page 6: Datawarehouse & bi introduction

Data Warehousing -- It is a process

• Technique for assembling and managing data from various sources for the purpose of answering business questions. Thus making decisions that were not previous possible

• A decision support database maintained separately from the organization’s operational database

6

Page 7: Datawarehouse & bi introduction

Data Warehousing• A data warehouse is a

– subject-oriented– integrated– time-varying– non-volatile

collection of data that is used primarily in organizational decision making.

7

Page 8: Datawarehouse & bi introduction

• A data warehouse is organized around the major subjects of the organization such as customer, supplier, product, sales, etc..,

• Data warehouse provides a simple and concise view around a particular subject by excluding data that are not useful to the decision support process.

8

Data Warehousing

Page 9: Datawarehouse & bi introduction

Type of DW Users

9

Explorers: Seek out the unknown and previously unsuspected rewards hiding in the detailed data

Farmers: Harvest information from known access paths

Tourists: Browse information

Page 10: Datawarehouse & bi introduction

Application-Orientation vs. Subject-Orientation

10

Application-Orientation

Operational Database

LoansCredit Card

Trust

Savings

Subject-Orientation

DataWarehouse

Customer

VendorProduct

Activity

Page 11: Datawarehouse & bi introduction

Functioning of Data warehousing

11

Data Source Cleaning Transformation

Data Warehouse

New Update

Page 12: Datawarehouse & bi introduction

Data Warehouse Architecture

12

Data Warehouse Engine

Optimized Loader

ExtractionCleansing

AnalyzeQuery

Metadata Repository

RelationalDatabases

LegacyData

Purchased Data

ERPSystems

Page 13: Datawarehouse & bi introduction

Star Schema

• A single fact table and for each dimension one dimension table

• Does not capture hierarchies directly

13

T ime

prod

cust

city

fact

date, custno, prodno, cityname, ...

Page 14: Datawarehouse & bi introduction

Snowflake schema

• Represent dimensional hierarchy directly by normalizing tables.

• Easy to maintain and saves storage

14

T ime

prod

cust

city

fact

date, custno, prodno, cityname, ...

region

Page 15: Datawarehouse & bi introduction

OLAP(Online analytical processing)

• A data warehouse stores data , but OLAP transform the data warehouse data into specific meaningful information.

• Therefore OLAP provides a user friendly environment for interactive data analysis.

15

Page 16: Datawarehouse & bi introduction

OLAP OPERATION on the Multidimensional data

–Roll-up(GROUP)–Drill down(Less)–Slice and Dice(Pie)–Pivot(rotate)

16

Page 17: Datawarehouse & bi introduction

Multi-dimensional Data

• “Hey…I sold $100M worth of goods”

17

MonthMonth1 1 22 3 3 4 4 776 6 5 5

Pro

du

ctP

rod

uct

Toothpaste Toothpaste

JuiceJuiceColaColaMilk Milk

CreamCream

Soap Soap

Regio

n

Regio

n

WWS S

N N

Dimensions: Dimensions: Product, Region, TimeProduct, Region, TimeHierarchical summarization pathsHierarchical summarization paths

Product Product Region Region TimeTimeIndustry Country YearIndustry Country Year

Category Region Quarter Category Region Quarter

Product City Month WeekProduct City Month Week

Office DayOffice Day

Page 18: Datawarehouse & bi introduction

“Slicing and Dicing”

18

Product

Sales Channel

Regio

ns

Retail Direct Special

Household

Telecomm

Video

Audio IndiaFar East

Europe

The Telecomm Slice

Page 19: Datawarehouse & bi introduction

Roll-up and Drill Down

• Sales Channel• Region• Country• State • Location Address• Sales Representative

19

Roll

Up

Higher Level ofAggregation

Low-levelDetails

Drill-D

ow

n

Page 20: Datawarehouse & bi introduction

Nature of OLAP Analysis• Aggregation -- (total sales,

percent-to-total)• Comparison -- Budget vs.

Expenses• Ranking -- Top 10, quartile

analysis• Access to detailed and aggregate

data• Complex criteria specification• Visualization

20

Page 21: Datawarehouse & bi introduction

Data Mining

• Data mining is sorting through data to identify patterns and establish relationships.

21

Page 22: Datawarehouse & bi introduction

Data Mining (cont.)

22

Page 23: Datawarehouse & bi introduction

Data Mining works with Warehouse Data

• Data Warehousing provides the Enterprise with a memory

23

• Data Mining provides the Enterprise with intelligence

Page 24: Datawarehouse & bi introduction

24

Cleaning and Integration Databases

Data Warehouse

Flat Files

PatternsKnowledge

Selection and transformation

Data Mining

Data Mining Process

Page 25: Datawarehouse & bi introduction

Thanks

Shivmohan Purohit

Q &A Discussion