itec423 data warehousing introduction to the course asst. prof. dr. nazife dimililer spring...
TRANSCRIPT
ITEC423 DATA WAREHOUSING
INTRODUCTION TO THE COURSE
Asst. Prof. Dr. Nazife DimililerSpring 2010-2011
Information
Class : CTL002 Schedule
Tuesday 12:30-14:20 Thursday 12:30-14:20
Office : CT 206 Phone : 630 1034 Email : [email protected] Books
Ponniah P., Data Warehousing Fundamentals for IT Professionals, John Wiley & Sons, 2010
MS SQL server Analysis services
Assesment
Attendance Attendance is
mandatory. Missing more than 60% of classes disqualifies you from make ups
Grading 4xQuizzes : 20% Midterm :30% Final : 45% Lab performance
(Attendance??) 5% Optional Work upto 5-
10% Project Research Design Homework
Objectives and Learning Outcomes of the course
Objectives Provide a solid background in data warehousing Show the differences between databases and data
warehousing Define the process of designing a data warehouse Design and implement a data warehouse
Learning outcomes Describe the differences between OLTP systems and data
warehouses. Describe the need for data warehousing Analyze and transform business requirements into a
dimensional model in order to build a data warehouse Transform the dimensional model into a physical data
design Implement a high quality data warehouse or data mart Understand multidimensional query concepts
Schedule
Class/Week
Topic Reading
1 Introduction Chapter 1
2 Building blocks of a data warehouse Chapter 2
3 Trends in Data warehousing Chapter 3
4 Planning and Project Management Chapter 4
5 Defining Business Requirements Chapters 5 & 6
6 Architectural Components Chapters 7 & 8
7 Role of Metadata Chapter 9
8 Dimensional Modeling Chapters 10 & 11
9 Data extraction, transformation and loading
Chapter 12
10 OLAP in Data Warehouse Chapter 15
11 Data mining Basics Chapter 17
12 Physical Design Process Chapter 18
13 Deployment and Maintenance Chapters 19 & 20
Learning Procedures
Lectures Power point slides Discussions
Applications Step-by-step
tutorials Case studies
Homework/Project Problems Research/Homework
Operational Databases (OLTP Systems)
Every company uses a number of operational databases to store daily transactions
All activities are recorded Performed by users Stored in databases
Operational databases are designed and optimized for insert/delete/update
Majority of transactions involve single records
Operational Databases (OLTP Systems)
Market Sales
Accounting
Accounting Software
Market Sales Software
Estate Sales
Estate Agency
Software
abcd
1234
dfsfh
data
abcd
1234
dfsfh
data
abcd
1234
dfsfh
data
What is Business Information?
Information contained in the operational databases and external resources of a company
Utilized for gaining insights that drive strategic and tactical business decisions
Help make decisions faster Encompasses a broad category of
technologies gather, store, access, and analyze data
What is Business Intelligence? computer-based techniques used in
spotting, digging-out, and analyzing business data, such as sales revenue by products and/or departments, or by associated costs and incomes
broad category of applications and technologies for gathering, storing, analyzing, and providing access to data to help clients make better business decisions.
What is business Intelligence? environment in which business users
receive information that is reliable, secure, consistent, understandable, easily manipulated and timely enable business users to conduct analyses
that yield an overall understanding of where the business has been, where it is now, and where it will be in the near future.
empowers knowledge workers to make more informed, smarter business decisions faster
Key concepts in Business Intelligence Management makes decisions Requires information from various/diverse
sources Information should be in required format Past data is important Results should be produced immediately Managers should be able pose ad-hoc queries
Accounting
Accounting Software
Estate Sales
Estate Agency
Software
Market Sales
Market Sales Software
Query
Query
Business Intelligence
QueryQuery
I need the number of dairy products
sold by each branch per month
for the last 10 years!
I need these NOW!!!
Is there a correlation between apt sales and dairy product sales?
Prepare a graph showing amount of dairy products and
number of apts sold in each month for the
last 5 years.
Accounting
Accounting
Software
Estate
Sales
Estate Agency
Software
Market
Sales
Market Sales
Software
Business Intelligence
product
company
category
price
branch
employee
ExtractTransformLoad
All mark
et sales
All property sales
All bills
Contains historical data as
well
STAR SCHEMA
What Can a Data Warehouse Do?Some of the benefits of a DW are: Immediate information delivery to
management Data integration from across and
even outside the organization Future vision from historical trends Tools for looking at data in new ways Freedom from IS department resource
limitations
Example of Data Warehouse Applications-I
Sales Analysis Determine real-time product sales to make vital
pricing and distribution decisions. Analyze historical product sales to determine
success or failure attributes. Evaluate successful products and determine key
success factors. Use corporate data to understand the margin as well
as the revenue implications of a decision. Rapidly identify a preferred customer segments
based on revenue and margin. Quickly isolate past preferred customers who no
longer buy. Identify daily what product is in the manufacturing
and distribution pipeline. Instantly determine which salespeople are
performing, on both a revenue and margin basis, and which are behind.
Example of Data Warehouse Applications-II
Financial Analysis Compare actual to budgets on an
annual, monthly and month-to-date basis.
Review past cash flow trends and forecast future needs.
Identify and analyze key expense generators.
Instantly generate a current set of key financial ratios and indicators.
Receive near-real-time, interactive financial statements.
Example of Data Warehouse Applications-III
Human Resource Analysis Evaluate trends in benefit program use. Identify the wage and benefits costs to
determine company-wide variation. Review compliance levels for EEOC and other
regulated activities.Other Areas Warehouses have also been applied to areas
such as: Logistics Inventory Purchasing detailed transaction analysis load balancing …
What is Data Warehouse?
What is Data Warehouse?
•A data warehouse is a central repositorycentral repository for all or significant parts of the data that an enterprise's various business systems collect. •Data warehousing emphasizes the capture of data from diversediverse sources for useful analysis and access•Data warehouse helps get information to answer questions. •It is not meant for direct data entry; •batch updates are the norm for refreshing warehouses.
•Data mart is a subset of a data warehouse subset of a data warehouse based on a specific department, function or subject•Applications of data warehouses include data mining, Web Mining, and decision support systems (DSS), Business Intelligence (BI).
What is a data warehouse?
“A data warehouse is a subject-oriented, Integrated (consolidated) time-variant, and nonvolatile collection of data in support of
management’s decision-making process.”W. H. Inmon