cse 5331/7331 f'071 cse 5331/7331 fall 2007 dimensional modeling margaret h. dunham department...

21
CSE 5331/7331 F' 07 1 CSE 5331/7331 CSE 5331/7331 Fall 2007 Fall 2007 Dimensional Modeling Dimensional Modeling Margaret H. Dunham Margaret H. Dunham Department of Computer Science and Department of Computer Science and Engineering Engineering Southern Methodist University Southern Methodist University Some slides extracted from Some slides extracted from Data Mining, Introductory and Advanced Topics Data Mining, Introductory and Advanced Topics , Prentice , Prentice Hall, 2002. Hall, 2002.

Upload: ella-allison

Post on 08-Jan-2018

221 views

Category:

Documents


4 download

DESCRIPTION

3CSE 5331/7331 F'07 Multidimensional Model Example Fig 2 [1]

TRANSCRIPT

Page 1: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

CSE 5331/7331 F'07 1

CSE 5331/7331CSE 5331/7331Fall 2007Fall 2007

Dimensional ModelingDimensional Modeling

Margaret H. DunhamMargaret H. DunhamDepartment of Computer Science and EngineeringDepartment of Computer Science and Engineering

Southern Methodist UniversitySouthern Methodist University

Some slides extracted from Some slides extracted from Data Mining, Introductory and Advanced TopicsData Mining, Introductory and Advanced Topics, Prentice Hall, 2002., Prentice Hall, 2002.

Page 2: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

2CSE 5331/7331 F'07

Dimensional ModelingDimensional Modeling View data in a hierarchical manner more as View data in a hierarchical manner more as

business executives mightbusiness executives might Useful in decision support systems and miningUseful in decision support systems and mining Dimension:Dimension: collection of logically related collection of logically related

attributes; axis for modeling data.attributes; axis for modeling data. Facts:Facts: data stored data stored Ex: Dimensions – products, locations, dateEx: Dimensions – products, locations, date

Facts – quantity, unit priceFacts – quantity, unit price

Page 3: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

3CSE 5331/7331 F'07

Multidimensional Model ExampleMultidimensional Model Example

Fig 2 [1]

Page 4: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

4CSE 5331/7331 F'07

Cube view of DataCube view of Data

Fig 4 [1]

Page 5: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

5CSE 5331/7331 F'07

Aggregation HierarchiesAggregation Hierarchies

Page 6: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

6CSE 5331/7331 F'07

Multidimensional SchemasMultidimensional Schemas Star Schema shows facts and dimensionsStar Schema shows facts and dimensions

– Center of the star has facts shown in fact tablesCenter of the star has facts shown in fact tables– Outside of the facts, each diemnsion is shown Outside of the facts, each diemnsion is shown

separately in dimension tablesseparately in dimension tables– Access to fact table from dimension table via joinAccess to fact table from dimension table via join

SELECT Quantity, PriceSELECT Quantity, PriceFROM Facts, LocationFROM Facts, LocationWhere (Facts.LocationID = Location.LocationID) andWhere (Facts.LocationID = Location.LocationID) and(Location.City = ‘Dallas’)(Location.City = ‘Dallas’)

– View as relations, problem volume of data and View as relations, problem volume of data and indexingindexing

Page 7: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

7CSE 5331/7331 F'07

Star SchemaStar Schema

Page 8: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

8CSE 5331/7331 F'07

Flattened StarFlattened Star

Page 9: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

9CSE 5331/7331 F'07

Normalized StarNormalized Star

Page 10: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

10CSE 5331/7331 F'07

Snowflake SchemaSnowflake Schema

Page 11: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

11CSE 5331/7331 F'07

OLAP IntroductionOLAP Introduction

OLAP by ExampleOLAP by Examplehttp://perso.orange.fr/bernard.lupin/englishttp://perso.orange.fr/bernard.lupin/english/index.htmh/index.htm What is OLAP?What is OLAP?http://www.olapreport.com/fasmi.htmhttp://www.olapreport.com/fasmi.htm

Page 12: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

12CSE 5331/7331 F'07

OLAPOLAP Online Analytic Processing (OLAP):Online Analytic Processing (OLAP): provides more provides more

complex queries than OLTP.complex queries than OLTP. OnLine Transaction Processing (OLTP):OnLine Transaction Processing (OLTP): traditional traditional

database/transaction processing.database/transaction processing. Dimensional data; cube view Dimensional data; cube view Support ad hoc queryingSupport ad hoc querying Require analysis of dataRequire analysis of data Can be thought of as an extension of some of the basic Can be thought of as an extension of some of the basic

aggregation functions available in SQLaggregation functions available in SQL OLAP tools may be used in DSS systemsOLAP tools may be used in DSS systems Mutlidimentional view is fundamentalMutlidimentional view is fundamental

Page 13: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

13CSE 5331/7331 F'07

OLAP ImplementationsOLAP Implementations MOLAP (Multidimensional OLAP)MOLAP (Multidimensional OLAP)

– Multidimential Database (MDD)Multidimential Database (MDD)– Specialized DBMS and software system capable of supporting the Specialized DBMS and software system capable of supporting the

multidimensional data directlymultidimensional data directly– Data stored as an n-dimensional array (cube)Data stored as an n-dimensional array (cube)– Indexes used to speed up processingIndexes used to speed up processing

ROLAP (Relational OLAP)ROLAP (Relational OLAP)– Data stored in a relational databaseData stored in a relational database– ROLAP server (middleware) creates the multidimensional view for ROLAP server (middleware) creates the multidimensional view for

the userthe user– Less Complex; Less efficientLess Complex; Less efficient

HOLAP (Hybrid OLAP)HOLAP (Hybrid OLAP)– Not updated frequently – MDDNot updated frequently – MDD– Updated frequently - RDBUpdated frequently - RDB

Page 14: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

14CSE 5331/7331 F'07

OLAP OperationsOLAP Operations

Single Cell Multiple Cells Slice Dice

Roll Up

Drill Down

Page 15: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

15CSE 5331/7331 F'07

OLAP OperationsOLAP Operations Simple query – single cell in the cubeSimple query – single cell in the cube SliceSlice – Look at a subcube to get more – Look at a subcube to get more

specific informationspecific information Dice Dice – Rotate cube to look at another – Rotate cube to look at another

dimensiondimension Roll UpRoll Up – Dimension Reduction; Aggregation – Dimension Reduction; Aggregation Drill DownDrill Down Visualization: These operations allow the Visualization: These operations allow the

OLAP users to actually “see” results of an OLAP users to actually “see” results of an operation.operation.

Page 16: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

16CSE 5331/7331 F'07

Relationship Between TopcsRelationship Between Topcs

Page 17: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

17CSE 5331/7331 F'07

Decision Support SystemsDecision Support Systems Tools and computer systems that assist Tools and computer systems that assist

management in decision makingmanagement in decision making What if types of questionsWhat if types of questions High level decisionsHigh level decisions Data warehouse – data which supports Data warehouse – data which supports

DSSDSS

Page 18: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

18CSE 5331/7331 F'07

StarflakeStarflake

Fig 2 [4]

Page 19: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

19CSE 5331/7331 F'07

Hierarchy of Data CubesHierarchy of Data Cubes

Fig 4 [4]

Page 20: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

20CSE 5331/7331 F'07

Unified Dimensional ModelUnified Dimensional Model

Microsoft Cube ViewMicrosoft Cube View SQL Server 2005SQL Server 2005http://msdn2.microsoft.com/en-us/library/ms345http://msdn2.microsoft.com/en-us/library/ms345143.aspx143.aspxhttp://cwebbbi.spaces.live.com/Blog/cns!1pi7EThttp://cwebbbi.spaces.live.com/Blog/cns!1pi7ETChsJ1un_2s41jm9Iyg!325.entryChsJ1un_2s41jm9Iyg!325.entry MDX AS2005MDX AS2005http://msdn2.microsoft.com/en-us/library/aa216http://msdn2.microsoft.com/en-us/library/aa216767(SQL.80).aspx767(SQL.80).aspx

Page 21: CSE 5331/7331 F'071 CSE 5331/7331 Fall 2007 Dimensional Modeling Margaret H. Dunham Department of Computer Science and Engineering Southern Methodist University

21CSE 5331/7331 F'07

BibliographyBibliography[1] [1] Anne-Muriel Arigon, Anne Tchounikine, and Maryvonne Miquel, “Handling Anne-Muriel Arigon, Anne Tchounikine, and Maryvonne Miquel, “Handling

Multiple Points of View in a Multimedia Data Warehouse,” Multiple Points of View in a Multimedia Data Warehouse,” ACM Transactions on ACM Transactions on Multimedia Computing, Communications and ApplicationsMultimedia Computing, Communications and Applications , Vol. 2, No. 3, August , Vol. 2, No. 3, August 2006, Pages 199–218.2006, Pages 199–218.

[2] S. Nicholson, “The Bibliomining Process: Data Warehousing and Data Mining [2] S. Nicholson, “The Bibliomining Process: Data Warehousing and Data Mining for Library Decision-Making,” for Library Decision-Making,” Information Technology and Libraries,Information Technology and Libraries, 22(4), 22(4), 2003.2003.

[3] S. Nicholson, “The Basis for Biliomining: Frameworks for Bringing Together [3] S. Nicholson, “The Basis for Biliomining: Frameworks for Bringing Together Usage-Based Data Mining and Bibliometrics through Data Warehousing in Usage-Based Data Mining and Bibliometrics through Data Warehousing in Digital Library Services,” Digital Library Services,” Information Processing & Management,Information Processing & Management, 42(3), May 42(3), May 2006, pp 785-804.2006, pp 785-804.

[4] Jane You, Tharam Dillon, James Liu, Edwige Pissaloux, “On Hierarchical [4] Jane You, Tharam Dillon, James Liu, Edwige Pissaloux, “On Hierarchical Multimedia Information Retrieval,” You, J.; Multimedia Information Retrieval,” You, J.; Proceedings of the 2001 Proceedings of the 2001 International Conference on Image ProcessingInternational Conference on Image Processing, 7-10 Oct 2001, pp 729 – 732., 7-10 Oct 2001, pp 729 – 732.

[5] Torsten Priebe and Gunther Pernul, “Ontology-based Integration of OLAP and [5] Torsten Priebe and Gunther Pernul, “Ontology-based Integration of OLAP and Information Retrieval,” Information Retrieval,” Proceedings of the 14Proceedings of the 14thth International Workshop on International Workshop on Database and expert Systems Applications, 2003.Database and expert Systems Applications, 2003.