Data Mining with Excel 2010 and PowerPivot Mark Tabladillo Ph.D. http://marktab.net September 18, 2010

Upload: mark-tabladillo

Post on 05-Dec-2014

DESCRIPTION

SQL Server Data Mining (Analysis Services) using Excel 2010, PowerPivot add-in, and Data Mining add-in

TRANSCRIPT

Page 1: Data Mining with Excel 2010 and PowerPivot

Data Mining with Excel 2010 and PowerPivot

Mark Tabladillo Ph.D. http://marktab.net September 18, 2010

Page 2: Data Mining with Excel 2010 and PowerPivot

SQL Saturday 46 -- Raleigh NC #sqlsat46

© 2010 Mark Tabladillo Ph.D.

Page 3: Data Mining with Excel 2010 and PowerPivot

MarkTab & Data Mining

Page 4: Data Mining with Excel 2010 and PowerPivot

Page 5: Data Mining with Excel 2010 and PowerPivot

Page 6: Data Mining with Excel 2010 and PowerPivot

Outline

What is Data Mining

What is PowerPivot

Demos

Page 7: Data Mining with Excel 2010 and PowerPivot

Data Mining as a Service

Page 8: Data Mining with Excel 2010 and PowerPivot

Outline

What is Data Mining

What is PowerPivot

Demos

Page 9: Data Mining with Excel 2010 and PowerPivot

Data Mining Definitions

• Data mining

• Machine Learning

• Data mining algorithms -- typically use estimation or optimization to achieve results (as opposed to only calculations).
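
A minimal sketch of the distinction in the last bullet, not taken from the slides: the mean is a direct calculation, while the line fit estimates its parameters by optimization (plain gradient descent on squared error). The function names, sample data, learning rate, and step count are illustrative assumptions.

```python
# Calculation: a closed-form summary computed directly from the data.
def mean(values):
    return sum(values) / len(values)


# Estimation by optimization: search for the slope and intercept that
# minimize mean squared error, using plain gradient descent.
def fit_line(xs, ys, learning_rate=0.01, steps=5000):
    slope, intercept = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Residuals of the current line against the observed values.
        errors = [slope * x + intercept - y for x, y in zip(xs, ys)]
        grad_slope = sum(2 * e * x for e, x in zip(errors, xs)) / n
        grad_intercept = sum(2 * e for e in errors) / n
        slope -= learning_rate * grad_slope
        intercept -= learning_rate * grad_intercept
    return slope, intercept


# Hypothetical sample data generated from y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [3.0, 5.0, 7.0, 9.0, 11.0]

print(mean(ys))
slope, intercept = fit_line(xs, ys)
print(round(slope, 3), round(intercept, 3))
```

The fitted values only approach the true slope and intercept as the number of steps grows, which is the practical difference from a one-shot calculation.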

Page 10: Data Mining with Excel 2010 and PowerPivot

Data Mining Tasks

• Supervised
  • The answer is known: what is correlated with it?

• Unsupervised
  • The answer is unknown (unspecified): what are the groups?

• Forecasting
  • Given a trend: what comes next?
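
The three task types can be sketched with toy stand-ins, assuming tiny hypothetical data sets. These are not the Analysis Services algorithms: Pearson correlation stands in for the supervised question, a two-center 1-D k-means for the unsupervised one, and an average-step extrapolation for forecasting. All names and numbers are illustrative.

```python
from statistics import mean


def pearson(xs, ys):
    """Supervised stand-in: how strongly does an input track the known answer?"""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    return cov / (var_x * var_y) ** 0.5


def two_groups(values, rounds=10):
    """Unsupervised stand-in: 1-D k-means with two centers.

    Assumes at least two distinct values, so neither group is ever empty.
    """
    low, high = min(values), max(values)
    for _ in range(rounds):
        near_low = [v for v in values if abs(v - low) <= abs(v - high)]
        near_high = [v for v in values if abs(v - low) > abs(v - high)]
        low, high = mean(near_low), mean(near_high)
    return sorted(near_low), sorted(near_high)


def forecast_next(series):
    """Forecasting stand-in: extend the series by its average step."""
    steps = [b - a for a, b in zip(series, series[1:])]
    return series[-1] + mean(steps)


# Supervised: the answer (purchases) is known; which input is correlated?
purchases = [1, 2, 3, 4]
visits = [2, 4, 6, 8]        # moves in step with purchases
customer_id = [3, 1, 4, 2]   # unrelated to purchases
r_visits = pearson(visits, purchases)
r_id = pearson(customer_id, purchases)

# Unsupervised: no answer is given; what are the groups?
groups = two_groups([1, 2, 3, 10, 11, 12])

# Forecasting: given a trend, what is next?
next_value = forecast_next([100, 110, 120, 130])

print(r_visits, r_id, groups, next_value)
```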

Value Slide

Page 11: Data Mining with Excel 2010 and PowerPivot

Data Mining Add-In for Excel

• Requires Analysis Services instance

• Version 10.00.2531.00 (April 2009)

• 32-Bit Add-In

• Microsoft .NET Framework 2.0 (32-bit)

• Office 2007 (Professional, Professional Plus, Ultimate, Enterprise)

• SQL Server Enterprise or Standard (or Developer) 2008 or higher

Page 12: Data Mining with Excel 2010 and PowerPivot

The Analyze Tab

Page 13: Data Mining with Excel 2010 and PowerPivot

The Analyze Tab

Menu Option | Data Mining Algorithm
Analyze Key Influencers | Naïve Bayes
Detect Categories | Clustering
Fill from Example | Logistic Regression
Forecast | Time Series
Highlight Exceptions | Clustering
Scenario Analysis (Goal Seek) | Logistic Regression
Scenario Analysis (What If) | Logistic Regression
Prediction Calculator | Logistic Regression
Shopping Basket Analysis | Association Rules
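
The menu-to-algorithm mapping above can be kept as a small lookup table, sketched here in Python. The entries are copied from the slide; the dictionary and function names are illustrative and not part of the add-in.

```python
# Analyze tab menu option -> underlying data mining algorithm,
# copied from the slide table.
ANALYZE_TAB_ALGORITHMS = {
    "Analyze Key Influencers": "Naïve Bayes",
    "Detect Categories": "Clustering",
    "Fill from Example": "Logistic Regression",
    "Forecast": "Time Series",
    "Highlight Exceptions": "Clustering",
    "Scenario Analysis (Goal Seek)": "Logistic Regression",
    "Scenario Analysis (What If)": "Logistic Regression",
    "Prediction Calculator": "Logistic Regression",
    "Shopping Basket Analysis": "Association Rules",
}


def tools_using(algorithm):
    """Return the menu options backed by the given algorithm, sorted by name."""
    return sorted(
        option
        for option, name in ANALYZE_TAB_ALGORITHMS.items()
        if name == algorithm
    )


print(tools_using("Clustering"))
print(len(tools_using("Logistic Regression")))
```

Grouping by algorithm shows that four of the nine menu options rest on logistic regression and two on clustering.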

Page 14: Data Mining with Excel 2010 and PowerPivot

Data Mining Tab

Page 15: Data Mining with Excel 2010 and PowerPivot

Data Mining Tab

Page 16: Data Mining with Excel 2010 and PowerPivot

Data Mining Capacities

SQL Server 2008 R2 Analysis Services Object | Maximum sizes/numbers
Maximum data mining models per structure | 2^31-1 = 2,147,483,647
Maximum data mining structures per solution | 2^31-1 = 2,147,483,647
Maximum data mining structures per Analysis Services database | 2^31-1 = 2,147,483,647
Maximum data mining attributes (variables) per structure | 2^31-1 = 2,147,483,647

Reference: http://www.marktab.net/datamining/index.php/2010/08/01/sql-server-data-mining-capacities-2008-r2/
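
The limit quoted above, 2^31-1, is also the largest value of a signed 32-bit integer. The slide does not state how Analysis Services stores these counts, so the connection is an assumption; the sketch below only checks the arithmetic, using Python's standard `struct` module.

```python
import struct

# The capacity figure from the slide: 2^31 - 1.
MAX_INT32 = 2**31 - 1
print(f"{MAX_INT32:,}")

# The value fits in a signed 32-bit integer ("<i" = little-endian int32)...
packed = struct.pack("<i", MAX_INT32)

# ...but one more does not.
try:
    struct.pack("<i", MAX_INT32 + 1)
    overflowed = False
except struct.error:
    overflowed = True

print(len(packed), overflowed)
```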

Page 17: Data Mining with Excel 2010 and PowerPivot

Data Mining Tab

Page 18: Data Mining with Excel 2010 and PowerPivot

Outline

What is Data Mining

What is PowerPivot

Demos

Page 19: Data Mining with Excel 2010 and PowerPivot

PowerPivot for Excel

• Take advantage of familiar Excel tools and features

• Process massive amounts of data in seconds

• Load even the largest data sets from virtually any source

• Use powerful new analytical capabilities, such as Data Analysis Expressions (DAX)

• Make the most of multi-core processors and gigabytes of memory

Page 20: Data Mining with Excel 2010 and PowerPivot

PowerPivot for Excel Sources

• SQL Server

• SQL Azure

• Oracle, Teradata, Sybase, Informix, IBM DB2

• OLEDB/ODBC

• Analysis Services (SSAS)

• Reporting Services (SSRS)

• Excel, Text File

Page 21: Data Mining with Excel 2010 and PowerPivot

PowerPivot Reference

• http://www.powerpivot.com (Product Site)

• http://www.powerpivotpro.com (Blog Site)

Page 22: Data Mining with Excel 2010 and PowerPivot

Outline

What is Data Mining

What is PowerPivot

Demos

Page 23: Data Mining with Excel 2010 and PowerPivot

Resources

• MarkTab.NET: blog, links, video resources, and information for data mining

• Blog: http://marktab.net/datamining

• Twitter: @MarkTabNet

Page 24: Data Mining with Excel 2010 and PowerPivot

Page 25: Data Mining with Excel 2010 and PowerPivot

Regroup and Conclusion

• Main Points from this Presentation

Page 26: Data Mining with Excel 2010 and PowerPivot

Contact Information

• Mark Tabladillo http://marktab.net

• Also on: Twitter (@marktabnet), LinkedIn
