Data Mining By Example – Building a Predictive Model Using Microsoft Decision Trees
by Shaoli Lu
Microsoft Decision Trees
• Developed by the Microsoft Research team, the Microsoft Decision Trees algorithm is a hybrid decision tree algorithm that supports both classification and regression
Goal
• To predict a prospect’s likelihood of purchasing a bike
Prerequisites
• A SQL Server instance (2005 or later)
• SQL Server Analysis Services (SSAS) with the Multidimensional feature installed
(this is used to host and browse the mining structures; a cube is not required for data mining)
• AdventureWorksDW database attached (download from CodePlex; pick the version that matches your SQL Server version)
• Visual Studio 2010 or later with SQL Server Data Tools (SSDT) installed
My Demo Setup
• Visual Studio 2010
• SQL Server 2012
Create Data Mining Project
• Name the project DM Decision Trees (DM = Data Mining)
Create Data Source and Impersonation
Create Data Source View
Create Mining Structure
• Choose Microsoft Decision Trees model
• Select Data Source View
• Choose training data
• Select Input/Predict parameters
• Set content types
• Set Holdout percentage
• Name the mining structure and model
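To illustrate what the holdout percentage means, here is a minimal Python sketch, not the wizard's actual implementation. It randomly sets aside a share of the cases for testing and keeps the rest for training. The function name `split_holdout`, the 30 percent value, and the stand-in data are assumptions made for this example.

```python
import random

def split_holdout(cases, holdout_percent=30, seed=42):
    """Return (training, holdout); holdout gets about holdout_percent% of the cases."""
    shuffled = list(cases)
    # A fixed seed keeps the split repeatable between runs.
    random.Random(seed).shuffle(shuffled)
    n_holdout = round(len(shuffled) * holdout_percent / 100)
    return shuffled[n_holdout:], shuffled[:n_holdout]

cases = list(range(1000))  # stand-in for 1000 prospect rows (hypothetical data)
training, holdout = split_holdout(cases)
print(len(training), len(holdout))
```

The training cases are used when the model is processed; the holdout cases are kept back so the model can later be tested on data it has not seen.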
Deploy the mining structure and model
Process the mining model
• This is also called “training the model”
Mining Model Viewer
• Identify dominant attributes
• Splits closer to the root (the left side of the tree) are associated with more important attributes
• Rich visualization is good for data exploration as well
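One way to see why some attributes dominate the tree is to score how well each attribute separates buyers from non-buyers. The sketch below uses plain Shannon entropy and information gain as an illustrative assumption; it is not a reproduction of how Microsoft Decision Trees scores splits. The rows and column names (`cars`, `region`, `buyer`) are made up for the example.

```python
import math
from collections import Counter, defaultdict

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((n / total) * math.log2(n / total) for n in Counter(labels).values())

def information_gain(rows, attribute, target):
    """Entropy reduction obtained by splitting rows on one attribute."""
    groups = defaultdict(list)
    for row in rows:
        groups[row[attribute]].append(row[target])
    remainder = sum(len(g) / len(rows) * entropy(g) for g in groups.values())
    return entropy([row[target] for row in rows]) - remainder

# Hypothetical toy data: car ownership separates buyers perfectly, region not at all.
rows = [
    {"cars": 0, "region": "Europe", "buyer": 1},
    {"cars": 0, "region": "Pacific", "buyer": 1},
    {"cars": 2, "region": "Europe", "buyer": 0},
    {"cars": 2, "region": "Pacific", "buyer": 0},
]
ranked = sorted(["cars", "region"],
                key=lambda a: information_gain(rows, a, "buyer"),
                reverse=True)
print(ranked)
```

An attribute with a higher gain is a better candidate for an early split, which is why it would appear nearer the root of the tree.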
Mining Model Accuracy Chart
• This step is called “testing the model”; it uses the holdout data
• Lift chart
• Profit chart
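The idea behind a lift chart can be sketched in a few lines of Python: rank the holdout cases by predicted probability, then measure what share of the actual buyers is captured when only the top portion of the list is targeted. This is a simplified illustration, not the chart's actual computation; the function `lift_curve` and the scored cases are hypothetical.

```python
def lift_curve(scored_cases, steps=5):
    """scored_cases: list of (predicted_probability, actually_bought) pairs.

    Returns (percent_of_population_targeted, percent_of_buyers_captured) points.
    """
    ranked = sorted(scored_cases, key=lambda c: c[0], reverse=True)
    total_buyers = sum(1 for _, bought in ranked if bought)
    points = []
    for step in range(1, steps + 1):
        cutoff = len(ranked) * step // steps
        captured = sum(1 for _, bought in ranked[:cutoff] if bought)
        points.append((100 * step // steps, 100.0 * captured / total_buyers))
    return points

# Hypothetical holdout cases scored by a model.
scored = [(0.9, True), (0.8, True), (0.7, False), (0.6, True), (0.5, False),
          (0.4, False), (0.3, True), (0.2, False), (0.1, False), (0.05, False)]
for population, buyers in lift_curve(scored):
    print(f"top {population}% of prospects -> {buyers:.0f}% of buyers")
```

A random guess would capture roughly 20% of the buyers in the top 20% of prospects; the further the model's curve sits above that diagonal, the more lift it provides.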
Mining Model Prediction
• Singleton query
• Mass prediction
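The difference between the two prediction styles can be sketched as follows. The scoring function is a hand-written stand-in for a trained model (its thresholds and probabilities are invented for illustration), and the field names are assumptions; a real model is learned when the mining model is processed.

```python
def predict_buyer_probability(prospect):
    """Hypothetical hand-written tree standing in for a trained model."""
    if prospect["cars_owned"] == 0:
        return 0.70 if prospect["age"] < 45 else 0.55
    return 0.30

# Singleton prediction: score one prospect on demand, e.g. from inside an app.
single = predict_buyer_probability({"age": 35, "cars_owned": 0})

# Mass prediction: score a whole list, e.g. to pick targets for a campaign.
prospects = [
    {"id": 1, "age": 35, "cars_owned": 0},
    {"id": 2, "age": 60, "cars_owned": 0},
    {"id": 3, "age": 40, "cars_owned": 2},
]
batch = [(p["id"], predict_buyer_probability(p)) for p in prospects]
targets = [pid for pid, probability in batch if probability >= 0.5]
print(single, targets)
```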
Browse mining model on SQL Server
• Decision trees
• Dependency network
Summary
• Microsoft Decision Trees is a powerful data mining model, yet it is easy to build, train and use
• It can perform both singleton predictions (e.g. embedded in an app) and mass predictions (e.g. targeted marketing)
• Holdout data can be used to test the trained model
• Rich visualizations such as Lift/Profit Charts and the Dependency Network can facilitate analysis and data exploration
• A relational database can be used for data mining; a cube is not required
The End