automatic machine learning using python & scikit-learn

42

Upload: abhishek-thakur

Post on 21-Apr-2017

3.207 views

Category:

Data & Analytics


5 download

TRANSCRIPT

Page 1: Automatic Machine Learning using Python & scikit-learn
Page 2: Automatic Machine Learning using Python & scikit-learn
Page 3: Automatic Machine Learning using Python & scikit-learn

○○○○○○

Page 4: Automatic Machine Learning using Python & scikit-learn

https://competitions.codalab.org/competitions/2321Image Source: http://www.causality.inf.ethz.ch/AutoML/spiral.png

Page 5: Automatic Machine Learning using Python & scikit-learn
Page 6: Automatic Machine Learning using Python & scikit-learn

○○○○○

Page 7: Automatic Machine Learning using Python & scikit-learn

AutoCompete: A Framework for Machine Learning Competitions, A.Thakur and A Krohn-Grimberghe, ICML AutoML Workshop, 2015

Page 8: Automatic Machine Learning using Python & scikit-learn
Page 9: Automatic Machine Learning using Python & scikit-learn
Page 10: Automatic Machine Learning using Python & scikit-learn
Page 11: Automatic Machine Learning using Python & scikit-learn
Page 12: Automatic Machine Learning using Python & scikit-learn

● Numerical Data:○ Do nothing

Page 13: Automatic Machine Learning using Python & scikit-learn

● Numerical Data:○ Do nothing

● Categorical Data:○ Label encoding○ One-hot encoding

Page 14: Automatic Machine Learning using Python & scikit-learn

● Numerical Data:○ Do nothing

● Categorical Data:○ Label encoding○ One-hot encoding

Page 15: Automatic Machine Learning using Python & scikit-learn

● Numerical Data:○ Do nothing

● Categorical Data:○ Label encoding○ One-hot encoding

Page 16: Automatic Machine Learning using Python & scikit-learn

● Numerical Data:○ Do nothing

● Text Data:○ Counts○ TF-IDF

Page 17: Automatic Machine Learning using Python & scikit-learn

● Numerical Data:○ Do nothing

● Text Data:○ Counts○ TF-IDF

Page 18: Automatic Machine Learning using Python & scikit-learn
Page 19: Automatic Machine Learning using Python & scikit-learn
Page 20: Automatic Machine Learning using Python & scikit-learn
Page 21: Automatic Machine Learning using Python & scikit-learn
Page 22: Automatic Machine Learning using Python & scikit-learn
Page 23: Automatic Machine Learning using Python & scikit-learn
Page 24: Automatic Machine Learning using Python & scikit-learn
Page 25: Automatic Machine Learning using Python & scikit-learn

● Multiple ways of feature selection

● Random forest based feature importances

● Feature importances from GBM

● Chi2 feature selection

● Greedy feature selection

Page 26: Automatic Machine Learning using Python & scikit-learn

● Multiple ways of feature selection

● Random forest based feature importances

● Feature importances from GBM

● Chi2 feature selection

● Greedy feature selection

Page 27: Automatic Machine Learning using Python & scikit-learn

● Multiple ways of feature selection

● Random forest based feature importances

● Feature importances from GBM

● Chi2 feature selection

● Greedy feature selection

Page 28: Automatic Machine Learning using Python & scikit-learn

● Multiple ways of feature selection

● Random forest based feature importances

● Feature importances from GBM

● Chi2 feature selection

● Greedy feature selection

Page 29: Automatic Machine Learning using Python & scikit-learn

● Multiple ways of feature selection

● Random forest based feature importances

● Feature importances from GBM

● Chi2 feature selection

● Greedy feature selection

Page 30: Automatic Machine Learning using Python & scikit-learn
Page 31: Automatic Machine Learning using Python & scikit-learn

● Grid Search● Random Search

Page 32: Automatic Machine Learning using Python & scikit-learn

● Classification:○ Random Forest○ GBM○ Logistic Regression○ Naive Bayes○ Support Vector Machines○ k-Nearest Neighbors ● Grid Search

● Random Search

Page 33: Automatic Machine Learning using Python & scikit-learn

● Classification:○ Random Forest○ GBM○ Logistic Regression○ Naive Bayes○ Support Vector Machines○ k-Nearest Neighbors

● Regression○ Random Forest○ GBM○ Linear Regression○ Ridge○ Lasso○ SVR

● Grid Search● Random Search

Page 34: Automatic Machine Learning using Python & scikit-learn

To Appear: AutoCompete 2.0: A Framework for Optimizing Parameters of Neural Networks, A.Thakur, ICML AutoML Workshop, System Desc Track, 2016

Page 35: Automatic Machine Learning using Python & scikit-learn

○○○

○○

○○

Page 36: Automatic Machine Learning using Python & scikit-learn
Page 37: Automatic Machine Learning using Python & scikit-learn

Results on Newsgroups-20 dataset

Page 38: Automatic Machine Learning using Python & scikit-learn

AutoML Final1 Results

Page 39: Automatic Machine Learning using Python & scikit-learn

AutoML Final4 Results

Page 40: Automatic Machine Learning using Python & scikit-learn

AutoML GPU Track Results

Page 41: Automatic Machine Learning using Python & scikit-learn
Page 42: Automatic Machine Learning using Python & scikit-learn

● @abhi1thakur● bit.ly/thakurabhishek● kaggle.com/abhishek