
Machine Learning, Decision Trees, Overfitting

Machine Learning 10-701

Tom M. Mitchell
Center for Automated Learning and Discovery

Carnegie Mellon University

September 13, 2005

Recommended reading: Mitchell, Chapter 3

Machine Learning:

Study of algorithms that
• improve their performance
• at some task
• with experience

Learning to Predict Emergency C-Sections

9714 patient records, each with 215 features

[Sims et al., 2000]

Object Detection

Example training images for each orientation

(Prof. H. Schneiderman)

Text Classification

Company home page

vs

Personal home page

vs

University home page

vs

Reading a noun (vs verb)

[Rustandi et al., 2005]

Growth of Machine Learning

• Machine learning is the preferred approach to
  – Speech recognition, natural language processing
  – Computer vision
  – Medical outcomes analysis
  – Robot control
  – …

• This trend is accelerating
  – Improved machine learning algorithms
  – Improved data capture, networking, faster computers
  – Software too complex to write by hand
  – New sensors / IO devices
  – Demand for self-customization to user, environment

Decision tree learning

Each internal node: test one attribute Xi

Each branch from a node: select one value for Xi

Each leaf node: predict Y (or P(Y|X ∈ leaf))

How would you represent AB ∨ CD(¬E)?
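One possible answer, sketched in Python (the nested-tuple encoding and the helper names are illustrative assumptions, not notation from the lecture): test A, then B, and fall back to a C/D/(¬E) subtree, which has to appear under both A-branches since subtrees cannot be shared in a tree.

# A decision tree as nested tuples: ("leaf", value) or ("node", attribute, {value: subtree}).

def leaf(y):
    return ("leaf", y)

def node(attr, branches):
    return ("node", attr, branches)

def predict(tree, x):
    # Walk the tree: each internal node tests one attribute, each branch selects one value.
    if tree[0] == "leaf":
        return tree[1]
    _, attr, branches = tree
    return predict(branches[x[attr]], x)

# Subtree for C AND D AND (NOT E); conceptually it is duplicated under both branches of A,
# which is why formulas like this can blow up when written as trees.
cde = node("C", {
    0: leaf(False),
    1: node("D", {
        0: leaf(False),
        1: node("E", {0: leaf(True), 1: leaf(False)}),
    }),
})

tree = node("A", {
    0: cde,                                 # A false: need C AND D AND NOT E
    1: node("B", {0: cde, 1: leaf(True)}),  # A true: B true suffices, else fall back
})

# Spot-check against (A and B) or (C and D and not E)
for x in ({"A": 1, "B": 1, "C": 0, "D": 0, "E": 0},
          {"A": 0, "B": 0, "C": 1, "D": 1, "E": 0},
          {"A": 0, "B": 0, "C": 1, "D": 1, "E": 1}):
    assert predict(tree, x) == bool((x["A"] and x["B"]) or (x["C"] and x["D"] and not x["E"]))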

Greedy top-down induction of the tree [ID3, C4.5, …], starting from node = Root
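A rough Python sketch of that greedy top-down loop in the spirit of ID3/C4.5 (the function and argument names here are assumptions for illustration; the attribute-scoring rule, e.g. highest information gain, is left abstract):

from collections import Counter

def grow_tree(examples, attributes, best_attribute):
    # examples       -- list of (x, y) pairs, x a dict of attribute values
    # attributes     -- attributes still available for splitting
    # best_attribute -- scoring function, e.g. pick the attribute with highest information gain
    labels = [y for _, y in examples]
    # Stop when the examples are perfectly classified or no attributes remain;
    # the leaf predicts the majority label.
    if len(set(labels)) == 1 or not attributes:
        return ("leaf", Counter(labels).most_common(1)[0][0])
    a = best_attribute(examples, attributes)      # choose the "best" attribute for this node
    branches = {}
    for v in {x[a] for x, _ in examples}:         # one branch per observed value of a
        subset = [(x, y) for x, y in examples if x[a] == v]
        branches[v] = grow_tree(subset, [b for b in attributes if b != a], best_attribute)
    return ("node", a, branches)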

Entropy

Entropy H(X) of a random variable X

H(X) is the expected number of bits needed to encode a randomly drawn value of X (under most efficient code)

Why? Information theory:
• The most efficient code assigns -log2 P(X=i) bits to encode the message X=i
• So the expected number of bits is:

H(X) = - Σ_i P(X=i) log2 P(X=i)
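A direct transcription of that definition into Python, as a small sanity check:

import math

def entropy(probs):
    # H(X) = - sum_i P(X=i) * log2 P(X=i), skipping zero-probability values
    return -sum(p * math.log2(p) for p in probs if p > 0)

print(entropy([0.5, 0.5]))   # 1.0 bit: a fair coin costs one bit per outcome
print(entropy([0.9, 0.1]))   # ~0.47 bits: a biased coin is cheaper to encode on average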

Sample Entropy

Assume the X values are known; the entropy measures the bits needed to encode the labels Y.
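Estimating that entropy from the labels of a training sample, using empirical class frequencies (a small illustrative sketch; the helper name is an assumption):

import math
from collections import Counter

def sample_entropy(labels):
    # Entropy of the empirical label distribution in a sample
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

print(sample_entropy(["+"] * 9 + ["-"] * 5))   # about 0.94 bits for 9 positive / 5 negative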

What you should know:

• Well-posed function approximation problems:
  – Instance space, X
  – Sample of labeled training data, D = { <xi, yi> }
  – Hypothesis space, H = { f : X → Y }

• Learning is a search/optimization problem over H
  – Various objective functions
  – Today: minimize training error (0-1 loss)

• Decision tree learning
  – Greedy top-down learning of decision trees (ID3, C4.5, ...)
  – Overfitting and tree/rule post-pruning
  – Extensions…
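For concreteness, the 0-1 training error mentioned above, as a short sketch (the names are illustrative; f is any hypothesis mapping an instance x to a predicted label):

def zero_one_error(f, data):
    # Fraction of labeled examples (x, y) that hypothesis f misclassifies
    return sum(f(x) != y for x, y in data) / len(data)

# Overfitting, informally: a tree grown until zero_one_error on the training data reaches 0
# often has higher error on held-out data than a pruned tree that keeps some training error.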