Lecture 10: SVM and MIRA
DESCRIPTION
Outline: margin, maximizing margin, the norm, support vector machines (SVM), Margin Infused Relaxed Algorithm (MIRA)
TRANSCRIPT
Machine Learning for Language Technology Lecture 10: SVM and MIRA
Marina Santini, Department of Linguistics and Philology, Uppsala University, Uppsala, Sweden
Autumn 2014
Acknowledgement: Thanks to Prof. Joakim Nivre for course design and materials
Margin
Maximizing Margin (i)
Maximizing Margin (ii)
Maximizing Margin (iii)
Max Margin = Min Norm
Maximizing the margin
Linear Classifiers: Repetition & Extension
• The notion of margin: a way of predicting what will be a good separation on the test set.
• Intuitively, if we make the margin between opposite groups as wide as possible, our chances of guessing correctly on the test set should increase.
• The generalization error on unseen test data is proportional to the inverse of the margin: the larger the margin, the smaller the generalization error.
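The geometric margin described above can be computed directly: it is the smallest distance from any training point to the separating hyperplane. A minimal sketch on hypothetical toy data (the weight vectors and points are illustrative, not from the slides):

```python
# Geometric margin of a linear classifier w.x + b = 0:
#   min_i  y_i * (w.x_i + b) / ||w||
# The larger this minimum, the wider the gap between the two classes.
import math

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def geometric_margin(w, b, data):
    norm = math.sqrt(dot(w, w))
    return min(y * (dot(w, x) + b) / norm for x, y in data)

# Hypothetical linearly separable toy set; labels are +1 / -1.
data = [([2.0, 2.0], 1), ([3.0, 1.0], 1),
        ([-1.0, -1.0], -1), ([-2.0, 0.0], -1)]

# Two separating hyperplanes: the second leaves a wider margin,
# so by the argument above it should generalize better.
print(geometric_margin([1.0, 0.0], 0.0, data))  # narrower margin
print(geometric_margin([1.0, 1.0], 0.0, data))  # wider margin
```

Both weight vectors separate the data; the point is that separators are not all equally good, and the margin is the quantity that distinguishes them.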
Support Vector Machines (SVM) (i)
Support Vector Machines (SVM) (ii)
Margin Infused Relaxed Algorithm (MIRA)
MIRA
Perceptron vs. SVMs/MIRA
Perceptron: if the training set is separable by some margin, the Perceptron will find a weight vector that separates the data, but it will not necessarily pick the vector that maximizes the margin. If we are lucky, it will be a vector with the largest margin, but there is no guarantee.
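A minimal Perceptron sketch on hypothetical toy data illustrates the point above: training stops as soon as *some* separating weight vector is found, with no margin guarantee.

```python
# Minimal Perceptron: update on every mistake, stop when an epoch
# passes with no mistakes (i.e. the data is separated).
def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def perceptron(data, epochs=100):
    w = [0.0] * len(data[0][0])
    for _ in range(epochs):
        mistakes = 0
        for x, y in data:
            if y * dot(w, x) <= 0:          # misclassified: update
                w = [wi + y * xi for wi, xi in zip(w, x)]
                mistakes += 1
        if mistakes == 0:                   # converged: stop immediately
            break
    return w

# Hypothetical toy set; labels are +1 / -1.
data = [([2.0, 2.0], 1), ([3.0, 1.0], 1),
        ([-1.0, -1.0], -1), ([-2.0, 0.0], -1)]
w = perceptron(data)
assert all(y * dot(w, x) > 0 for x, y in data)  # separates the data ...
print(w)  # ... but the result depends on update order, not on margin
```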
SVMs/MIRA: these seek the weight vector that maximizes the margin. Here the margin is normalized to 1: we constrain the weight vector so that every training example is classified correctly with a margin of at least 1, keep that margin fixed, and minimize the norm of the weights. That is, we want the smallest weight vector that gives us margin 1.
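The "keep the margin fixed at 1, minimize the norm" trick can be made concrete: rescale any separating weight vector so its smallest functional margin is exactly 1; the geometric margin is then 1/||w||, so maximizing the margin is the same as minimizing the norm. A sketch on hypothetical toy data:

```python
# Rescale a separating weight vector so that
#   min_i y_i * (w.x_i) == 1   (functional margin fixed to 1).
# The geometric margin of the rescaled classifier is then 1/||w||.
import math

def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def normalize_margin(w, data):
    m = min(y * dot(w, x) for x, y in data)   # functional margin
    assert m > 0, "w must separate the data"
    return [wi / m for wi in w]               # margin is now exactly 1

# Hypothetical toy set; labels are +1 / -1.
data = [([2.0, 2.0], 1), ([3.0, 1.0], 1),
        ([-1.0, -1.0], -1), ([-2.0, 0.0], -1)]
w = normalize_margin([1.0, 1.0], data)
norm = math.sqrt(dot(w, w))
print(1.0 / norm)  # geometric margin: smaller norm => larger margin
```

Because the margin is pinned to 1, comparing classifiers by margin reduces to comparing them by norm, which is exactly the SVM objective described above.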
In practice we do not minimize the norm itself but the norm squared divided by 2, ||w||^2 / 2, which makes the math easier (trust the people who suggested this).
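For MIRA, which the slides name but do not spell out, the standard single-example update (an assumption here, not taken from these slides) has a simple closed form: keep the new weights as close as possible to the old ones subject to classifying the current example with margin at least 1. A sketch:

```python
# Standard single-example MIRA update (assumed formulation):
# minimize ||w_new - w||^2  subject to  y * (w_new . x) >= 1.
# Closed-form step size: tau = max(0, (1 - y*(w.x)) / ||x||^2).
def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))

def mira_update(w, x, y):
    loss = max(0.0, 1.0 - y * dot(w, x))    # hinge loss at (x, y)
    if loss == 0.0:
        return w                            # margin already >= 1: no change
    tau = loss / dot(x, x)                  # smallest sufficient step
    return [wi + tau * y * xi for wi, xi in zip(w, x)]

w = [0.0, 0.0]
w = mira_update(w, [2.0, 1.0], 1)
print(w, dot(w, [2.0, 1.0]))  # the example now sits exactly at margin 1
```

The update is "relaxed" in the sense that it takes the smallest step that satisfies the margin constraint, rather than the fixed-size step the Perceptron takes.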
Summary
The end