pattern recognition & machine learning

21
Pattern Recognition & Machine Learning Debrup Chakraborty [email protected] .mx

Upload: corazon-soria

Post on 01-Jan-2016

57 views

Category:

Documents


4 download

DESCRIPTION

Pattern Recognition & Machine Learning. Debrup Chakraborty [email protected]. The Initials. Time: Wednesday: 16 hrs to 18 hrs Friday : 16 hrs to 18 hrs. Course website: http://delta.cs.cinvestav.mx/ ~debrup/Machine_Learning.html. Books: - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Pattern Recognition & Machine Learning

Pattern Recognition & Machine Learning

Debrup Chakraborty

[email protected]

Page 2: Pattern Recognition & Machine Learning

The Initials

Time: Wednesday: 16 hrs to 18 hrs Friday : 16 hrs to 18 hrs

Course website:

http://delta.cs.cinvestav.mx/~debrup/Machine_Learning.html

Books:

1) Pattern Classification: Duda, Hart and Stork

2) Machine Learning: Mitchell

3) Neural Networks : Haykin

Page 3: Pattern Recognition & Machine Learning

The Initials (contd.)

Grading policies:

4 homeworks (20%)

2 exams (30%)

1 project/term paper ( 50%)

The lectures would be in English, I am really sorry about it.

Page 4: Pattern Recognition & Machine Learning

We are good at recognizing patterns

We can recognize faces with ease

We can understand spoken words

We can read handwritings

……… many more

It would be nice to make machines perform these tasks.

Informally pattern recognition deals with techniques and methods to make machines recognize patterns.

Page 5: Pattern Recognition & Machine Learning

Satellite image of Kolkata, in 4 channels: Red, Green, Blue and Infrared

Which parts are land and which are water?

Where is the airport?

Page 6: Pattern Recognition & Machine Learning

Protein Fold Prediction1) Proteins are sequences of amino acid molecules.

There are 20 distinct type of amino acids and their sequences can form many-many protein molecules.

2) A amino acid chain must fold in a certain manner which helps it in its specified activity.

3) The activity and function of a protein depends on the way it folds. Thus the fold information is necessary for determining the function of a protein.

Problem: Given an amino acid sequence, tell me the fold it will undergo.

Page 7: Pattern Recognition & Machine Learning

Text Categorization

Given a large corpus of text documents (Say news items).

Tell me the categories of the documents.

Similar problems:

Classify music in a music corpus

Classify images in a image corpus

Retrive documents/music/images from a database which are similar to x.

Page 8: Pattern Recognition & Machine Learning

Other Problems

•Predict whether a patient hospitalized due to heart attack will have a second heart attack. The prediction is to be based on demographic, diet and clinical measurements of the patient.

•Predict the price of a stock in 6 months from now on the basis of company performance measures and econimic data.

•Identify the numbers in a handwritten zip code (CP) from digitized images.

•Estimate the amount of glucose in the blood of a diabetic patient from the infra red absorption spectrum of the blood.

•Estimate the risk factors for prostrate cancer based on clinical and demographic variables.

Page 9: Pattern Recognition & Machine Learning

Pattern Recognition

(What the experts say?)

Duda Hart, 1973 : “A field concerned with machine recognition of meaningful regularities”.

Bezdek, 1981: “Pattern recognition is the search for structure in data”

Page 10: Pattern Recognition & Machine Learning

Types of Data

Object Data

Relational Data

4

Htb

Co4

legs20

Wt

Page 11: Pattern Recognition & Machine Learning

Object Data – Numeric Features

The characteristics of an object is encoded in a vector called the Feature Vector

Each component of the vector represents some attribute of the object. These components are called features.

Example:

Iris data : A data in R4. Representing iris flowers. The features are the sepal length, sepal width, petal length and petal width of 150 iris flowers of 3 different types.

Multichannel satellite Image: Images captured by different sensors which captures the frequency information of the electromagnetic radiation from earths surface.

Page 12: Pattern Recognition & Machine Learning

Relational Data

A relational matrix whose values are assigned by humans or computed from features:

R r rij n n ij

Russian German Chinese Japanese

1.0 0.6 0.2 0.25

0.6 1.0 0.1 0.2

0.2 0.1 1.0 0.8

0.25 0.2 0.8 1.0

Russian

German

Chinese

Japanese

Page 13: Pattern Recognition & Machine Learning

Pattern Recognition Systems

Preprocessing

Feature extraction

Feature analysis

The main recognition task

Page 14: Pattern Recognition & Machine Learning

Recognition Involves Learning

Learning in this context means:

1) Extracting knowledge from past experience

2) Representing the knowledge efficiently

3) Using the knowledge for future predictions/recognitions

Page 15: Pattern Recognition & Machine Learning

Types of Learning

Supervised learning

Learning with a teacher

Unsupervised learning

Learning without a teacher

Reinforcement learning

Learning with a critic

Page 16: Pattern Recognition & Machine Learning

Supervised Learning

Training Set

Learning Algorithm

hNew example

Prediction

Page 17: Pattern Recognition & Machine Learning

Supervised Learning (Contd.)

Supervised learning systems varies according to the type of function learned.

Basic types:

Function approximation systems (Numeric outputs)

qp RRh :

Classifier systems (outputs are classes)

qp RRh :

Page 18: Pattern Recognition & Machine Learning

Supervised Learning (Contd.)

)(xy S

It is assumed that x and y bear a (unknown) functional relationship say

It is assumed that (xi, yi) is generated from a fixed (but unknown) time invariant probability distribution.

L i Ni i x y, , , , ...,1 2The Training set

Goal: To find h which closely resembles S given L.

Page 19: Pattern Recognition & Machine Learning

Supervised Learning (Contd.)

How to measure whether h resembles S.

Training error: The error on training data points

Test error (Generalization Error): The error on points not in the training set (difficult to measure)

))((1

ii

N

ii yxhJ

Page 20: Pattern Recognition & Machine Learning

Bad Generalization – An example

x

xx

x

y

y

y

y

Page 21: Pattern Recognition & Machine Learning

Thank You