object recognition by parts object recognition started with line segments. - roberts recognized...
Post on 22-Dec-2015
248 views
TRANSCRIPT
![Page 1: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/1.jpg)
Object Recognition by Parts• Object recognition started with line segments.
- Roberts recognized objects from line segments and junctions.
- This led to systems that extracted linear features..
- CAD-model-based vision works well for industrial.
• An “appearance-based approach” was first developed for face recognition and later generalized up to a point.
• The new interest operators have led to a new kind of recognition by “parts” that can handle a variety of objects that were previously difficult or impossible.
![Page 2: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/2.jpg)
Object Class Recognitionby Unsupervised Scale-Invariant Learning
R. Fergus, P. Perona, and A. Zisserman
Oxford University and Caltech
CVPR 2003
won the best student paper award
![Page 3: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/3.jpg)
Goal:
• Enable Computers to Recognize Different Categories of Objects in Images.
![Page 4: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/4.jpg)
![Page 5: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/5.jpg)
Approach
• An object is a random constellation of parts (from Burl, Weber and Perona, 1998).
• The parts are detected by an interest operator (Kadir’s).
• The parts can be recognized by appearance.
• Objects may vary greatly in scale.
• The constellation of parts for a given object is learned from training images
![Page 6: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/6.jpg)
Components
• Model– Generative Probabilistic Model including
Location, Scale, and Appearance of Parts
• Learning– Estimate Parameters Via EM Algorithm
• Recognition– Evaluate Image Using Model and Threshold
![Page 7: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/7.jpg)
Model: Constellation Of Parts
Fischler & Elschlager, 1973
Yuille, ‘91 Brunelli & Poggio, ‘93 Lades, v.d. Malsburg et al. ‘93 Cootes, Lanitis, Taylor et al. ‘95 Amit & Geman, ‘95, ‘99 Perona et al. ‘95, ‘96, ’98, ‘00
![Page 8: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/8.jpg)
Parts Selected by Interest Operator
Kadir and Brady's Interest Operator. Finds Maxima in Entropy Over Scale and Location
![Page 9: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/9.jpg)
Representation of Appearance
11x11 patch
c1
c2
Normalize
Projection ontoPCA basis
c15
121 dimensions was too big, so they used PCA to reduce to 10-15.
![Page 10: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/10.jpg)
Learning a Model
• An object class is represented by a generative model with P parts and a set of parameters .
• Once the model has been learned, a decision procedure must determine if a new image contains an instance of the object class or not.
• Suppose the new image has N interesting features with locations X, scales S and appearances A.
![Page 11: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/11.jpg)
Generative Probabilistic Model
Bayesian Decision Rule
R is the likelihood ratio.
is the maximum likelihood value of the parameters of the object and bg of the background.
h is the hypothesis as to which P of the N features in the image are the object, implemented as a vector of length P with values from 0 to N indicating which image feature corresponds to each object feature.
H is the set of all hypotheses;Its size is O(NP).
Top-Down Formulation
![Page 12: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/12.jpg)
Appearance
Gaussian Part Appearance PDF Guausian Appearance PDF
The appearance (A) of each part phas a Gaussian density withmean cp and covariance VP. Background model has mean cbg
and covariance Vbg.
The vector d of length P has a 1 forvisible parts and 0 for occluded parts.dp stands for d[p].
Object Background
![Page 13: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/13.jpg)
Shape as Location
Gaussian Shape PDF Uniform Shape PDF
Object Background
Object shape is represented by a joint Gaussian density of the locations (X) of features within a hypothesis transformed into a scale-invariant space.
is the area of the image
f is number of foregroundfeatures in the hypothesis.
![Page 14: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/14.jpg)
Scale
Prob. of detection
Gaussian Relative Scale PDF
Log(scale)
0.8 0.75 0.9
Poission PDF On # Detections
UniformRelative Scale PDF
Log(scale)
The relative scale of each part is modeled by a Gaussian density withmean tp and covariance Up.
r is a range.f is number of visible features.
![Page 15: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/15.jpg)
Occlusion and Part Statistics
• First term: Poisson distribution (mean M) models the number of features in the background.
• Second term: (constant) 1/(number of combinations of ft features out of a total of Nt)
• Third term: gives probability for possible occlusion patterns.
![Page 16: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/16.jpg)
Learning
• Train Model Parameters Using EM:
• Optimize Parameters• Optimize Assignments• Repeat Until Convergence
scalelocation
appearance
occlusion
![Page 17: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/17.jpg)
Recognition
Make This:
Greater Than Threshold
![Page 18: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/18.jpg)
RESULTS
• Initially tested on the Caltech-4 data set– motorbikes– faces– airplanes– cars
• Now there is a much bigger data set: the Caltech-101 http://www.vision.caltech.edu/archive.html
![Page 19: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/19.jpg)
MotorbikesEqual error rate: 7.5%
![Page 20: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/20.jpg)
Background Images
![Page 21: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/21.jpg)
Frontal facesEqual error rate: 4.6%
![Page 22: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/22.jpg)
AirplanesEqual error rate: 9.8%
![Page 23: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/23.jpg)
Scale-Invariant CatsEqual error rate: 10.0%
![Page 24: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/24.jpg)
Scale-Invariant carsEqual error rate: 9.7%
![Page 25: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/25.jpg)
Robustness of Algorithm
![Page 26: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/26.jpg)
Accuracy
Initial Pre-Scaled Experiments
![Page 27: Object Recognition by Parts Object recognition started with line segments. - Roberts recognized objects from line segments and junctions. - This led to](https://reader035.vdocuments.mx/reader035/viewer/2022062715/56649d815503460f94a658b1/html5/thumbnails/27.jpg)
ROC equal error rates
Scale-Invariant Learning and Recognition: