Beyond Bags of Features: Spatial Pyramid Matching for Recognizing Natural Scene Categories

Svetlana Lazebnik, Cordelia Schmid, Jean Ponce

Presented by: Lubomir Bourdev

Many of the slides by: Svetlana Lazebnik

Post on 01-Jan-2016

Page 2

Key Idea

• Pyramid Match Kernel (Grauman & Darrell): pyramid in feature space, ignore location

• Spatial Pyramid (this work): pyramid in image space, quantize features

Page 3

Algorithm

1. Extract interest point descriptors (dense scan)

2. Construct visual word dictionary

3. Build spatial histograms

4. Create intersection kernels

5. Train an SVM
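Steps 3 and 4 above can be sketched in a few lines of plain Python. This is a minimal illustration, not the authors' implementation: it assumes each feature arrives as a hypothetical `(x, y, word)` triple with coordinates normalized to [0, 1) and `word` already quantized, and it assumes the level weights 1/2^L for level 0 and 1/2^(L-l+1) for level l >= 1 as I recall them from the paper. The function names are made up for this sketch.

```python
def spatial_pyramid_histogram(features, vocab_size, levels):
    """Build one visual-word histogram per pyramid level.

    features: iterable of (x, y, word), x and y in [0, 1), word in range(vocab_size).
    Level l splits the image into 2**l x 2**l cells; each cell has vocab_size bins.
    """
    pyramid = []
    for level in range(levels):
        cells = 2 ** level  # cells per image side at this level
        hist = [0] * (cells * cells * vocab_size)
        for x, y, word in features:
            col = min(int(x * cells), cells - 1)
            row = min(int(y * cells), cells - 1)
            hist[(row * cells + col) * vocab_size + word] += 1
        pyramid.append(hist)
    return pyramid


def intersection(h1, h2):
    """Histogram intersection: sum of bin-wise minima."""
    return sum(min(a, b) for a, b in zip(h1, h2))


def pyramid_match(p1, p2):
    """Weighted sum of per-level intersections (weights are an assumption, see above)."""
    finest = len(p1) - 1
    total = 0.0
    for level, (h1, h2) in enumerate(zip(p1, p2)):
        if level == 0:
            weight = 1 / 2 ** finest
        else:
            weight = 1 / 2 ** (finest - level + 1)
        total += weight * intersection(h1, h2)
    return total
```

The resulting kernel values for all image pairs would form the Gram matrix handed to the SVM in step 5. Finer levels get larger weights, so matches that also agree on location count more than matches found only at the whole-image level.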

Page 4

Algorithm, step 1: Extract interest point descriptors (dense scan)

- Weak features (edge orientations)

- Strong features (SIFT)

Page 5

Algorithm, step 2: Construct visual word dictionary

- Vector quantization

- Usually K-means clustering

- Vocabulary size (16 to 400)
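To make the dictionary step concrete, here is a small sketch of K-means vector quantization in plain Python. It is an illustration under stated assumptions rather than the code used for the slides: descriptors are assumed to be equal-length tuples of numbers, the names `build_vocabulary` and `nearest_word` are hypothetical, and a real pipeline would more likely call a library K-means on a sample of the descriptors.

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two descriptors."""
    return sum((p - q) ** 2 for p, q in zip(a, b))


def nearest_word(descriptor, centers):
    """Index of the closest cluster center, i.e. the visual word."""
    return min(range(len(centers)),
               key=lambda k: squared_distance(descriptor, centers[k]))


def build_vocabulary(descriptors, vocab_size, iterations=20, seed=0):
    """Plain Lloyd's K-means; returns vocab_size cluster centers."""
    rng = random.Random(seed)
    # Initialize centers from randomly chosen descriptors.
    centers = [list(d) for d in rng.sample(descriptors, vocab_size)]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for d in descriptors:
            clusters[nearest_word(d, centers)].append(d)
        for k, members in enumerate(clusters):
            if members:  # an empty cluster keeps its previous center
                centers[k] = [sum(col) / len(members) for col in zip(*members)]
    return centers
```

Once the centers are fixed, every descriptor in a new image is replaced by `nearest_word(descriptor, centers)`, which is the quantization the histograms are built from.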

Page 16

My experiment: Butterfly Classification

Peacock Zebra

Page 17

Butterflies

• Dataset from Lazebnik / Schmid / Ponce

70 train / 64 test

50 train / 41 test

• Images centered on the butterfly

• Significant background clutter

• Large pose/viewpoint variations

• Scale variations: up to 4x

Page 18

Butterfly Results

Spatial pyramid levels: 1 (No pyramid)

Features       Linear   Intersection   Dims
Weak (16)      82.6%    82.6%
Strong (200)   81.9%    89.5%          200

Spatial pyramid levels: 4

Features       Linear   Intersection   Dims
Weak (16)      88.6%    86.7%          1360
Strong (200)   84.8%    89.5%          17000
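The reported dimensions can be checked with a little arithmetic. The sketch below assumes that a pyramid with n levels uses grids of 1x1, 2x2, 4x4, ... cells, one vocabulary-sized histogram per cell, all concatenated; the function name `pyramid_dims` is hypothetical. Under that assumption, 4 levels give 1 + 4 + 16 + 64 = 85 cells, which matches the 1360 and 17000 figures.

```python
def pyramid_dims(vocab_size, levels):
    """Length of the concatenated spatial pyramid histogram."""
    cells = sum(4 ** level for level in range(levels))  # 1 + 4 + 16 + ...
    return vocab_size * cells


print(pyramid_dims(16, 4))    # weak features, 4 levels
print(pyramid_dims(200, 4))   # strong features, 4 levels
print(pyramid_dims(200, 1))   # strong features, no pyramid
```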
