struck: structured output tracking with kernels mike liu...

38
Struck: Structured Output Tracking with Kernels Presented by Mike Liu, Yuhang Ming, and Jing Wang May 24, 2017

Upload: others

Post on 15-Mar-2021

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Struck: Structured Output Tracking with Kernels

Presented byMike Liu, Yuhang Ming, and Jing Wang

May 24, 2017

Page 2: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Motivations

❏ Problem: Tracking❏ Input: Target ❏ Output: Locations over time

http://vision.ucsd.edu/~bbabenko/images/fast.gif

Page 3: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Tracking Model

● What do we expect from a tracking model○ Able to track arbitrary objects○ Able to locate the object location in next frame correctly

■ Model the appearance of the object■ Eliminate the error caused by object motion, lighting

conditions, and occlusion

3

Page 4: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Adaptive Tracking-by-detection Model

● Adaptive Tracking-by-detection model○ Adaptive: train the model on-the-fly○ Perform in two stages

■ Objects detection and tracking● Discriminative classifier to capture the object● Estimate the next location using the classifier score

■ Train the classifier● Generate a set of labelled samples using the actual location● Update the classifier

4

Page 5: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Adaptive Tracking-by-detection Model

5

Page 6: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Online training methods

● Online multiple Instance Learning

● Online boosting, online SVMs● Online multi-class LPBoost

Babenko, Boris, Ming-Hsuan Yang, and Serge Belongie. "Visual tracking with online multiple instance learning." Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009.

Saffari, Amir, et al. "Online multi-class lpboost." Computer Vision and Pattern Recognition (CVPR), 2010 IEEE Conference on. IEEE, 2010.

Page 7: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Multiple Instance Learning: object tracking

7Babenko, Boris, Ming-Hsuan Yang, and Serge Belongie. "Visual tracking with online multiple instance learning." Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009.

Page 8: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Multiple Instance Learning: training model

8Babenko, Boris, Ming-Hsuan Yang, and Serge Belongie. "Visual tracking with online multiple instance learning." Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on. IEEE, 2009.

Update the MIL Classifier using a positive bag of image patches

Page 9: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Adaptive Tracking-by-detection Model

9

Page 10: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Problems: Train only with binary labels

Page 11: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Problems: Training samples are equally weighted

Page 12: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Problems: Which labeler is the best?

Page 13: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output Tracking with Kernels

13

Proposed Approach

Traditional Approach

Page 14: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output Tracking with Kernels

Include y as one of the inputTrain not only with negative or positive labels

Output the transformation directly

Include a budget to control the number of support vectors

Page 15: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output Tracking

Page 16: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output Tracking

Page 17: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output Tracking

Page 18: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output Tracking● Prediction Function :

❏ F is the discrimina❏ x is the input image

patch❏ Y is the output from the

space of all possible transformations which can be defined as:

Page 19: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Structured Output SVM

● Prediction Function :

Page 20: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

● Standard Lagrangian duality

● The discriminant function now is:

Structured Output SVM

Page 21: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

● Reparameterization

Structured Output SVM

Page 22: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

● Reparameterized dual SVM

● The discriminant function now is:

Structured Output SVM

Page 23: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Online Optimization● SMO-style step

○ The set S of current support vectors○ The coefficients○ The derivatives

Page 24: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Online Optimization● Step Selection Strategies

○ Process New○

○ Process Old○

○ Optimize

Page 25: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Online Optimization● Adaptive Scheduling

○ A Process New step followed by 10 Reprocess steps

■ A Reprocess step is a Process Old step followed by 10 Optimize steps

REPROCESS

Page 26: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

● Fix the number of support vectors○ Remove the SV which results in smallest impact○ Ensure remains satisfied○ w is measured as:

Budget Mechanism

Page 27: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Kernel Functions and Image Features

● Use a restriction kernel:

● Straightforward to incorporate different image features:○ Haar○ Raw ○ Histogram

● Straightforward to combine different image features together.

Page 28: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Experiment - Benchmark

http://vision.ucsd.edu/~bbabenko/project_miltrack.html. Babenko, M. H. Yang, and S. Belongie. Visual Tracking with Online Multiple Instance Learning. In Proc. CVPR, 2009.

Page 29: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Experiment - Image Features

● Use 6 different types of Haar-like features arranged on a grid at 2 scales on a 4x4 grid, resulting in 192 features.

● Apply a Gaussian kernel.

Page 30: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Experiment - Tracking● Track 2D translation

○ Search radius of 30 pixels

○ Update the classifier with radius of 60 pixels to ensure stability.

○ Sample from a polar grid using 5 radial and 16 angular divisions.

● Evaluate using Pascal VOC overlap criterion (aka Jaccard similarity of bounding boxes a0

> 50%):

Where Bp is the predicted bounding box and Bgt is the ground truth.

Page 31: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Experiment - Budget● Uses budget of

20, 50, 100, and infinity.

Page 32: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Interesting Property

Page 34: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Benchmark Results

Page 35: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Experiment - Combining Kernels● Different image features can be combined by averaging multiple kernels:

● Features included are:○ Haar○ Raw○ Histogram

Page 36: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Combining Kernels Results

Page 37: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Future Work● Extend output space

○ Include rotation and scale transformations.○ Incorporate object dynamics.

● Extend input space○ Alternative image features.○ Multiple kernel learning.

Page 38: Struck: Structured Output Tracking with Kernels Mike Liu ...cseweb.ucsd.edu/classes/sp17/cse252C-a/CSE252C_20170524.pdf2017/05/24  · 8 Babenko, Boris, Ming-Hsuan Yang, and Serge

Summary● Struck is a tracking by detection framework based on structured output

prediction.● Integrates learning and tracking.

○ Does not rely on a heuristic intermediate step for producing labelled binary samples.

○ Uses an online structured output SVM learning framework.○ Introduced a budget maintenance mechanism for online structured output

SVMs.● Better performance than existing state-of-the-art trackers.