big data - university of pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfbased...

29
Big data Phuong Pham April 9, 15

Upload: others

Post on 13-Oct-2020

9 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Big dataPhuong Pham

April 9, 15

Page 2: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Learning Everything about Anything:Webly-Supervised Visual Concept

LearningBased on Santosh Divvala’s slide

Page 3: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Motivation

3

Rearing horse Rolling horse Reining horse Eye horse

What are these?

“horse”

Page 4: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

A system that can

4

Without human supervision• Biased, non-comprehensive• Concept-specific expertise• Scalability

Page 5: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Approach

5

Overall questions? Or we will go into detail of each component

Page 6: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Retrieve concept’s variations

• Approximately 5000 n-grams per concept

• Several visually non-salient n-grams e.g. “last horse”, “particular horse”

• Classifier-based pruning (5000 -> 1000)• Train/test thumbnail images, if A.P. < threshold, discard

n-gram

• Find good quality n-gram

6

Page 7: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Good quality n-grams

7

quality coverage

Not for finding super-ngrams!

Merge into S if i,j are different?

Page 8: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Super-ngrams

8

Merging similar “good quality” ngrams

Page 9: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Training models

• Train separate DPM per super-ngram

• Pruning noisy components

• Merging similar appearance clustering (~ 50%)

9

Page 10: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Pruning noisy components

10

Noisy components (pruned based on A.P. threshold and frequency)

Page 11: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Merging similar components (as super-ngram)11

Page 12: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Object detection

12

• Their system is comparable to WS_vid_CVPR12 (better in avg)• WS_Img_ICCV11 is the best model at the cost of fully supervised learning (limitation of annotation)

Page 13: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Action detection

13The webly learning give reasonable good result?

Page 14: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Future work

14

Page 15: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Summary

• Advantages• Cut annotation cost

• Combine linguistic and visual semantic may lead to more interesting research topics

• ?

• Disadvantages• Still not defeat state of the art even with training set at

large scale

• ?

15

Page 16: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Questions

16

Page 17: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Scene Completion using Millions of Photographs

Based on James Hays’ slides

Page 18: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Motivation

18

Page 19: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

The algorithm

19

Input image Scene Descriptor Image Collection

200 matches20 completionsContext matching+ blending

Page 20: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Semantic scene matching

20

→Gist scene descriptor from 6 orientation and 5 scales

Page 21: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Find its L2 nearest neighbors in 2.3 million images

21

… 200 total

Page 22: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

22Graph cut + Poisson blending Hays and Efros, SIGGRAPH 2007

Context matching

Page 23: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Graph cut seam finding

23

For missing regions of the existing image

For regions of the image not covered be the scene match

For other pixels

Page 24: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Result ranking

24

We assign each of the 200 results a score which is the sum of:

The scene matching distance

The context matching distance (color + texture)

The graph cut cost

Page 25: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Qualitative

25

Page 26: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Qualitative - failures

26

Page 27: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Quantitive

• Our: 37%

• Baseline: 10%

27

Page 28: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Human evaluation

28

Page 29: Big data - University of Pittsburghpeople.cs.pitt.edu/~kovashka/cs3710_sp15/bigdata_phuong.pdfBased on Santosh Divvala’sslide Motivation 3 Rearing horse Rolling horse Reining horse

Discussions

29