deep learning for computer vision (2/4): object analytics @ lasalle 2016

Xavier Giroacute i Nieto ldquoDeep learning for vision Objectsrdquo Master in Multimedia La Salle URL (May 2016)DocXavi

Deep Learning for Computer VisionObject Analytics 5 May 2016

Xavier Giroacute-i-Nieto

Master en Creacioacute Multimedia

Xavier Giroacute i Nieto ldquoDeep learning for vision Objectsrdquo Master in Multimedia La Salle URL (May 2016)

One lecture organized in three parts

2

Images (global) Objects (local)

Deep ConvNets for Recognition for

Video (2D+T)


One lecture organized in four parts

3

Detection Recognition

Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



4


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Proposals Hand-crafted

5

Slides credit Marc Bolantildeos

Hand-crafted proposals used to be based on bottom-up proposals

Selective Search (SS) Multiscale Combinatorial Grouping (MCG)

[SS] Uijlings Jasper RR Koen EA van de Sande Theo Gevers and Arnold WM Smeulders Selective search for object recognition International journal of computer vision 104 no 2 (2013) 154-171

[MCG] Arbelaacuteez Pablo Jordi Pont-Tuset Jonathan Barron Ferran Marques and Jitendra Malik Multiscale combinatorial grouping CVPR 2014


Proposals DeepBox

6

Kuo Weicheng Bharath Hariharan and Jitendra Malik Deepbox Learning objectness with convolutional networks ICCV 2015 [software]


Proposals DeepBox

7


Deepbox proposes a very simple method1) Use a state-of-the-art method (Edge Box) to generate initial object proposals2) Rerank them (and possibly discard them) by using DeepBox


Proposals DeepBox Architecture

8


PASCAL VOCAUC = 075 IoU = 05AUC = 062 IoU = 07


AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop


Proposals DeepBox Training

9


1) Initialize layers with AlexNet weights 3) Train on Hard Negatives

2) Train on Sliding WindowsNegative SamplesExtract windows by raster scanning

Positive SamplesHaving GT bounding boxes they

generate samples per instance

with a perturbation of

By using bottom-up proposals from Edge boxes

If GT overlap threshold lt= 03 rarr Negative Samples

If GT overlap threshold gt= 07 rarr Positive Samples


Proposals DeepBox Results

10

DeepBox Edge Boxes DeepBox Edge Boxes




11

With a rather simple approach ConvNets can obtain much better results than previous techniques for Object Proposals




12




13

Increasing not only Detection capabilities of known classes but also of unknown ones (suitable for Object Discovery)




14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16

DPM (HOG features)[1] R-CNN [2] SPPnet [3]

Hand-crafted features Deep features

+60

Slide credit Amaia Salvador


Detection Objects

17

Girshick Ross Forrest Iandola Trevor Darrell and Jitendra Malik Deformable Part Models are Convolutional Neural Networks CVPR 2015

Convnets (CNNs) actually learn similar detectors to the ones learned by Deformable Parts-based Models (DPMs)


Detection Objects R-CNN

18

Girshick R Donahue J Darrell T amp Malik J Rich feature hierarchies for accurate object detection and semantic segmentation CVPR 2014



19

Slide credit Joost van de Weijer



20




21


Detection Objects Fast R-CNN

22

Girshick Ross Fast R-CNN ICCV 2015



23




24


Same as SPP[3] but single scale



25

He Kaiming Xiangyu Zhang Shaoqing Ren and Jian Sun Spatial pyramid pooling in deep convolutional networks for visual recognition PAMI 2015




26


H

h

w

h

w

Size of pooling binsh Hrsquo x w Wrsquo

wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss


Detection Objects Faster R-CNN

29

Ren S He K Girshick R and Sun J 2015 Faster R-CNN Towards real-time object detection with region proposal networks In Advances in Neural Information Processing Systems (pp 91-99) [Python code] [Matlab code]



30


Selective Search CPMC

MCG

Object Proposal computation is the bottleneck in current state of the art object detection systems

Selective Search Van de Sande K E Uijlings J R Gevers T amp Smeulders A W (2011 November) Segmentation as selective search for object recognition InComputer Vision (ICCV) 2011 IEEE International Conference on (pp 1879-1886) IEEECPMC Carreira J amp Sminchisescu C (2010 June) Constrained parametric min-cuts for automatic object segmentation In Computer Vision and Pattern Recognition (CVPR) 2010 IEEE Conference on (pp 3241-3248) IEEEMCG Arbelaacuteez P Pont-Tuset J Barron J Marques F amp Malik J (2014) Multiscale combinatorial grouping In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (pp 328-335)



31



MCG

Replace the usage of external Object Proposals with a Region Proposal Network (RPN)



32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities

RoI pooling layerFC layersClass scores



33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34


Objectness scores(objectno object)

Bounding Box Regression

In practice k = 9 (3 different scales and 3 aspect ratios)



35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities


4-step training to share features for RPN and Fast R-CNN



38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals

Step 1 Train RPN initialized with an ImageNet pre-trained model

ImageNet weights(fine tuned)



39


Conv Layer 5

Co

nv

laye

rs

RPN Proposals (learned in 1)

Class probabilities

Step 2 Train Fast R-CNN with learned RPN proposals




40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals

Step 3 The model trained in 2 is used to initialize RPN and train again

Weights from Step 2(fixed)



41


Conv Layer 5

Co

nv

laye

rs


Class probabilities

Step 4 Fine tune FC layers of Fast R-CNN using same shared convolutional layers as in 3

Weights from Step 2amp3(fixed)



42


Detection Accuracy (Pascal VOC)

Timing in ms (Pascal VOC)



43




44




45


Xavier Giroacute i Nieto ldquoDeep learning for vision Objectsrdquo Master in Multimedia La Salle URL (May 2016) 46

Detection Objects Reinforcement L

Caicedo Juan C and Svetlana Lazebnik Active object localization with deep reinforcement learning ICCV 2015 [Slides by Miriam Bellver]


Detection Objects Reinforcement LObject is localized based on visual features from AlexNet FC6


Detection Objects Reinforcement Slide credit Miacuteriam Bellver

Set of actions A

Transformation actions



Set of actions A

Terminates the sequence of the current search

Marks the region inhibition-of-return (IoR)



Set of states S

(oh)

o = feature vector from pre-trained CNN fc6 4096 dim

h = history of taken actions binary vector dim 90



Reward Function Rground-truthbounding box



Reward Function R for trigger action

The Reward function considers the number of steps as a cost

3

minimum IoU06



Policy function

If the current state is S which should be the next action A

Reinforcement Learning using a Q-learning



The action-value function is estimated using a neural network that

has as many output units as actions the algorithm incorporates a replay-memory to collect experiences category-specific Q-network

Policy of the agent selection action A with maximum estimated value of the learnt action-value function





Datasets for training and testing PASCAL VOC

Two modes of evaluation

1) All attended Regions (AAR)2) Terminal regions (TR)



Best performance with few region proposals






Detection Faces

60


Detection FacesDDFD

61

Farfade Sachin Sudhakar Mohammad Saberian and Li-Jia Li Multi-view Face Detection Using Deep Convolutional Neural Networks ICMR (2015) [software]


Detection Faces DDFD Train

62

Dataset Source Annotated Facial Landmarks in the Wild by TU Graz 25k annotated faces on images downloaded from Flickr 380k manually annotated facial landmarks



63

Randomly samples sub-windows (blocks) Positive examples if Intersection-over Union (IoU) with an annotated

face is larger than 50 and negative sample otherwise

Total samples 200K positive and 20M negative


Detection Faces DDFD Test

64

Test images are rescaled updown 3 times per octave to find different sizes



65

Sliding window of 227x227 over the test image

Source James Hays ldquoObject Category Detetcion Sliding Windowsrdquo (Brown University 2011)



66

Fully-connected layers are converted to convolutional layers which allows processing images from any size

Long Jonathan Evan Shelhamer and Trevor Darrell Fully Convolutional Networks for Semantic Segmentation CVPR 2015



67

This makes possible to Efficiently run the convnet on images of any size Obtain a heat-map of the face etector



68

Non-Maximum Suppression (NMS) to avoid overlapped detections

Source Adrian Rosebrock ldquoNon-Maximum Suppression for Object Detection in Pythonrdquo (Pyimagesearch 2014)


Detection Faces DDFD Results

69



70

Precision vs Recall Curves

- DPM corresponds to Deformable Part-based Models- OpenCV face detector is an implementation of Viola amp Jones- IMPORTANT DPM or Headhunter need extra information about pose or facial landmarks during

training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Faces Recognition FaceNet

Schroff Florian Dmitry Kalenichenko and James Philbin FaceNet A Unified Embedding for Face Recognition and Clustering CVPR 2015

(Extended summary slides by Xavier Giro on the ReadCV seminar)



FacesEuclidean space where distances correspond to face similarity

FaceNet


Faces Recognition FaceNetEnd-to-end learning of an embedding (distance metric learning)

Weinberger Kilian Q and Lawrence K Saul Distance metric learning for large margin nearest neighbor classification The Journal of Machine Learning Research 10 (2009) 207-244


Faces Recognition FaceNetby means of well chosen triplets using curriculum learning

Bengio Yoshua Jeacuterocircme Louradour Ronan Collobert and Jason Weston Curriculum learning In Proceedings of the 26th annual international conference on machine learning pp 41-48 ACM 2009





Zeiler Matthew D and Rob Fergus Visualizing and understanding convolutional networks In Computer

VisionndashECCV 2014 pp 818-833 Springer International Publishing 2014 (Slides by Xavier Giroacute-i-Nieto)

Architecture 1 (NN1) ZF


Faces Recognition FaceNetArchitecture 2 (NN2) GoogLeNet

Szegedy Christian Wei Liu Yangqing Jia Pierre Sermanet Scott Reed Dragomir Anguelov Dumitru Erhan Vincent

Vanhoucke and Andrew Rabinovich Going Deeper With Convolutions CVPR 2015 (Slides by Elisa Sayrol)




Faces Recognition FaceNet Test

LBW 9963 (new record)YouTubeFaces DB 9512


Faces Recognition FaceNet SoftwareSoftware implementation OpenFace


Faces Recognition VGG Face

Parkhi Omkar M Andrea Vedaldi and Andrew Zisserman Deep face recognition Proceedings of the British Machine Vision 1 no 3 (2015) 6 [software]


E Mohedano Salvador A McGuinness K Giroacute-i-Nieto X OConnor N and Marqueacutes F ldquoBags of Local Convolutional Features for Scalable Instance Searchrdquo ICMR 2016

83

Objects Recognition Retrieval



Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome


Instance Retrieval(Instance Object Building Person Placehellip)




v1 = (v11 hellip v1n)

vk = (vk1 hellip vkn)

INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10

Local hand-crafted features(eg SIFT)

Bag of Visual WordsN-Dimensional

feature space High-dimensionalHighly sparse



Krizhevsky A Sutskever I amp Hinton G E (2012) Imagenet classification with deep convolutional neural networks In Advances in neural information processing systems (pp 1097-1105)

Convolutional Neural Networks



Babenko A Slesarev A Chigorin A amp Lempitsky V (2014) Neural codes for image retrieval In ECCV 2014Razavian A Azizpour H Sullivan J amp Carlsson S (2014) CNN features off-the-shelf an astounding baseline for recognition In DeepVision CVPRW 2014

Convolutional Neural Networks FC layers as global feature representation



Babenko A amp Lempitsky V (2015) Aggregating local deep features for image retrieval ICCV 2015Tolias G Sicre R amp Jeacutegou H (2015) Particular object retrieval with integral max-pooling of CNN activations ICLR 2015Kalantidis Y Mellina C amp Osindero S (2015) Cross-dimensional Weighting for Aggregated Deep Convolutional Features arXiv preprint arXiv151204065


summax pooled conv features as global representation



Ng J Yang F amp Davis L (2015) Exploiting local features from deep networks for image retrieval In DeepVision CVPRW 2015


conv features encoded with VLAD as global representation





(336x256)Resolution

conv5_1 from VGG16[1]

(42x32)

25K centroids 25K-D vector


Objects Recognition RetrievalQuery Representation

Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Objects Segmentation

97

Slide credit Eduard Fontdevila

Semantic segmentation assign a category label to all pixels in an image


Objects Segmentation Farabet

98

Farabet Clement Camille Couprie Laurent Najman and Yann LeCun Learning hierarchical features for scene labeling TPAMI 2013



99

Pyramid of three spatial scales



100

The same parameters in the three convnets

theta_i=theta_0=filters weights (H_l) and biases b_l)

Non-linear tanhPooling max



101

Upsampling and concatenation



102

Pixel-wise soft-max classifier



103

Problem No spatial consistency among labels

3 explored solutions

1) Superpixels2) Conditional Random Fields3) Parameter-free multilevel parsing



104

Prediction with a 2-layer network

Solution 1 Superpixels



105


Solution 2 Superpixels + CRF



106

Solution 3 Multi-level parsing

Problems with Solutions 1 amp 2 Observation level

BPT [Garrido Salembier]



107



Contribution Automatically discover the best observation level (optimal cover) for each pixel in the image



108




C2 will be labelled with the class of C5

For each pixel (leaf) i the optimal component is the C_i is the one along the path between the leaf and the root with minimal cost S


Objects Segmentation SDS

109


Hariharan Arbelaez Girshick Malik Simultaneous Detection and Segmentation (ECCV 2014)



110


Interest in obtaining segments not just bounding boxes

Multiscale combinational grouping (MCG) to generate object candidates

Cuts algorithm

Hierarchical segmenter

Grouping strategy to combine

multiscale regions



111


BBOX CNNfeature vector

1

feature vector

2

[1 2]

Finetuned to classify bboxes (with background) so extracting features from the region foreground is

suboptimal

BBOX CNN

vector A

background masked out with the mean image



112


Training 2 networks trained in isolation

Testing results are combined


1

feature vector

2

[1 2]

REGION CNN

vector B



113


Training as a whole (using segmentation overlap)

Testing results are combined (using the output of the penultimate layer)

vector C



114


penultimate fully connected layer

SVM



115




116


Results on pixel IU (Jaccard index) to evaluate semantic segmentation

Convert the output of the final system (C+ref) into a pixel-level

category labeling (using pasting scheme Carreira et al)



117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you

httpsimatgeupceduwebpeoplexavier-giro

httpstwittercomDocXavi

httpswwwfacebookcomProfessorXavi

xaviergiroupcedu



One lecture organized in three parts

2

Images (global) Objects (local)

Deep ConvNets for Recognition for

Video (2D+T)



3


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



4


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



5







Proposals DeepBox

6



Proposals DeepBox

7





8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




3


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



4


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



5







Proposals DeepBox

6



Proposals DeepBox

7





8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




4


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



5







Proposals DeepBox

6



Proposals DeepBox

7





8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




5







Proposals DeepBox

6



Proposals DeepBox

7





8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Proposals DeepBox

6



Proposals DeepBox

7





8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Proposals DeepBox

7





8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




8




AlexNetarchitecture

(heavier)

DeepBoxarchitecture

(lighter)

Small drop



9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




9












10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




10





11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




11





12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




12




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




13





14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




14


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Detection Objects

15


Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Detection Objects

16



+60



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Detection Objects

17





18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




18




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




19




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




20




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




21



22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




22




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




23




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




24





25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




25





26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




26


H

h

w

h

w


wWrsquo

hHrsquomax pooling

CONV5



27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




27


AlexNet [4] VGG16 [5] VGG_1024 [6]



28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




28


Multi-task loss



29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




29




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




30



MCG





31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




31



MCG




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




32


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




33


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




34







35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




35


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




36


Fast R-CNN



37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




37


Conv Layer 5

Co

nv

laye

rs

RPN RPN Proposals

RPN Proposals

Class probabilities





38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




38


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




39


Conv Layer 5

Co

nv

laye

rs


Class probabilities





40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




40


Conv Layer 5

Co

nv

laye

rsRPN RPN Proposals





41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




41


Conv Layer 5

Co

nv

laye

rs


Class probabilities





42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




42






43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




43




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




44




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




45









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu









Set of actions A




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




Set of actions A





Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




Set of states S

(oh)










3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu









3

minimum IoU06



Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




Policy function























Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu






















Detection Faces

60


Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Detection FacesDDFD

61




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




62




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




63






64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




64




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




65





66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




66





67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




67




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




68





69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




69



70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




70



training



71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




71


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals








FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu









FaceNet






























83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu































83




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




Image Database

Visual Query

ldquoA dogrdquo

Expected outcome



Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




Image Database

Visual Query

ldquoThis dogrdquo

Expected outcome








INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu









INVERTED FILE

word Image ID1 1 12 2 1 30 1023 10 124 23 6 10


























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu
























(336x256)Resolution


(42x32)




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




Global Search(GS)

Local Search(LS)





96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu






96


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals



97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




97





98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




98




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




99




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




100






101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




101




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




102




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




103






104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




104





105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




105





106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




106






107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




107






108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




108








109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




109





110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




110




Cuts algorithm



multiscale regions



111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




111



1

feature vector

2

[1 2]


suboptimal

BBOX CNN

vector A




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




112





1

feature vector

2

[1 2]

REGION CNN

vector B



113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




113




vector C



114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




114



SVM



115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




115




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




116







117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




117




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu




118


Local analysis for

Segmentation

person

bag

me

my bagperson

bag

Proposals


Thank you




xaviergiroupcedu



Thank you




xaviergiroupcedu


deep learning for computer vision (2/4): object analytics @ lasalle 2016

Technology