parking space detection based on a multi-task deep ......spatial transformer network • reducing...

26
Parking Space Detection Based on A Multi-task Deep Convolutional Network with Spatial Transform Hoang Tran Vu and Ching-Chun Huang Department of Electrical Engineering, Chung Cheng University, Taiwan Speaker: Hoang Tran Vu

Upload: others

Post on 16-Oct-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Parking Space Detection Based on A Multi-task Deep Convolutional Network with Spatial Transform

Hoang Tran Vu and Ching-Chun Huang

Department of Electrical Engineering, Chung Cheng University, Taiwan

Speaker: Hoang Tran Vu

Page 2: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

2/26

Page 3: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

3/26

Page 4: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Goal

• Using a CNN-based deep learning framework to infer the parking status

4/26

Page 5: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

5/26

Page 6: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Challenges

• Inter-object occlusion and perspective distortions

6/26

Page 7: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Challenges

• Non-unified vehicle size and uncontrollable parking displacement.

7/26

Page 8: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

8/26

Page 9: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Proposed method

9/26

• Three main parts: – Spatial Transformer network

(STN)

– Neighbor’s Hypotheses Prediction Network (NHPN)

– Inference layer

Page 10: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline• Goal• Challenges• Proposed method

– Spatial Transformer network– Neighbor’s Hypotheses Prediction Network– Inference layer

• Results and Discussions• Conclusions• Future works

10/26

Page 11: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Spatial Transformer network• Reducing the variations from perspective distortion, parking

displacement, and vehicle size.– Using a spatial transformer network (STN) [15]

[15] Max Jaderberg, Karen Simonyan, Andrew Zisserman, and Koray Kavukcuoglu, “Spatial transformer networks,” in Advances in Neural Information Processing Systems, pages 2017–2025, 2015.

(𝑥𝑠 , 𝑦𝑠 ): the source coordinate in the input image𝑇𝜃 : 2D affine transformation (6 parameters)

(𝑥𝑡 , 𝑦𝑡 ): the target coordinate in the transformed image

11/26

Page 12: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline• Goal• Challenges• Proposed method

– Spatial Transformer network– Neighbor’s Hypotheses Prediction Network– Inference layer

• Results and Discussions• Conclusions• Future works

12/26

Page 13: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Neighbor’s Hypotheses Prediction Network

• Solving the inter-occlusion problem.– Designing a CNN-based deep learning network to predict the status of

a 3-space.

𝐿1 𝑊𝐹 ,𝑊𝜃, 𝑋𝑡, 𝐻 = −1

8𝑁

𝑛=1

𝑁

𝑘=1

8

ℎ𝑛𝑘𝑙𝑜𝑔𝑝𝑛

𝑘 + (1 − ℎ𝑛𝑘)log(1 − 𝑝𝑛

𝑘)

WF : NHPN parameters.𝑊𝜃 : STN parameters.𝑋𝑡 : input training set.𝐻 : corresponding status labels.𝑁 : sample number. ℎ𝑛𝑘 : the label of the kth hypothesis of the

nth sample

13/26

Page 14: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Neighbor’s Hypotheses Prediction Network

• This network is designed with three properties.– Being determined by many stages separated by a pooling layer.

– down-sampling the input image to a small size before applying fully connected layers for classification.

– Increasing the number of kernels in the later layers

14/26

Page 15: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline• Goal• Challenges• Proposed method

– Spatial Transformer network– Neighbor’s Hypotheses Prediction Network– Inference layer

• Results and Discussions• Conclusions• Future works

15/26

Page 16: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Inference layer

• Inferring the status of the considered space.– Building a 2- class logistic regression model on the top of NHPN.

𝑊𝐶 : inference layer parameters.𝑃 : input training set.𝑌 : corresponding status labels.𝑁 : sample number. y𝑛 : the label of the middle space of the nth 3-space unit.𝑆𝑛1 and 𝑆𝑛

0 : the occupied and vacant probabilities of the middle space.

𝐿2 𝑊𝐶 , 𝑃, 𝑌 = −1

𝑁

𝑛=1

𝑁

y𝑛 log 𝑆𝑛1 + 1 − y𝑛 log 𝑆𝑛

0

16/26

Page 17: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

17/26

Page 18: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Results and Discussions

[3] C.C. Huang, Hoang Tran Vu, “Vacant Parking Space Detection based on a Multi-layer Inference Framework,” IEEE Transactions on Circuit and Systems for Video Technology, May 2016.

• The training and testing datasets are collected within 1 month.– 8277 images for training.

– 525 images for testing.

Huang’s work [3] : one of state-of-art for hand-craft feature based methods

CNN1 : CNN(considered space) + L2(.) CNN2 : CNN(three spaces) + L2(.)CNN-STN1 : CNN(three spaces) + STN + L1(.) CNN-STN2 : CNN(three spaces) + STN + L2(.)

18/26

Page 19: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Results and Discussions

19/26

Page 20: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Results and Discussions

• Understanding what the network learned in the feature domain.– Generating the synthetic images [18] that cause high activation.

[18] J. Yosinski, J. Clune, A. Nguyen, T. Fuchs, H. Lipson. “Understanding neural networks through deep visualization,” in ICML Deep Learning Workshop, 2015.20/26

Page 21: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Results and Discussions

• The real-time camera view and detection results.

[17] C. C. Huang. (2015). Huang’s Projects. [Online]. Available at http://acm.ee.ccu.edu.tw:2017.21/26

Page 22: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

22/26

Page 23: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Conclusions• Proposing a deep convolutional network for parking space

detection.– Addressing the practical challenges : lighting variations, parking

displacement, non-unified car size.

– Integrating a convolutional spatial transformer network (STN) to crop the local image area adaptively.

– Adopting a multi-task loss function to handle the inter-object occlusion problems.

23/26

Page 24: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Outline

• Goal

• Challenges

• Proposed method

• Results and Discussions

• Conclusions

• Future works

24/26

Page 25: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Future works

• Enhancing the low contrast areas as well as learning the invariant color features.

• Using the transfer learning methods to transfer the information between different domains (different parking spaces or different viewing angles) efficiently.

25/26

Page 26: Parking Space Detection Based on A Multi-task Deep ......Spatial Transformer network • Reducing the variations from perspective distortion, parking displacement, and vehicle size

Thanks for listening!

26/26