double-constrained rpca based on saliency maps for foreground detection in automated maritime...

Double-constrained RPCA based on Saliency Maps for Foreground Detection in Automated Maritime Surveillance

Andrews Sobral, Thierry Bouwmans and El-hadi ZahZahPh.D. Student, Computer Vision

Lab. L3I/MIA – University of La Rochelle, France

Summary

▪ Context: Maritime Surveillance

▪ Understanding an Intelligent Video Surveillance Framework

▪ Introduction to Background Subtraction

▪ Robust Principal Component Analysis (RPCA)

▪ The SCM-RPCA approach

▪ Definition of shape and confidence map

▪ Solving the SCM-RPCA

▪ Experimental results

▪ Conclusions

Context: Maritime Surveillance

http://www.asv.fr/en/mediaImages from:

▪ The automatic video surveillance of maritime environment aims to:

▪ Detect and identify specific ships, vessels or boats on the sea and their activity.

▪ Extend the capabilities to identify maritime risks: piracy, trafficking, immigration, etc.

▪ However, the development of automatic video surveillance applications for maritime environment is a very difficult task due to the complexity of the scenes: moving water, waves, etc.

▪ The motion of the objects of interest (i.e. ships or boats) can be mixed with the dynamic behavior of the background (non-regular patterns).

http://www.asv.fr/en/media



!

Video Content Analysis(VCA) or Video Analytics

Behavior Analysis

Image acquisition and preprocessing

ObjectDetection

ObjectTracking

event location

Intrusion detection Collision

prevention

Target detection and tracking

Anomaly detection

Target behavior analysis

Traffic data collection and

analysis

activity report

Understanding an Intelligent Video Surveillance Framework

supervisor

Behind the Scenes of an Intelligent Video Surveillance Framework

!

Example of automatic incident detection

our focus

Introduction to Background Subtraction

Initialize Background Model

frame modelForegroundDetection

Background Model Maintenance

Background subtraction methods

Traditional methods:• Basic methods, mean and variance over time• Fuzzy based methods• Statistical methods • Non-parametric methods• Neural and neuro-fuzzy methods

Matrix and Tensor Factorization methods:• Eigenspace-based methods (PCA / SVD)• RPCA, LRR, NMF, MC, ST, etc.• Tensor Decomposition, NTF, etc.

BGSLibrary (C++)https://github.com/andrewssobral/bgslibrary

A large number of algorithms have been proposed for background subtraction over the last few years:

LRSLibrary (MatLab)https://github.com/andrewssobral/lrslibrary

our focus

Andrews Sobral and Antoine Vacavant. A comprehensive review of background subtraction algorithms evaluated with synthetic and real videos. Computer Vision and Image Understanding (CVIU), 2014. http://dx.doi.org/10.1016/j.cviu.2013.12.005

https://github.com/andrewssobral/bgslibrary

https://github.com/andrewssobral/bgslibrary

https://github.com/andrewssobral/lrslibrary



http://dx.doi.org/10.1016/j.cviu.2013.12.005

http://dx.doi.org/10.1016/j.cviu.2013.12.005

Robust Principal Component Analysis (RPCA)

▪ RPCA can be formulated as the problem of decomposing a data matrix (M) into two components (L) and (S), where (M) is the sum of a low-rank matrix (L) and a sparse matrix (S):

▪ M = L + S

Sparse error matrix

SL

Underlying low-rank matrix

M

Matrix of corrupted observations

Robust Principal Component Analysis (RPCA)

http://perception.csl.illinois.edu/matrix-rank/home.html

M L S

(tradeoff between rank and sparsity)

http://perception.csl.illinois.edu/matrix-rank/home.html

What about RPCA for dynamic background?

▪

The SCM-RPCA approach

▪ We propose to combine some ideas of Oreifej et al. (2013) and Yang et al. (2015).

▪ The weighting matrix proposed by Yang et al. (2015) can be used as a shape constraint (or region constraint),

▪ While the confidence map proposed by Oreifej et al. (2013) reinforces the pixels belonging from the moving objects.

▪ The original 3WD was modified adding the shape constraint as has been done in the RMAMR.

▪ We chose to modify the 3WD instead of RMAMR due its capacity to deal more robustly with the multimodality of the background.

Definition of shape and confidence map

▪ The second contribution of this paper refers to the way of building the shape constraint and confidence map.

▪ Instead of using dense optical flow (temporal descriptor) as a preliminary step, we suggest to use a saliency detector (spatial descriptor).

▪ In some cases where:▪ a) the object of interest moves very slowly (i.e long distance boats) or

▪ b) the background is very dynamic (i.e boats in the sea), the optical flow may not be enough to ensure the object detection.

▪ In addition, the computation of the dense optical flow usually request more computational cost than saliency maps.▪ In this work, the BMS method proposed by Zhang and Sclaroff (2014) was selected by its speed performance and

visual results.

http://cs-people.bu.edu/jmzhang/BMS/BMS.html



Block diagram of the SCM-RPCA

Is important to note that the double constraints (confidence map and shape) can be built from two different types of source (i.e. from spatial, temporal, or spatio-temporal information), but in this work we focus only on spatial saliency maps.

Solving the SCM-RPCA

Experimental results

UCSD MarDT

The UCSD and MarDT data sets consists of 18 and 28 video sequences, respectively, both acquired from a stationary and moving camera, but in this work (due to page limits) we have selected only four sequences from UCSD and three sequences from MarDT.

http://www.svcl.ucsd.edu/projects/background\_subtraction/ucsdbgsub\_dataset.htmhttp://www.dis.uniroma1.it/~labrococo/MAR/index.htm

http://www.svcl.ucsd.edu/projects/background/_subtraction/ucsdbgsub/_dataset.htm



http://www.dis.uniroma1.it/~labrococo/MAR/index.htm

http://www.dis.uniroma1.it/~labrococo/MAR/index.htm

Quantitative results on UCSD data set

The quantitative results in Table 2 show that the SCM-RPCA outperforms the previous methods, with the highest F-measure average over the selected video sequences.

Is important to note that all constrained RPCA evaluated in this paper have been used the saliency maps as the input constraint.

Benchmark evaluation over UCSD data set

The algorithms were implemented in MATLAB (R2014a) running on a laptop computer with Windows 7 Professional 64 bits, 2.7 GHz Core i7-3740QM processor and 32Gb of RAM.

We can see that the combination of shape constraint and confidence map did not changed significantly the number of iterations and computation time over original 3WD.

Visual results on UCSD data set

From left to right: (a) input frame, (b) saliency map generated by BMS, (c) ground truth, (d) proposed approach, (e) 3WD, and (f) RMAMR.

Visual results on MarDT data set

Is important to note that in the UCSD scenes we have used the original spatial saliency map provided by BMS, while for the MarDT scenes we have subtracted its temporal median due to the high saliency from the buildings around the river.

Conclusions

▪ The experimental results indicate a better enhancement of the object foreground mask when compared with its direct competitors.

▪ As shown in quality evaluation, the combination with confidence map and shape constraint can reduce the amount of false positive pixels.

▪ In the future work, we will investigate how the combination of different types of source can improve the foreground detection.

▪ We are also interested to perform the decomposition incrementally as in Online RPCA proposed by Feng et al. (2013).

▪ MATLAB codes available: https://sites.google.com/site/scmrpca

https://sites.google.com/site/scmrpca

https://sites.google.com/site/scmrpca

double-constrained rpca based on saliency maps for foreground detection in automated maritime...

Science