qingxiong yang, student member, ieee, liang wang, student member, ieee,

Stereo Matching with Color-Weighted Correlation, Hierarchical Belief Propagation,and Occlusion HandlingQingxiong Yang, Student Member, IEEE, Liang Wang, Student Member, IEEE,Ruigang Yang, Member, IEEE, Henrik Stewe´ nius, Member, IEEE, and David Niste´ r, Member, IEEE

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL. 31, NO. 3, MARCH 2009

Outline• Introduction• System Overview•Methods• Initialization• Pixel Classification• Iterative Refinement• Fast-Converging Belief Propagation

• Depth Enhancement• Experiments• Conclusion

Introduction

Introduction• Stereo is one of the most extensively researched

topics in computer vision.

• Energy Minimization framework:• Graph Cut• Belief Propagation(BP)

Objective(Contribution)• To formulate stereo model with careful handling

of:• Disparity• Discontinuity• Occlusion

• Differs from the normal framework in the final stages of the algorithm

• Outperforms all other algorithms on the average

System Overview

•1) Initialization

•2) Pixel Classification

•3) Iterative Refinement

Initialization(Block 1)

Initialization• Input:• Left Image IL

• Right Image IR

• Output:• Initial Left Disparity Map DL

• Initial Right Disparity Map DR

• Initial Data Term ED(0)

Color-Weighted Correlation

Correlation Volume

Data Term Initialization

Hierarchical BP

Disparity Map Initialization

DL(0) DR

Correlation Volume

Hierarchical BP

Initialization• Color-weighted Correlation

• To build the Correlation Volume

• Makes the match scores less sensitive to occlusion boundaries

• By using the fact that occlusion boundaries most often cause color discontinuities as well

DL(0) DR

Correlation Volume• Color difference Δxy between pixel x and y

(in the same image)

Ic: Intensity of the color channel c

• The weight of pixel x in the support window of y:

Color Difference Spatial Difference

Correlation Volume• The Correlation Volume[27]:

• Wx : support window around x• d(yL, yR ) : pixel dissimilarity[1]

• xL , yL : pixels in left image IL

• xR , yR : corresponding pixels in right image IR

• dx : disparity value of pixel XL in IL

weight

Pixels in the window Dissimilarity[1]

dx = arg min CL,x (yL, yR)

weight

xR = xL – dx

yR = yL – dx

Correlation VolumeDisparity Map Bad Pixel

[27] K.-J. Yoon and I.-S. Kweon, “Adaptive Support-Weight Approach for Correspondence Search,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 28, pp. 650-656, 2006.

[1] S. Birchfield and C. Tomasi, “A Pixel Dissimilarity Measure That Is Insensitive to Image Sampling,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 20, pp. 401-406, 1998.

Correlation Volume

Hierarchical BP

Initialization• Initial Data Term• Total energy 　 = Data Term + Smooth Term

• Computed from Correlation Volume

• Given an iteration index i = 0 here because it will be iteratively refined

DL(0) DR

Initial Data Term• Initial Data Term:

• Ƞbp : twice the average of correlation volume to exclude the outliers

Correlation Volume0.2

X 2Average

Correlation Volume

Hierarchical BP

Initialization• hierarchical Belief Propagation

• Employed with the data term and the reference image

• Resulting in the initial left and right disparity maps DL

(0) and DR

DL(0) DR

Pixel Classification

(Block 2)

Pixel ClassificationInput

Output

Pixel Classification•Mutual Consistency Check• Requires that the disparity value from the left and

right disparity maps are consistent, i.e.,

• Not Pass : occluded pixel• Pass : unoccluded pixel 　　　　 => Correlation Confidence Measure

• Correlation Confidence• Based on how distinctive the highest peak in a

pixel's correlation profile is

• 　 : the cost for the best disparity value• 　 : the cost for the second best disparity value 　　　　

Pixel Classification

If > αs stableElse unstable

dx = arg min CL,x (yL, yR)

Iterative Refinement

(Block 3)

• Goal: to propagate information from the stable pixels to the unstable and the occluded pixels

Iteration

• Color Segmentation• Color segments in IL are extrated by Mean Shift[6]

[6] D. Comaniciu and P. Meer, “Mean Shift: A Robust Approach Toward Feature Space Analysis,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 24, pp. 603-619, 2002.

• Plane Fitting

• Using the disparity values for the stable pixels in each color segment

• Disparity values are taken from the current hypothesis for the left disparity map DL

(i). (Initial: DL(0))

• The plane-fitted depth map is used as a regularization for the new disparity estimation.

• Plane Fitting• Using RANSAC[10]

• Iterates until the plane parameters converge

Iterative Refinement[10] M.A. Fischler and R.C. Bolles, “Random Sample Consensus: A Paradigm for Model Fitting with Applications to Image Analysis and Automated Cartography,” Comm. ACM, vol. 24, pp. 381-395, 1981.

• Plane Fitting output : D(i)

• The ratio of stable pixels of each segment:• If Ratio > ȠS

• Stable pixels: D(i)

• Unstable, Occluded pixels: D(i)

• If Ratio ≤ ȠS

• All pixels : D(i)

Iterative Refinementpf

Iteration

• Absolute Difference:

• D(i+1) : New Disparity Map• D(i) 　 : Plane-fitted Disparity Map

• Data Term:

• The core energy minimization of our algorithm is carried out via the hierarchical BP algorithm.

Belief Propagation

Total Energy for Pixel X

Data Term Smooth Term

•Max-Product BP[25] :

• 　　　　 : Message vector passed from pixel X to one of its neighbors Y

Max-Product Belief Propagation

Data Term

Jump Cost

[25] Y. Weiss and W. Freeman, “On the Optimality of Solutions of the Max-Product Belief Propagation Algorithm in Arbitrary Graphs,” IEEE Trans. Information Theory, vol. 2, pp. 732-735, 2001.

• Jump Cost:

• dx : Disparity of pixel X

• d : Disparity of pixel Y (X’s neighbor)

• αbp : The number of disparity levels / 8

• ρs : 1 – (normalized average color difference)

• ρbp : The rate of increase in the cost

Disparity Differenceof pixel X and its neigbor Y

• Total Energy for pixel X:

• Finally the label d that minimizes the total Energy for each pixel is selected.

Smooth TemData Tem

• Standard loopy BP algorithm is too slow.

• Hierarchical BP[9] runs much faster while maintaining comparable accuracy.

• Works in a coarse-to-fine manner

Hierarchical Belief Propagation

[9] P.F. Felzenszwalb and D.P. Huttenlocher, “Efficient Belief Propagation for Early Vision,” Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, vol. 1, pp. 261-268, 2004.

Hierarchical Belief PropagationCoarser(Level 1)

Finer(Level 0)

• A large number of iterations is required to guarantee convergence in a standard BP algorithm.

• Fast-Converging BP effectively removes the redundant computation.

• Only updating the pixels that have not yet converged (value bigger than ȠZ )

Fast-Converging Belief Propagation

Depth Enhancement

• To reduce the discontinuities caused by the quantization

• Sub-pixel Estimation algorithm is proposed.

• Cost Function:

Depth Enhancement

• The depth with the minimum of the cost function:

• d: the discrete depth with the minimal cost• d+: d+1• d- : d-1

• Replace each value with the average of those values that are within one disparity over a 9 x 9 window

Depth Enhancement

Experiments

ExperimentsParameter Settings Used Throughout:

ExperimentsResults on the Middlebury Data Sets with Error Threshold 1

nonocc : The subset of the nonoccluded pixelsdisc : The subset of the pixels near the occluded areas. all : The subset of the pixels being either nonoccluded or half-occluded

Error%

ExperimentsResults on the Middlebury Data Sets with Error Threshold 0.5

Color-Weighted Correlation Voume :

Initial Hierarchical BP:

Plane fitting:

Integer-Based Disparity Map:

Depth Enhancement:

Ground Trueh:

Conclusion

Conclusion• Propose a stereo model based on • energy minimization• color segmentation• plane fitting• repeated application of hierarchical BP• depth enhancement

• A fast converging BP approach is proposed.• Preserves the same accuracy as the standard BP• The runtime is sublinear to the number of iterations.

Conclusion• The algorithm is currently outperforming the

other algorithms on the Middlebury data sets on average.

• There’s space for Improvement:• Only refined the disparity map for the reference

image• [19] suggests that, by generating a good disparity

map for the right image, the occlusion constraints can be extracted more accurately.

J. Sun, Y. Li, S.B. Kang, and H.-Y. Shum, “Symmetric Stereo Matching for Occlusion Handling,” Proc. IEEE CS Conf. Computer Vision and Pattern Recognition, vol. 2, pp. 399-406, 2005.

qingxiong yang, student member, ieee, liang wang, student member, ieee,

Documents

ieee transactions on smart grid 1 reactive … adaptive...

ieee transactions on dependaple and secure … · mahmoud...

muhammad tahir, senior member, ieee ieee...

jayanth nayak, ertem tuncel, member, ieee, and deniz...

232 ieee transactions on very large scale … · matteo...

joint histogram based cost aggregation for stereo matching...

1370 ieee transactions on pattern ... - · pdf filepenrose...

member, ieee, s. villar, member, ieee, senior member, ieee

2814 ieee transactions on wireless communications,...

tracking video objects in cluttered background andrea...

submitted to ieee transactions on … polar codes alexios...

vapr: void aware pressure routing for underwater sensor...

authors: yanchao zhang, member, ieee, wei liu, wenjing...

ieee journal on selected areas in communications, … ·...

ieee transactions on dependable and secure ...member, ieee,...

limin liu, member, ieee zhen li, member, ieee edward j....

yi cui, member, ieee, yuan xue, member, ieee, and klara...

alireza sadeghi , member, ieee ieee proof · alireza...

bader al- manthari , member, ieee, hossam hassanein , ...

ieee copyright statementbrunos/publications/... · by luca...