domain transformation-based efficient cost aggregation for local stereo matching cuong cao pham and...

49
Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits and Systems for Video Technology, 2012 1

Upload: shon-flynn

Post on 20-Jan-2016

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching

Cuong Cao Pham and Jae Wook Jeon, Member, IEEE

IEEE Transactions on Circuits and Systems for Video Technology, 2012

1

Page 2: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Outline

• Introduction

• Framework

• Proposed Algorithm

• Compute Costs

• Cost Aggregation : Domain Tramsformation

• Optimization & Refinment

• Experimental Results

• Conclusion2

Page 3: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Introduction

3

Page 4: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Background

• Global stereo algorithms:• High accuracy but low speed

• Local stereo algorithms :• High speed but low accuracy• The key : cost aggregation

4

Adaptive support-weight[4] :‧The most well-known local method‧The state-of-art local algorithm‧Reduce the gap between global method and local method

→ Excessive time consumption related to support window size

[4] K.-J. Yoon and I.-S. Kweon, “Adaptive Support-Weight Approach for Correspondence Search,” IEEE Trans. Pattern Anal. Mach. Intell., vol.28, no. 4, pp. 650-656, 2006.

Page 5: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Related Work

• Adaptive Weight[4]

• Bilateral filter

• Cost-volume filtering[21]

• Guided filter

• Geodesic Diffusion[27]

• Anisotropic diffusion

→ Geodesic diffusion

5[21] C. Rhemann, A. Hosni, M. Bleyer, C. Rother, and M. Gelautz, “Fast Cost-Volume Filtering for Visual Correspondence and Beyond,” in Proc.IEEE Intl. Conf. Comput. Vis. Pattern Recognit. (CVPR), pp. 3017-3024,2011.[27] L. De-Maeztu, A. Villanueva, and R. Cabeza, “Near Real-Time Stereo Matching Using Geodesic Diffusion,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 34, no. 2, pp. 410 - 416, 2012.

Page 6: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Objective

• Present a cost aggregation technique:

• Achieve high precision

• Fast execution

• Using Domain transformation

• Domain transformation:

• Aggregation of 2D cost data → a sequence of 1D filters

• Lower computational requirements

6

Page 7: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Framework

7

Page 8: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Framework

8

Page 9: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

ProposedAlgorithm

9

Page 10: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Pixel-wise Cost Consumption

• Truncated absolute difference (TAD) :

• TAD of the gradient :

• Final cost data:10

Ii(p): intensity value of the i-th color channel in the RGB color space at pixel p of the image I

Tc : user-defined truncation value

Page 11: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Aggregation 1D Cost Data

• Inspired by the domain transformation technique[14]

• Dimensionality reduction technique

• Defines a geodesic distance-preserving representation of a 2D

image embedded in 5D (x, y, Ir, Ig, Ib) as a real line.

• Aggregation of 2D cost data → a sequence of 1D filters

• Reduce computational time

11

[14] Eduardo S. L. Gastal and Manuel M. Oliveira, “Domain Transform for Edge-Aware Image and Video Processing,” ACM Trans. Graph., vol. 30, no. 4, 2011.

Page 12: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

n-1 n

Aggregation 1D Cost Data

• 1D discrete signal:

• Cost slide Cd :

• Feedback comb filter[32]:

• Cd,y : input signal

• Cd,y : output signal

• a feedback coefficient 12

‘ a : consistent → non-edge-aware filter

row y

[32] J. Smith, “Introduction to Digital Filters with Audio Applications,” W3K Publishing, 2007.

Page 13: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• 1D discrete signal:

• Cost slide Cd :

• Feedback comb filter[32]:

• Cd,y : input signal

• Cd,y : output signal

• a feedback coefficient

n-1 n

Aggregation 1D Cost Data

13

‘ a : consistent → non-edge-aware filter

row y

Page 14: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• Two similar samples set a high value of a• Two different samples set a low value of a ( Discontinue region → prevent the propagation train )

• Edge-aware feedback comb filter:

• g : chosen metric representing the dissimilarity between two samples

• Compute g as the distance between two samples in the 1D domain (transformed from the corresponding row of the guidance image I)

Aggregation 1D Cost Data

14

Page 15: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Domain Transformation

• I : R2→ R3 (a 2D RGB color image)

• p = (xp, yp) : spatial coordinate

• I(p) = (rp, gp, bp) : range coordinates

• Goal: find a transform t :R2→ R which preserves the original distances between points on C (given by some metric)

15

g

v

R2

R3

Page 16: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• L1 distance between two neighboring points in the original domain R2

• Distance between two corresponding samples in the new domain R

• gt(x) = t (x, I(x)) : the transformation operator at point x

Domain transformation

16

must equal

R2R

Page 17: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• Divide both sides by h and take the limit as h→0:

• The value at any point u in the transformed domain:

(By taking the integral of gt′ (x) from 0 to u)

Domain transformation

17

Page 18: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• The value at any point u in the transformed domain:

• The distance between any two points u and v in the transformed domain :

(corresponds to the arc length from u to v of the signal I)

Domain transformation

18

Page 19: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• The distance between any two points u and v :

• We can also control the influence of spatial and intensity range

information similar to the bilateral filter.

• Embedding the values of σs and σr :

Domain transformation

19

Page 20: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

• Select the maximum absolute difference to define the distance between two points in the original domain:

• The final distance g:

Domain transformation

20

Page 21: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Domain transformation

21

Left image Non-edge-aware filter Edge-aware filter

Page 22: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Aggregation 2D Cost Data

22

Page 23: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Aggregation 2D Cost Data

• 1. Left → Right

• 2. Right → Left

• 3. Top → Bottom

• 4. Bottom→ Top

23

Page 24: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Aggregation 2D Cost Data

24

L→ R

R→ L

T→ B

B→ T

Page 25: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Aggregation 2D Cost Data

• is the 1D discrete signal plotted from each

column along the y direction of the cost slide Cd

• :

• σH : kernel standard deviation (implicitly set to σs)

• σs [10,300] and σr [0.01,0.3] can yields good results.

25

Page 26: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Aggregation 2D Cost Data

26

‧Algorithm:

Page 27: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Optimization & Refinement

• Winner-take-all • Select disparities

• Left-Right consistency check• Occluded regions

• Weighted median filter• Noise removing

27

Page 28: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Winner-take-all

• Winner-take-all(WTA) strategy:

• Sd : the set of all possible disparities

• Cd : Aggregated cost

28

Page 29: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Left-right consistency check

• The disparity maps obtained at this stage contain errors in

the occluded regions.

• Perform Left-right consistency check

• A pixel in the left disparity map is marked as invalidated: when its value differs from the corresponding value of the pixel in the right

disparity map by a value greater than one

• Assign the minimum value between two closest validated pixels

29

Left image Right image

validated

invalidated

min

Page 30: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Weighted Median Filter

• Using a weighted median filter to :• Remove streak-like artifacts

• Remove the small amount of remaining noise

• Select bilateral filter weight to compute the weighted

median filter

• The validated pixels are not affected by this operation.

30

Page 31: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Consistency Map vs. Final disparity

31Invalidated pixels

Page 32: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

ExperimentalResults

32

Page 33: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Experimental Results

• Middlebury stereo evaluation• Middlebury dataset

• Real-world image• Camcorder data

• Execution time• CUDA implementation

33

Page 34: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Middlebury Evaluation - 1

• Adaptive Weight[4]

• 3535 support window with γs = 17 and γr = 7:5

• Cost-volume filtering[21]

• 19×19 support window and ε = 0:0004

• Geodesic Diffusion[27]

• Iterated n = 24 times with γc = 40 and l0 = 0:15

• InfoPermeable[31]

• Exponential function with σ = 25

• Proposed• σs=25 and σr=0.1

34Compare with the best-performing algorithm inspired by well-known edge-aware filters

[31] C. Cigla and A. A. Alatan, “Efficient Edge-Preserving Stereo Matching”, in ICCV Workshop on LDRMV, 2011.

Page 35: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Middlebury Evaluation - 1

• Compare the performance of the raw cost aggregation

• The same pixel-wise cost computation and disparity optimization steps were installed to ensure fair comparison.

• Select the TAD of the color and the gradient for computing matching costs

• { λ , Tc, Tg }={ 0.1, 7/255, 2/255 }

• Guidance image used for the aggregation stage:• Using 3x3 median filter

• Reduce the high-frequncy information that is not actually useful35

Page 36: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

36

Page 37: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Experimental Results

37

Only non-occluded and discontinuity regions

Page 38: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Middlebury Evaluation - 2

• Without refinement vs. with refinement

• { λ , Tc, Tg, σs , σr }={ 0.1, 7/255, 2/255, 45, 0.006 }

• 3x3 median filter• Filtering Guidance image used for the aggregation stage

• The weighted median filter• Used in disparity refinement stage

• r = 21, γs = 81, and γr = 0.04

38

Page 39: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Experimental Results

39

without refinement with refinement

Page 40: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Experimental Results

40

Page 41: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Experimental Results

41

Page 42: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Experimental Results

42

Page 43: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Real-world Image• Camcorder data:• Cafe (640360, 32 possible disparities)• Newspaper (512384, 32 possible disparities)• Book_Arrival (512384, 60 possible disparities)

43

Page 44: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Proposed vs. CostFilter[21]

44

Page 45: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Execution time

• Using C++

• PC with an AMD Athlon 64 X2 Dual Core 3800+ 2.00 Ghz.

• Measure only the execution time of the aggregation performing on the left view

• No occlusion handling or post-processing times were included.

45

Page 46: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Execution time

46

Support window size / number of iterations

Window: 2n+1 2n+1

Window: 2n+1 2n+1

Iteration times: n

Page 47: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

CUDA Implementation

47

Algorithm Time(s) Graphics Card Image

GeoDif 0.06 NVIDIA GeForce GTX 480 Tsukuba stereo pair

CostFilter 0.041 NVIDIA GeForce GTX 480 400300 image

Proposed 0.0095 NVIDIA GeForce GTX 460 Tsukuba stereo pair

Page 48: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Conclusion

48

Page 49: Domain Transformation-Based Efficient Cost Aggregation for Local Stereo Matching Cuong Cao Pham and Jae Wook Jeon, Member, IEEE IEEE Transactions on Circuits

Conclusion

• Solve the excessive time consumption bottleneck of adaptive-weight

• Integrates the appealing properties of domain transformation into the cost aggregation

• Using a sequence of 1D operations• Lower computational requirements

• Lower memory costs

• Fast and accurate local method

49