online 3d reconstruction using convex optimization

1
Online 3D Reconstruction Using Convex Optimization Gottfried Graber, Thomas Pock, Horst Bischof Institute for Computer Graphics and Vision, Graz University of Technology, Austria Introduction Live camera stream Online 3D reconstruction I Realtime camera tracking I Quasi-dense depthmaps I Robust iterative fusion I Volumetric representation via truncated signed distance function: u [-1, 1], Ω R 3 I Surface is the zero-level set of u I Real 3D geometry, i.e. topologically unconstrained Tracking I Parallel Tracking and Mapping (PTAM) of Klein & Murray [1] I High quality camera pose estimate through bundle adjustment Depthmap generation I GPU-accelerated multiview planesweep [2] I Normalized cross-correlation - well-textured scene required I Resulting depthmaps: noisy and incomplete PTAM realtime tracking Keyframe selection Depthmap Robust depthmap fusion I Method similar to [3] min u Z Ω |∇u | + λ N X i =1 Z Ω h (x , i )|u (x ) - d i |dx h (x , i ) . . . Histogram count of bin i at voxel x d i ...... Distance for histogram bin i I Convex energy: no initialization needed, guaranteed to find global minimum I Minimization with first order prima-dual algorithm of Chambolle & Pock [4] I Fully automatic pipeline, no user input required Signed distance fields I Convert depthmaps to signed distance fields f k I Histogram compression allows to store arbitrary number of distance fields with constant memory footprint K X k =1 Z Ω |u (x ) - f k (x )| dx N X i =1 Z Ω h (x , i )|u (x ) - d i | dx f k (x ) ... Signed distance field over voxel-grid on Ω K ... Number of signed distance fields N ... Number of histogram bins, N K δ w i f i +1 0 -1 η x Camera line-of-sight i r(x) Surface Generation of signed distance fields Visualization I GPU-accelerated raycaster to render iso-levels of u I Texture information: Project surface 3D points into nearest keyframes, compute median of grayvalues Virtual camera pose (green) and nearby keyframes (blue) from which texture information is taken. Results Frame 368, 0:14s Frame 1945, 1:17s Detail is added as camera explores Frame 821, 0:33s Frame 2142, 1:25s Missing detail due to lack of data Frame 1364, 0:55s Frame 2804, 1:50s Detail is added as camera explores Frame 1459, 0:58s Camera has not explored this region Textured result References [1] G. Klein and D. Murray, Parallel Tracking and Mapping for Small AR Workspaces, 6th IEEE and ACM International Symposium on Mixed and Augmented Reality, November 2007. [2] R. Collins, A space-sweep approach to true multi-image matching, In IEEE Computer Vision and Pattern Recogni- tion, June 1996. [3] C. Zach, Fast and high quality fusion of depth maps, Proc. 3DPVT, 2008. [4] A. Chambolle and T. Pock, A first-order primal-dual algorithm for convex problems with applications to imaging, J. Math. Imaging Vis., May 2011. http://www.gpu4vision.org This work was supported by the BRIDGE project HD-VIP (no. 827544)

Upload: others

Post on 12-Sep-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Online 3D Reconstruction Using Convex Optimization

Online 3D Reconstruction Using Convex OptimizationGottfried Graber, Thomas Pock, Horst Bischof

Institute for Computer Graphics and Vision, Graz University of Technology, Austria

Introduction

Live camera stream Online 3D reconstruction

I Realtime camera trackingI Quasi-dense depthmapsI Robust iterative fusionI Volumetric representation via truncated signed distance

function:u : Ω→ [−1, 1], Ω ⊆ R3

I Surface is the zero-level set of uI Real 3D geometry, i.e. topologically unconstrained

Tracking

I Parallel Tracking and Mapping (PTAM) of Klein & Murray [1]I High quality camera pose estimate through bundle adjustment

Depthmap generation

I GPU-accelerated multiview planesweep [2]I Normalized cross-correlation - well-textured scene requiredI Resulting depthmaps: noisy and incomplete

PTAM realtime tracking Keyframe selection Depthmap

Robust depthmap fusion

I Method similar to [3]

minu

Ω|∇u|+ λ

N∑i=1

∫Ω

h(x, i)|u(x)− di|dx

h(x, i) . . . Histogram count of bin i at voxel xdi . . . . . . Distance for histogram bin i

I Convex energy: no initialization needed, guaranteed to find global minimumI Minimization with first order prima-dual algorithm of Chambolle & Pock [4]I Fully automatic pipeline, no user input required

Signed distance fields

I Convert depthmaps to signed distance fields fk

I Histogram compression allows to store arbitrary number ofdistance fields with constant memory footprint

K∑k=1

∫Ω|u(x)− fk(x)| dx ≈

N∑i=1

∫Ω

h(x, i)|u(x)− di| dx

fk(x) . . . Signed distance field over voxel-grid on ΩK . . . Number of signed distance fieldsN . . . Number of histogram bins, N K

δ

wi

fi

+1

0

−1

η

x

Camera

line−of−sight

ir(x)

Surface

Generation of signed distance fields

Visualization

I GPU-accelerated raycaster to renderiso-levels of u

I Texture information: Project surface 3Dpoints into nearest keyframes, computemedian of grayvalues

Virtual camera pose (green) and nearby keyframes (blue) fromwhich texture information is taken.

Results

Frame 368, 0:14s

Frame 1945, 1:17s Detail isadded as camera explores

Frame 821, 0:33s

Frame 2142, 1:25s Missing detaildue to lack of data

Frame 1364, 0:55s

Frame 2804, 1:50s Detail isadded as camera explores

Frame 1459, 0:58s Camera hasnot explored this region

Textured result

References

[1] G. Klein and D. Murray, Parallel Tracking and Mapping for Small AR Workspaces, 6th IEEE and ACM International Symposium on Mixed and Augmented Reality,November 2007.[2] R. Collins, A space-sweep approach to true multi-image matching, In IEEE Computer Vision and Pattern Recogni- tion, June 1996.[3] C. Zach, Fast and high quality fusion of depth maps, Proc. 3DPVT, 2008.[4] A. Chambolle and T. Pock, A first-order primal-dual algorithm for convex problems with applications to imaging, J. Math. Imaging Vis., May 2011.

http://www.gpu4vision.org This work was supported by the BRIDGE project HD-VIP (no. 827544)