dodecapen: accurate 6dof tracking of a passive stylus

Post on 14-May-2022

3 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

DodecaPen: Accurate 6DoF Tracking of a Passive Stylus

Po-Chen Wu*†, Robert Wang†, Kenrick Kin†, Christopher Twigg†, Shangchen Han†

Ming-Hsuan Yang‡, Shao-Yi Chien*

*National Taiwan University, †Oculus Research, ‡UC Merced

Outline

Introduction

Proposed System

Experimental Results

Conclusion

@Students in AI WorkshopDodecaPen 2

Outline

Introduction

Proposed System

Experimental Results

Conclusion

@Students in AI WorkshopDodecaPen 3

Introduction

Our system can track the 6DoF pose of a calibrated pen from a single camera with submillimeter accuracy.

@Students in AI WorkshopDodecaPen 4

@Students in AI WorkshopDodecaPen 5

@Students in AI WorkshopDodecaPen 6

Why Dodecahedron?

To prevent pose jumping (due to coplanar points)

@Students in AI WorkshopDodecaPen 7

Explanation of Pose Jumping (1/2)

@Students in AI WorkshopDodecaPen 8

Explanation of Pose Jumping (2/2)

Multiple local minima of a cost function (due to coplanar points)

@Students in AI WorkshopDodecaPen 9

Co

st

Pose

OriginalImage

NoisyImage

Outline

Introduction

Proposed System

Experimental Results

Conclusion

@Students in AI WorkshopDodecaPen 10

Approximate Pose Estimation (APE)

• Marker Detection

• Minimize reprojection error 𝐸𝑟(𝐩) with P𝑛P algorithm to get the initial pose 𝐩′

𝐸𝑟 𝐩 =1

𝑛

𝑖=1

𝑛

ෝ𝐮𝑖 − 𝐮𝑖 𝐱𝑖; 𝐩2

11

ෝ𝐮: detected point in the camera image𝐱: point on the dodecahedron𝐮: transformed 𝐱 point in the camera image𝐩: pose (including rotation matrix 𝑹 and translation vector 𝐭)

Inter-frame Corner Tracking (ICT)

• Pyramidal Lucas-Kanade marker corner tracking

• P𝑛P algorithm to get the initial pose 𝐩′

12

If APE does not succeed…

Marker Intensity Normalization

• We normalize the intensity to ensure intensity invariance before minimizing the residual between the model and image.

13

Dense Pose Refinement (DPR)

• Minimize appearance distance 𝐸𝑎(𝐩) with Gauss Newton and backtracking line search (BLS) to get the final pose 𝐩∗

𝐸𝑎 𝐩 =1

𝑛

𝑖=1

𝑛

𝐼𝑐 𝐮𝑖 𝐱𝑖; 𝐩 − 𝑂𝑡 𝐱𝑖2

14

𝐼𝑐: camera image𝑂𝑡: target object𝐱: point on the dodecahedron𝐮: transformed 𝐱 point in the camera frame𝐩: pose (including rotation matrix 𝑹 and translation vector 𝐭)

Marker MipmapsMipmap Masks

Masked Mipmaps

Vectorization

Gauss-Newton Iteration

Δ𝐩∗ = argminΔ𝐩

1

𝑛

𝑖=1

𝑛

𝐼𝑐 𝐮𝑖 𝐩′ + Δ𝐩 − 𝑂𝑡 𝐱𝑖2

≈ argminΔ𝐩

1

𝑛

𝑖=1

𝑛

𝐼𝑐 𝐮𝑖 𝐩′ + ቤ𝜕𝐼𝑐𝜕𝐩

𝐩=𝐩′

Δ𝐩 − 𝑂𝑡 𝐱𝑖

2

15

𝐸𝑎 𝐩 =1

𝑛

𝑖=1

𝑛

𝐼𝑐 𝐮𝑖 𝐩 − 𝑂𝑡 𝐱𝑖2

𝐉𝑐Δ𝐩 = 𝐎𝑡 − 𝐈𝑐ቤ𝜕𝐈𝑐𝜕𝐩

𝐩=𝐩′

≡ 𝐉𝑐 𝐸𝑎′ 𝐩 = 0

Δ𝐩 = 𝐉𝑐T𝐉𝑐

−1𝐉𝑐T(𝐎𝑡 − 𝐈𝑐)

Backtracking Line Search (BLS)

Gauss-Newton iteration does not always converge with a fixed step size since our least squares problem is nonlinear.

We shrink Δ𝐩 by Δ𝐩 ← αΔ𝐩 until it meets the Armijo-Goldstein condition below:

𝐸𝑎 𝐩′ + Δ𝐩 ≤ 𝐸𝑎 𝐩′ + 𝑐𝛻𝐸𝑎 𝐩′ TΔ𝐩

𝛻𝐸𝑎 𝐩′ is the local function gradient

α = 0.5 , 𝑐 = 10−4

@Students in AI WorkshopDodecaPen 16

Dodecahedron Calibration (DC)

Determine the precise pose of each marker with respect to the dodecahedron

One-time offline bundle adjustment

𝐸𝑎 𝐩𝑗 , 𝐩𝑘 =

𝑖

𝑗

𝑘

𝐼𝑐 𝐮𝑖 𝐱𝑖; 𝐩𝑗; 𝐩𝑘 − 𝑂𝑡 𝐱𝑖

2

17

𝐼𝑐: camera image𝑂𝑡: target object𝐱: point on the dodecahedron𝐮: transformed 𝐱 point in the camera frame𝐩: dodecahedron pose𝐩: marker pose

Outline

Introduction

Proposed System

Experimental Results

Conclusion

@Students in AI WorkshopDodecaPen 18

Evaluation with Synthetic Data

We generate synthetic image sequences with 24 motion patterns of the virtual DodecaPen Each sequence consists of 301 frames with the same

resolution (1280*1024) and intrinsics as our real camera.

We further evaluate the proposed approaches under varying shot noise, spatial blur, camera resolutions, and mask kernel widths.

@Students in AI WorkshopDodecaPen 19

@Students in AI WorkshopDodecaPen 20

Pen-tip Trajectories

@Students in AI WorkshopDodecaPen 21

@Students in AI WorkshopDodecaPen 22

Evaluation with Real Data

Four real drawings

Compare with a motion capture system Constructed with 16 OptiTrack Prime 17W (1.7 megapixels, 70 degrees field-

of-view) cameras

@Students in AI WorkshopDodecaPen 23

24

DodecaPen VS. Mocap

@Students in AI WorkshopDodecaPen 25

The accuracy of the proposed method is comparable to a motion capture system with 10 active cameras.

TW$ 10,000 VS. TW$ 1,000,000

Outline

Introduction

Proposed System

Experimental Results

Conclusion

@Students in AI WorkshopDodecaPen 26

Conclusion

6DoF tracking using a set of readily available and easy-to-assemble components

Sub-millimeter accurate

Single camera pose estimation can be fast enough and robust enough for drawing in 2D, 3D and in VR

@Students in AI WorkshopDodecaPen 27

Potential AI Research Topics

General Object Pose Tracking

@Students in AI WorkshopDodecaPen 28

top related