photo tourism - courses.cs.washington.edu · • enhanced morphing • scale up structure from...

21
Photo Tourism: Photo Tourism: Exploring Photo Collections in 3D Exploring Photo Collections in 3D Noah Snavely Steven M. Seitz University of Washington Richard Szeliski Microsoft Research © 2006 Noah Snavely 15,464 76,389 37,383 15,464 76,389 37,383 Reproduced with permission of Yahoo! Inc. © 2005 by Yahoo! Inc. YAHOO! and the YAHOO! logo are trademarks of Yahoo! Inc.

Upload: phamtram

Post on 06-Sep-2018

220 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Photo Tourism:Photo Tourism:Exploring Photo Collections in 3DExploring Photo Collections in 3D

Noah Snavely Steven M. Seitz

University of Washington

Richard Szeliski Microsoft Research

© 2006 Noah Snavely

15,464

76,389

37,383

15,464

76,389

37,383

Reproduced with permission of Yahoo! Inc. © 2005 by Yahoo! Inc.YAHOO! and the YAHOO! logo are trademarks of Yahoo! Inc.

Page 2: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Photo TourismPhoto Tourism Photo Tourism overviewPhoto Tourism overview

Scene reconstruction

Scene reconstruction

Photo ExplorerPhoto

ExplorerInput photographs Relative camera positions and orientations

Point cloud

Sparse correspondence

Related workRelated work• Image-based modeling

• Image-based rendering

Schaffalitzky and ZissermanECCV 2002

Brown and Lowe3DIM 2005

Debevec, et al.SIGGRAPH 1996

Aspen Movie MapLippman, et al., 1978

Photorealistic IBR:Levoy and Hanrahan, SIGGRAPH 1996Gortler, et al, SIGGRAPH 1996Seitz and Dyer, SIGGRAPH 1996Aliaga, et al, SIGGRAPH 2001

and many others

Related workRelated work

• Image browsing

Toyama, et al, Int. Conf. Multimedia, 2003

McCurdy and GriswoldMobisys 2003

Sivic and ZissermanICCV 2003

Page 3: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Photo Tourism overviewPhoto Tourism overview

Scene reconstruction

Scene reconstruction

Photo ExplorerPhoto

ExplorerInput photographs

Scene reconstructionScene reconstruction• Automatically estimate

– position, orientation, and focal length of cameras

– 3D positions of feature points

Feature detectionFeature detection

Pairwisefeature matching

Pairwisefeature matching

Incremental structure

from motion

Incremental structure

from motion

Correspondence estimation

Correspondence estimation

Feature detectionFeature detection

• Detect features using SIFT [Lowe, IJCV 2004]

Feature detectionFeature detection

• Detect features using SIFT [Lowe, IJCV 2004]

Page 4: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Pairwise feature matchingPairwise feature matching

• Match features between each pair of images

Pairwise feature matchingPairwise feature matching

• Refine matching using RANSAC [Fischler & Bolles 1987] to estimate fundamental matrices between pairs

Feature detectionFeature detection

Detect features using SIFT [Lowe, IJCV 2004]

Feature detectionFeature detection

Detect features using SIFT [Lowe, IJCV 2004]

Page 5: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Feature detectionFeature detection

Detect features using SIFT [Lowe, IJCV 2004]

Feature matchingFeature matching

Match features between each pair of images

Feature matchingFeature matchingRefine matching using RANSAC [Fischler & Bolles 1987] to estimate fundamental matrices between pairs

Correspondence estimationCorrespondence estimation

• Link up pairwise matches to form connected components of matches across several images

Image 1 Image 2 Image 3 Image 4

Page 6: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Structure from motionStructure from motion

Camera 1

Camera 2

Camera 3R1,t1

R2,t2R3,t3

p1

p4

p3

p2

p5

p6

p7

minimizef (R,T,P)

Incremental structure from motionIncremental structure from motion

Incremental structure from motionIncremental structure from motion Incremental structure from motionIncremental structure from motion

Page 7: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Incremental structure from motionIncremental structure from motion Incremental structure from motionIncremental structure from motion

Incremental structure from motionIncremental structure from motion Reconstruction performanceReconstruction performance

• For photo sets from the Internet, 20% to 75% of the photos were registered

• Most unregistered photos belonged to different connected components

• Running time: < 1 hour for 80 photos

> 1 week for 2600 photo

Page 8: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Photo Tourism overviewPhoto Tourism overview

Scene reconstruction

Scene reconstruction

Photo ExplorerPhoto

ExplorerInput photographs

Photo ExplorerPhoto Explorer

DemoDemo Photo Tourism overviewPhoto Tourism overview

Scene reconstruction

Scene reconstruction

Photo ExplorerPhoto

ExplorerInput photographs

• Navigation• Rendering• Annotations

Page 9: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Navigation controlsNavigation controls

• Free-flight navigation

• Object-based browsing

• Relation-based browsing

• Overhead map

ObjectObject--based browsingbased browsing

ObjectObject--based browsingbased browsing

• Visibility

• Resolution

• Head-on view

RelationRelation--based browsingbased browsing

Zoom in

Zoom out

Move left Move right

Find all details Find all zoom outs

Find all similar images

Page 10: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

RelationRelation--based browsingbased browsing

is to the left of

is detail of

is a zoom out of

Image A Image C

Image D

Image B

RelationRelation--based browsingbased browsing

is to the left of

is detail of

is a zoom out of

Image A Image C

Image D

Image B

RelationRelation--based browsingbased browsing

Image A

Image B

RelationRelation--based browsingbased browsing

Image A

Image B

Page 11: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

RelationRelation--based browsingbased browsing

Image A

Image C

Image B

to the right of

RelationRelation--based browsingbased browsing

Image A

Image C

Image B

to the right of

RelationRelation--based browsingbased browsing

Image A

Image C

Image D

Image B

to the right of

is detail of

RelationRelation--based browsingbased browsing

Image A

Image C

Image D

Image B

to the right of

is detail of

Page 12: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

RelationRelation--based browsingbased browsing

Image A

Image C

Image D

Image B

to the right of

is detail ofis zoom-out of is detail of

to the left of

is zoom-out of

is detail of

is detail of is zoom-out of

Overhead mapOverhead map

Rome

Prague

Prague Old Town SquarePrague Old Town Square

Page 13: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Photo Tourism overviewPhoto Tourism overview

Scene reconstruction

Scene reconstruction

Photo ExplorerPhoto

ExplorerInput photographs

• Navigation• Rendering• Annotations

RenderingRendering

RenderingRendering RenderingRendering

Page 14: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Rendering transitionsRendering transitions Rendering transitionsRendering transitions

Rendering transitionsRendering transitions Rendering transitionsRendering transitions

Camera A Camera B

Page 15: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Photo Tourism overviewPhoto Tourism overview

Scene reconstruction

Scene reconstruction

Photo ExplorerPhoto

ExplorerInput photographs

• Navigation• Rendering• Annotations

Rome

Prague

Paris

AnnotationsAnnotations AnnotationsAnnotations

Reproduced with permission of Yahoo! Inc. © 2005 by Yahoo! Inc.YAHOO! and the YAHOO! logo are trademarks of Yahoo! Inc.

Page 16: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

AnnotationsAnnotations

Rome

PragueParis

Yosemite

YosemiteYosemite YosemiteYosemite

Topographic data courtesy USGS

Page 17: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

ContributionsContributions

• Automated system for registering photo collections in 3D for interactive exploration

• Structure from motion algorithm demonstrated on hundreds of photos from the Internet

• Photo exploration system combining new image-based rendering and photo navigation techniques

Limitations / Future workLimitations / Future work

• Not all photos can be reliably matched

Better feature detection / matching

Integrating GPS & other localization info.

• Structure from motion scalability

More efficient (sparse) algorithms

• Plane-based transitions lack parallax

Limitations / Future workLimitations / Future work Limitations / Future workLimitations / Future work

• Not all photos can be reliably matched

Better feature detection / matching

Integrating GPS & other localization info.

• Structure from motion scalability

More efficient (sparse) algorithms

• Plane-based transitions lack parallax

Richer transitions

• Photo explorer scalability…

Page 18: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Future workFuture work

• Photo explorer scalability– Design client-server architecture for streaming

images and geometry at required resolution

– Scale to all of the world’s photos (and videos…)

– Photosynth project at Microsoft Live Labs(live demo)

PhotosynthPhotosynth

Future workFuture work

• Photo explorer scalability– Scale to all of the world’s photos (and videos…)

– Computational complexity: avoid matching all images to all other images

• vocabulary trees [Nister]

• graphical models, nested dissection [Dellaert]

• serendipitous (?) probabilistic inference

AcknowledgementsAcknowledgements

• National Science Foundation

• Achievement Rewards for College Scientists (ARCS)

• The many people who allowed use of their photos

• UW GRAIL Lab

• MSR Interactive Visual Media Lab

• Kevin Chiu and Andy Hou for writing the Java applet

Page 19: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

ConclusionConclusionIndexing the world’s photos in 3D provides a new way to share and experience our worldTo find out more:– http://phototour.cs.washington.edu– http://research.microsoft.com/IVM/PhotoTourism– http://labs.live.com/photosynth

Saint Basil's Cathedral Mount RushmoreTrafalgar Square Rockefeller Center

StatisticsStatistics

Trevi Fountain 466 360

Yosemite 325 1,893

Notre Dame 597 2,635

Prague 197 235

Great Wall 82 120

Trafalgar Square 278 1,893

Dataset # input # registered

Reconstruction running timeReconstruction running time

• Great Wall: 82 / 120 photos registeredRunning time: ~ 3 hours

• Notre Dame: 597 / 2,635 photos registeredRunning time: ~ 2 weeks

Page 20: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

Future workFuture work

• Incorporate other metadata (e.g., time, photographer) and media (e.g., panoramas, video)

• Enhanced morphing

• Scale up structure from motion algorithm

VisibilityVisibility

VisibilityVisibility Advantages of 3D over 2DAdvantages of 3D over 2D

• 3D geometry has multi-image consistency

• Can annotate point cloud directly

• Can import annotations from georeferencedsources (e.g., landmark databases)

• Can use depth as cue for rejecting outliers in selection

Page 21: Photo Tourism - courses.cs.washington.edu · • Enhanced morphing • Scale up structure from motion algorithm Visibility Visibility Advantages of 3D over 2D

PostPost--processing the reconstructionprocessing the reconstruction

• Compute gravity direction

• Center point cloud at the origin

• Scale model to unit variance