
Data-driven methods: Video Textures

CS129 Computational Photography
James Hays, Brown, Fall 2012
Many slides from Alexei Efros

Weather Forecasting for Dummies™

Let’s predict the weather:

• Given only today’s weather, we want to know tomorrow’s
• Suppose the weather can only be {Sunny, Cloudy, Raining}

The “Weather Channel” algorithm:

• Over a long period of time, record:
  – How often S is followed by R
  – How often S is followed by S
  – Etc.
• Compute percentages for each state: P(R|S), P(S|S), etc.
• Predict the state with the highest probability!
• It’s a Markov Chain

[Slide figure: a 3×3 transition-probability matrix, each row summing to 1:
  0.2  0.4  0.4
  0.4  0.3  0.3
  0.3  0.6  0.1]

Markov Chain
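The “Weather Channel” recipe above can be sketched directly: store a transition table, then repeatedly sample tomorrow’s state given today’s. A minimal sketch; the matrix values below are illustrative placeholders, not the slide’s figures:

```python
import random

# Illustrative P(tomorrow | today); each row sums to 1.
STATES = ["Sunny", "Cloudy", "Raining"]
TRANSITIONS = {
    "Sunny":   [0.6, 0.3, 0.1],
    "Cloudy":  [0.3, 0.4, 0.3],
    "Raining": [0.2, 0.4, 0.4],
}

def simulate(start, days, seed=0):
    """Sample a weather sequence by repeatedly drawing tomorrow's state."""
    rng = random.Random(seed)
    today, history = start, [start]
    for _ in range(days):
        today = rng.choices(STATES, weights=TRANSITIONS[today])[0]
        history.append(today)
    return history

print(simulate("Sunny", 7))
```

Because the next state depends only on the current one, this is exactly a first-order Markov chain.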

What if we know today and yesterday’s weather?

Second Order Markov Chain

[Slide figure: a transition table whose rows are the observed pair (yesterday, today) — R,R; R,C; R,S; C,R; C,C; C,S; S,R; S,C; S,S — and whose columns are tomorrow’s state R, C, S.]

Text Synthesis

[Shannon, ’48] proposed a way to generate English-looking text using N-grams:

• Assume a generalized Markov model
• Use a large text to compute probability distributions of each letter given the N-1 previous letters
• Starting from a seed, repeatedly sample this Markov chain to generate new letters
• Also works for whole words

WE NEED TO EAT CAKE
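Shannon’s procedure can be sketched at the character level: count which letter follows each (N-1)-letter context, then sample. A minimal sketch with a toy corpus; the function names are mine, not Shannon’s:

```python
import random
from collections import defaultdict

def build_ngram_model(text, n):
    """Map each (n-1)-letter context to the letters observed after it."""
    model = defaultdict(list)
    for i in range(len(text) - n + 1):
        context, nxt = text[i:i + n - 1], text[i + n - 1]
        model[context].append(nxt)
    return model

def sample(model, n, seed_text, length, seed=0):
    """Starting from a seed, repeatedly sample the Markov chain."""
    rng = random.Random(seed)
    out = seed_text
    while len(out) < length:
        followers = model.get(out[-(n - 1):])
        if not followers:      # dead end: context never seen in the corpus
            break
        out += rng.choice(followers)
    return out

corpus = "WE NEED TO EAT CAKE. WE NEED TO EAT NOW."
model = build_ngram_model(corpus, n=3)
print(sample(model, 3, "WE", 30))
```

Every N-gram the sampler emits was observed somewhere in the corpus, which is why the output looks locally plausible but can drift globally — exactly the effect in the examples below.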

Mark V. Shaney (Bell Labs)

Results (using the alt.singles corpus):

• “As I’ve commented before, really relating to someone involves standing next to impossible.”
• “One morning I shot an elephant in my arms and kissed him.”
• “I spent an interesting evening recently with a grain of salt.”

Data Driven Philosophy

The answer to most non-trivial questions is “Let’s see what the data says,” rather than “Let’s build a parametric model” or “Let’s simulate it from the ground up.”

This can seem unappealing in its simplicity (see Chinese Room thought experiment), but there are still critical issues of representation, generalization, and data choices.

In the end, data driven methods work very well.

“Every time I fire a linguist, the performance of our speech recognition system goes up.”
– Fred Jelinek (Head of IBM’s Speech Recognition Group), 1988

Video Textures

Arno Schödl

Richard Szeliski

David Salesin

Irfan Essa

Microsoft Research, Georgia Tech

Still photos

Video clips

Video textures

Problem statement

video clip → video texture

Our approach

• How do we find good transitions?

Finding good transitions

Compute the L2 distance D_ij between all pairs of frames

Similar frames make good transitions

frame i vs. frame j

Markov chain representation

[Slide figure: frames 1–4 as states of a Markov chain, with arcs for the possible transitions]

Similar frames make good transitions

Transition costs

• Transition from i to j if the successor of i is similar to j
• Cost function: C_ij = D_{i+1, j}

Transition probabilities

• Probability of transition P_ij is inversely related to cost:
• P_ij ~ exp( – C_ij / σ² )

[Slide figure: sequences generated with high vs. low σ]
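The distance → cost → probability pipeline can be sketched as follows. Assumptions of this sketch: frames are represented as small feature vectors rather than full images, and σ is set to the mean cost (a simple heuristic, not the paper’s exact choice):

```python
import numpy as np

# Toy "frames": one feature vector per frame (real video textures
# compare pixels; 6 frames of 4 features keep the example small).
rng = np.random.default_rng(0)
frames = rng.random((6, 4))

# D[i, j] = L2 distance between frame i and frame j
D = np.linalg.norm(frames[:, None, :] - frames[None, :, :], axis=2)

# Transition cost: jumping from i to j is good if frame i's successor
# resembles frame j, so C[i, j] = D[i+1, j]  (defined for i = 0 .. n-2).
C = D[1:, :]

# Probabilities: P[i, j] ~ exp(-C[i, j] / sigma), normalized per row.
sigma = C.mean()
P = np.exp(-C / sigma)
P /= P.sum(axis=1, keepdims=True)
```

A small σ sharpens the distribution toward only the very best transitions; a large σ allows more variety at the price of visible jumps.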

Preserving dynamics

• Cost for transition i → j:
• C_ij = Σ_{k=-N}^{N-1} w_k · D_{i+k+1, j+k}

[Slide figure: a window of frames around the transition is matched, pairing D_{i-1, j-2}, D_{i, j-1}, D_{i+1, j}, D_{i+2, j+1}]

Preserving dynamics – effect

• Cost for transition i → j:
• C_ij = Σ_{k=-N}^{N-1} w_k · D_{i+k+1, j+k}
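The windowed cost above can be sketched as below. The binomial-style weights are one reasonable choice for w_k assumed here; transitions too close to the sequence ends are simply left invalid:

```python
import numpy as np

def filtered_cost(D, N=2):
    """Windowed transition cost:
        C_ij = sum_{k=-N}^{N-1} w_k * D[i+k+1, j+k]
    so a jump i -> j is cheap only when whole subsequences match,
    not just a single pair of frames."""
    n = D.shape[0]
    w = np.array([1.0, 2.0, 2.0, 1.0]) if N == 2 else np.ones(2 * N)
    w /= w.sum()
    C = np.full((n, n), np.inf)            # edge transitions stay invalid
    for i in range(N, n - N):              # keeps i+k+1 inside [0, n-1]
        for j in range(N + 1, n - N + 1):  # keeps j+k inside [0, n-1]
            C[i, j] = sum(w[k + N] * D[i + k + 1, j + k]
                          for k in range(-N, N))
    return C

# Toy distance matrix between 8 frames
D = np.abs(np.random.default_rng(0).random((8, 8)))
C = filtered_cost(D)
```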

Dead ends

• No good transition at the end of the sequence

Future cost

• Propagate future transition costs backward
• Iteratively compute the new cost:
• F_ij = C_ij + min_k F_jk
• This is a form of Q-learning
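The backward propagation is a small value iteration. The discount factor alpha below is an assumption of this sketch, added so the fixed point stays finite (the paper damps future costs similarly):

```python
import numpy as np

def future_cost(C, alpha=0.99, iters=200):
    """Iterate F_ij = C_ij + alpha * min_k F_jk to a fixed point.
    alpha < 1 discounts costs from the far future, making the
    update a contraction that converges."""
    F = C.copy()
    for _ in range(iters):
        # min_k F[j, k] is a per-row minimum, indexed by the target frame j
        F = C + alpha * F.min(axis=1)[None, :]
    return F

C = np.array([[1.0, 2.0],
              [3.0, 0.5]])
F = future_cost(C, alpha=0.9)
```

Sampling transitions by F instead of C steers the sequence away from dead ends before it reaches them.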

Future cost – effect

Finding good loops

• Alternative to random transitions

• Precompute set of loops up front

Video portrait

• Useful for web pages

Region-based analysis

• Divide video up into regions

• Generate a video texture for each region

Automatic region analysis

Discussion

• Some things are relatively easy
• Some are hard

Michel Gondry train video

http://www.youtube.com/watch?v=0S43IwBF0uM
