1 slides for the book: probabilistic robotics authors: sebastian thrun wolfram burgard dieter fox...

36
1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more slides: http://www.probabilistic-robotics.org/

Upload: annice-golden

Post on 17-Dec-2015

219 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

1

Slides for the book:Probabilistic Robotics

•Authors:• Sebastian Thrun• Wolfram Burgard• Dieter Fox

•Publisher:• MIT Press, 2005.

•Web site for the book & more slides:http://www.probabilistic-robotics.org/

Page 2: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

Probabilistic Robotics

Introduction

ProbabilitiesBayes rule

Bayes filters

Page 3: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

3

Probabilistic Robotics

Key idea: Explicit representation of uncertainty using the calculus of probability theory

• Perception = state estimation• Action = utility

optimization

Page 4: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

4

Pr(A) denotes probability that proposition A is true.

Axioms of Probability Theory

1)Pr(0 A

1)Pr( True

)Pr()Pr()Pr()Pr( BABABA

0)Pr( False

Page 5: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

5

A Closer Look at Axiom 3

B

BA A BTrue

)Pr()Pr()Pr()Pr( BABABA

Page 6: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

6

Using the Axioms

)Pr(1)Pr(

0)Pr()Pr(1

)Pr()Pr()Pr()Pr(

)Pr()Pr()Pr()Pr(

AA

AA

FalseAATrue

AAAAAA

Page 7: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

7

Discrete Random Variables

• X denotes a random variable.

• X can take on a countable number of values in {x1, x2, …, xn}.

• P(X=xi), or P(xi), is the probability that the random variable X takes on value xi.

• P( ) is called probability mass function.

• E.g. 02.0,08.0,2.0,7.0)( RoomP

.

Page 8: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

8

Continuous Random Variables

• X takes on values in the continuum.

• p(X=x), or p(x), is a probability density function.

• E.g.

b

a

dxxpbax )()),(Pr(

x

p(x)

Page 9: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

9

Joint and Conditional Probability

• P(X=x and Y=y) = P(x,y)

• If X and Y are independent then P(x,y) = P(x) P(y)

• P(x | y) is the probability of x given yP(x | y) = P(x,y) / P(y)P(x,y) = P(x | y) P(y)

• If X and Y are independent thenP(x | y) = P(x)

Page 10: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

10

Law of Total Probability, Marginals

y

yxPxP ),()(

y

yPyxPxP )()|()(

x

xP 1)(

Discrete case

1)( dxxp

Continuous case

dyypyxpxp )()|()(

dyyxpxp ),()(

Page 11: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

11

Bayes Formula

evidence

prior likelihood

)(

)()|()(

)()|()()|(),(

yP

xPxyPyxP

xPxyPyPyxPyxP

Page 12: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

12

Normalization

)()|(

1)(

)()|()(

)()|()(

1

xPxyPyP

xPxyPyP

xPxyPyxP

x

yx

xyx

yx

yxPx

xPxyPx

|

|

|

aux)|(:

aux

1

)()|(aux:

Algorithm:

Page 13: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

13

Conditioning

• Law of total probability:

dzyzPzyxPyxP

dzzPzxPxP

dzzxPxP

)|(),|()(

)()|()(

),()(

Page 14: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

14

Bayes Rule with Background Knowledge

)|(

)|(),|(),|(

zyP

zxPzxyPzyxP

Page 15: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

15

Conditioning

• Total probability:

dzzPzyxPyxP

dzzPzxPxP

dzzxPxP

)(),|()(

)()|()(

),()(

Page 16: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

16

Conditional Independence

)|()|(),( zyPzxPzyxP

),|()( yzxPzxP

),|()( xzyPzyP

equivalent to

and

Page 17: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

17

Simple Example of State Estimation

• Suppose a robot obtains measurement z

• What is P(open|z)?

Page 18: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

18

Causal vs. Diagnostic Reasoning

•P(open|z) is diagnostic.

•P(z|open) is causal.

•Often causal knowledge is easier to obtain.

•Bayes rule allows us to use causal knowledge:

)()()|(

)|(zP

openPopenzPzopenP

count frequencies!

Page 19: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

19

Example

• P(z|open) = 0.6 P(z|open) = 0.3

• P(open) = P(open) = 0.5

67.03

2

5.03.05.06.0

5.06.0)|(

)()|()()|(

)()|()|(

zopenP

openpopenzPopenpopenzP

openPopenzPzopenP

• z raises the probability that the door is open.

Page 20: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

20

Combining Evidence

•Suppose our robot obtains another observation z2.

•How can we integrate this new information?

•More generally, how can we estimateP(x| z1...zn )?

Page 21: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

21

Recursive Bayesian Updating

),,|(

),,|(),,,|(),,|(

11

11111

nn

nnnn

zzzP

zzxPzzxzPzzxP

Markov assumption: zn is independent of z1,...,zn-1 if we know x.

)()|(

),,|()|(

),,|(

),,|()|(),,|(

...1...1

11

11

111

xPxzP

zzxPxzP

zzzP

zzxPxzPzzxP

ni

in

nn

nn

nnn

Page 22: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

22

Example: Second Measurement

• P(z2|open) = 0.5 P(z2|open) = 0.6

• P(open|z1)=2/3

625.08

5

31

53

32

21

32

21

)|()|()|()|(

)|()|(),|(

1212

1212

zopenPopenzPzopenPopenzP

zopenPopenzPzzopenP

• z2 lowers the probability that the door is open.

Page 23: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

23

A Typical Pitfall

•Two possible locations x1 and x2

•P(x1)=0.99

•P(z|x2)=0.09 P(z|x1)=0.07

0

0.1

0.2

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

5 10 15 20 25 30 35 40 45 50

p( x

| d)

Number of integrations

p(x2 | d)p(x1 | d)

Page 24: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

24

Actions

•Often the world is dynamic since• actions carried out by the robot,• actions carried out by other agents,• or just the time passing by

change the world.

•How can we incorporate such actions?

Page 25: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

25

Typical Actions

• The robot turns its wheels to move

• The robot uses its manipulator to grasp an object

• Plants grow over time…

• Actions are never carried out with absolute certainty.

• In contrast to measurements, actions generally increase the uncertainty.

Page 26: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

26

Modeling Actions

•To incorporate the outcome of an action u into the current “belief”, we use the conditional pdf

P(x|u,x’)

•This term specifies the pdf that executing u changes the state from x’ to x.

Page 27: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

27

Example: Closing the door

Page 28: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

28

State Transitions

P(x|u,x’) for u = “close door”:

If the door is open, the action “close door” succeeds in 90% of all cases.

open closed0.1 1

0.9

0

Page 29: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

29

Integrating the Outcome of Actions

')'()',|()|( dxxPxuxPuxP

)'()',|()|( xPxuxPuxP

Continuous case:

Discrete case:

Page 30: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

30

Example: The Resulting Belief

)|(1161

83

10

85

101

)(),|(

)(),|(

)'()',|()|(

1615

83

11

85

109

)(),|(

)(),|(

)'()',|()|(

uclosedP

closedPcloseduopenP

openPopenuopenP

xPxuopenPuopenP

closedPcloseduclosedP

openPopenuclosedP

xPxuclosedPuclosedP

Page 31: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

31

Bayes Filters: Framework

• Given:• Stream of observations z and action data u:

• Sensor model P(z|x).• Action model P(x|u,x’).• Prior probability of the system state P(x).

• Wanted: • Estimate of the state X of a dynamical system.• The posterior of the state is also called Belief:

),,,|()( 11 tttt zuzuxPxBel

},,,{ 11 ttt zuzud

Page 32: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

32

Markov Assumption

Underlying Assumptions• Static world• Independent noise• Perfect model, no approximation errors

),|(),,|( 1:1:11:1 ttttttt uxxpuzxxp )|(),,|( :1:1:0 tttttt xzpuzxzp

Page 33: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

33111 )(),|()|( ttttttt dxxBelxuxPxzP

Bayes Filters

),,,|(),,,,|( 1111 ttttt uzuxPuzuxzP Bayes

z = observationu = actionx = state

),,,|()( 11 tttt zuzuxPxBel

Markov ),,,|()|( 11 tttt uzuxPxzP

Markov11111 ),,,|(),|()|( tttttttt dxuzuxPxuxPxzP

1111

111

),,,|(

),,,,|()|(

ttt

ttttt

dxuzuxP

xuzuxPxzP

Total prob.

Markov111111 ),,,|(),|()|( tttttttt dxzzuxPxuxPxzP

Page 34: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

34

Bayes Filter Algorithm

1. Algorithm Bayes_filter( Bel(x),d ):2. 0

3. If d is a perceptual data item z then4. For all x do5. 6. 7. For all x do8.

9. Else if d is an action data item u then10. For all x do11.

12. Return Bel’(x)

)()|()(' xBelxzPxBel )(' xBel

)(')(' 1 xBelxBel

')'()',|()(' dxxBelxuxPxBel

111 )(),|()|()( tttttttt dxxBelxuxPxzPxBel

Page 35: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

35

Bayes Filters are Familiar!

• Kalman filters

• Particle filters

• Hidden Markov models

• Dynamic Bayesian networks

• Partially Observable Markov Decision Processes (POMDPs)

111 )(),|()|()( tttttttt dxxBelxuxPxzPxBel

Page 36: 1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, 2005. Web site for the book & more

36

Summary

•Bayes rule allows us to compute probabilities that are hard to assess otherwise.

•Under the Markov assumption, recursive Bayesian updating can be used to efficiently combine evidence.

•Bayes filters are a probabilistic tool for estimating the state of dynamic systems.