1 exact inference algorithms bucket-elimination and more compsci 179, spring 2010 set 8: rina...

Exact Inference Algorithms Bucket-elimination and more

COMPSCI 179, Spring 2010Set 8: Rina Dechter

(Reading: chapter 14, Russell and Norvig

Counting

1 2 3 4

4 3 2 155

How many people?

SUM operatorCHAIN structure

Maximization

What is the maximum?

MAX operatorTREE structure

12” 14” 15”

I II III

P60G80G

Min-Cost Assignment

What is minimum cost configuration?

I 30 50

II 40 55

III ∞ 60

I II III

12” 45 ∞ ∞

14” 50 60 70

15” ∞ 65 8060G

50 ∞

I II III

12” 75 ∞ ∞

14” 80 100 130

15” ∞ 105 140

MIN-SUM operatorsCHAIN structure

Belief Updating

Buzzsound

Mechanical problem

Hightemperature

Faultyhead

Readdelays

H P(H)

0 .91 .1

F P(F)0 .991 .01

H F M P(M|H,F)

0 0 0 .90 0 1 .10 1 0 .10 1 1 .91 0 0 .81 0 1 .21 1 0 .011 1 1 .99

F R P(R|F)0 0 .80 1 .21 0 .30 1 .7

P(F | B=1) = ?

M h1(M)

0 .051 .8

H F M Bel(M,H,F)

0 0 0 .04050 0 1 .0720 1 0 .00450 1 1 .6481 0 0 .0041 0 1 .0081 1 0 .000051 1 1 .0792

H h2(H)0 .91 .1

F h3(F)0 .12451 .7317

F h4(F)0 11 1

H F M P(M|H,F)

0 0 0 .90 0 1 .10 1 0 .10 1 1 .91 0 0 .81 0 1 .21 1 0 .011 1 1 .99

M B P(B|M)0 0 .950 1 .051 0 .21 1 .8

* * =F P(F,B=1

)0 .1232551 .073175

P(B=1) = .19643

Probability of evidence

P(F=1|B=1) = .3725

Updated belief

SUM-PROD operatorsPOLY-TREE structure

P(h,f,r,m,b) = P(h) P(f) P(m|h,f) P(r|f) P(b|m)

T R L M

)(XmZX

)(XmXZ

)(ZmZM)(ZmZL

)(ZmMZ)(ZmLZ

)(XmYX

)(XmXY

)(YmTY

)(YmYT

)(YmRY

)(YmYR

ZLZMZZX

(Z)m(Z)mXZP(X)m

ZLP(Z)m

ZMP(Z)m

Belief updating (sum-prod)

XLZXZZM

XMZXZZL

(Z)mXmXZPZm

(X)mXPXm

)()|()(

T R L M

)(XmZX

)(XmXZ

)(ZmZM)(ZmZL

)(ZmMZ)(ZmLZ

)(XmYX

)(XmXY

)(YmTY

)(YmYT

)(YmRY

)(YmYR

(Z)m(Z)mXZP(X)m

ZLP(Z)m

ZMP(Z)m

)|(max

MPE (max-prod)

(Z)mXmXZPZm

(X)mXPXm

)()|(max)(

CSP – consistency (projection-join)

T R L M

)(XmZX

)(ZmMZ)(ZmLZ

)(XmYX

)(YmTY )(YmRY

(Z)λ(Z)λZXR(X)λ

LZR(Z)λ

MZR(Z)λ

T R L M

)(XmZX

)(ZmMZ)(ZmLZ

)(XmYX

)(YmTY )(YmRY

LLZMZZX

(X)m(X)msol

(Z)m(Z)mZXR(X)m

LZR(Z)m

MZR(Z)m

#CSP (sum-prod)

T R L M

)(XmZX

)(XmXZ

)(ZmZM)(ZmZL)(ZmMZ)(ZmLZ

)(XmYX

)(XmXY

)(YmTY

)(YmYT)(YmRY

)(YmYR

Tree-solving

ZLZMZZX

(Z)m(Z)mXZP(X)m

ZLP(Z)m

ZMP(Z)m

XLZXZZM

XMZXZZL

(Z)mXmXZPZm

(X)mXPXm

)()|()(

Belief updating (sum-prod)

MPE (max-prod)

(Z)m(Z)mXZP(X)m

ZLP(Z)m

ZMP(Z)m

)|(max

(Z)mXmXZPZm

(X)mXPXm

)()|(max)(

CSP – consistency (projection-join)

(Z)λ(Z)λZXR(X)λ

LZR(Z)λ

MZR(Z)λ

#CSP (sum-prod)

LLZMZZX

(X)m(X)msol

(Z)m(Z)mZXR(X)m

LZR(Z)m

MZR(Z)m

Belief Propagation

• Instances of tree message passing algorithm

• Exact for trees

• Linear in the input size

• Importance:– One of the first algorithms for inference in Bayesian networks– Gives a cognitive dimension to its computations – Basis for conditioning algorithms for arbitrary Bayesian network– Basis for Loopy Belief Propagation (approximate algorithms)

[Pearl, 1988]

Exact Inference Algorithms Bucket-elimination

COMPSCI 179, Spring 2010Set 8: Rina Dechter

(Reading: chapter 14, Russell and Norvig

Belief Updating

lung Cancer

Smoking

Bronchitis

Dyspnoea

P (lung cancer=yes | smoking=no, dyspnoea=yes ) = ?

Belief updating: P(X|evidence)=?

“Moral” graph

P(a|e=0) P(a,e=0)=

bcde ,,,0

P(a)P(b|a)P(c|a)P(d|b,a)P(e|b,c)=

P(a) d

),,,( ecdahB

P(b|a)P(d|b,a)P(e|b,c)

Variable Elimination

P(c|a)c

Bucket elimination Algorithm BE-bel (Dechter 1996)

Elimination operator

P(a|e=0)

W*=4”induced width” (max clique size)

bucket B:

P(c|a)

P(b|a) P(d|b,a) P(e|b,c)

bucket C:

bucket D:

bucket E:

bucket A:

e)(a,hD

e)c,d,(a,hB

e)d,(a,hC

“Moral” graph

BE-BEL

IntelligenceDifficulty

Letter

Student Network example

• P(J)?

Fall 2003 ICS 275A - Constraint Networks 37

The induced-width

• width: is the max number of parents in the ordered graph• Induced-width: width of induced graph: recursively connecting parents going from last node

to first.• Induced-width w*(d) = the max induced-width over all nodes• Induced-width of a graph: max w*(d) over all d

Complexity of elimination

))((exp ( * dwnOddw ordering along graph moral of widthinduced the)(*

The effect of the ordering:

4)( 1* dw 2)( 2

* dw“Moral” graph

More accurately: O(r exp(w*(d)) where r is the number of cpts.For Bayesian networks r=n. For Markov networks?

BE-BEL

The impact of observationsMoral graph Induced

Moral graph

Adjusted Graph for evidence in B

Induced-adjusted.

Probabilistic Inference Tasks

evidence)|xP(X)BEL(X iii

Belief updating:

Finding most probable explanation (MPE) e),xP(maxarg*x

maxElimination operator

W*=4”induced width” (max clique size)

bucket B:

P(c|a)

P(b|a) P(d|b,a) P(e|b,c)

bucket C:

bucket D:

bucket E:

bucket A:

e)(a,hD

e)c,d,(a,hB

e)d,(a,hC

Algorithm elim-mpe (Dechter 1996)

)xP(maxMPEx

),|(),|()|()|()(maxby replaced is

,,,,cbePbadPabPacPaPMPE

Generating the MPE-tuple

P(b|a) P(d|b,a) P(e|b,c)B:

A: P(a)

P(c|a)

e=0 e)(a,hD

e)c,d,(a,hB

e)d,(a,hC

(a)hP(a)max arga' 1. E

0e' 2.

)e'd,,(a'hmax argd' 3. C

)e'c,,d',(a'h

)a'|P(cmax argc' 4.B

)c'b,|P(e')a'b,|P(d')a'|P(bmax argb' 5.

)e',d',c',b',(a' Return

12” 14” 15”

I II III

P60G80G

Min-Cost Assignment

What is minimum cost configuration?

I 30 50

II 40 55

III ∞ 60

I II III

12” 45 ∞ ∞

14” 50 60 70

15” ∞ 65 8060G

50 ∞

I II III

12” 75 ∞ ∞

14” 80 100 130

15” ∞ 105 140

MIN-SUM operatorsCHAIN structure

BE-MPE

Finding small induced-width

• NP-complete• A tree has induced-width of ?• Greedy algorithms:

– Min width– Min induced-width– Max-cardinality– Fill-in (thought as the best)– See anytime min-width (Gogate and Dechter)

1 exact inference algorithms bucket-elimination and more compsci 179, spring 2010 set 8: rina...

bel slide

norvig slide

d e c b slide

csp sumprod slide

x yz trlm slide

f prf pbm slide

yz trlm mpe maxprod

b c d e

Documents

compsci 6/101: pftw

genome revolution: compsci 006g 1.1 focus compsci 006g...

yn6g1s91 - computer science and...

propositional logic russell and norvig chapter 7

computer science...

compsci 102

plus2 compsci e mar2010

modern exact and approximate combinatorial optimization...

rina dechter

uncertainty (chapter 13, russell & norvig)

answers to russell and norvig question 7.3

1 building bayesian networks compsci 276, fall 2009 set 3:...

informed search(russel norvig)

class3: probabilistic networks · 2018. 4. 19. · class3:...

local and global relational...

importance sampling ics 276 fall 2007 rina dechter

rina dechter university of california irvine

google search laboratories peter norvig, research director

welcome to compsci 201 · • is compsci 201 right for me?...

inteligencia artificial russel e norvig (08)