lorentz workshop on human probabilistic inferences a complexity-theoretic perspective on the...

26
Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint work with Iris van Rooij, Todd Wareham, Maria Otworowska, and Mark Blokpoel)

Upload: shannon-fletcher

Post on 02-Jan-2016

216 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Lorentz workshop onHuman Probabilistic Inferences

A complexity-theoretic perspective on the preconditions for Bayesian tractability

Johan Kwisthout

(joint work with Iris van Rooij, Todd Wareham, Maria Otworowska, and

Mark Blokpoel)

Page 2: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

The Tractability Constraint

“The computations postulated by a model of cognition need to be tractable in the real world in which people live, not only in the small world of an experiment ... This eliminates NP-hard models that lead to computational explosion.” (Gigerenzer et al., 2008)

Page 3: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

The Tractability Constraint

“The computations postulated by a model of cognition need to be tractable in the real world in which people live, not only in the small world of an experiment ... This eliminates NP-hard models that lead to computational explosion.” (Gigerenzer et al., 2008)

(neuro-)

Page 4: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Marr’s three levels of explanation

Level Marr’s levels Question

1 Computational What?

2 Algorithm Method?

3 Implementation Implementation?

input i Cognitive process

output o = f(i)

Page 5: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Computational-level Models of Cognition

input iCognitive process

output o = f(i)

Bayesian inference

Page 6: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

The Tractability Constraint

“The computations postulated by a model of cognition need to be tractable in the real world in which people live, not only in the small world of an experiment ... This eliminates NP-hard models that lead to computational explosion.” (Gigerenzer et al., 2008)

Page 7: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Scalability

Baker, C.L., Tenenbaum J.B.,& Saxe, R.R. (2007)Goal Inference as Inverse Planning, CogSci’07

Does this scale to, e.g., recognizing intentions in a job interview?

Page 8: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Scalability (2)

Karl Friston (2013) Life as we know it. J R Soc Interface. (2003) Neural Netw.

Page 9: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

The Tractability Constraint

“The computations postulated by a model of cognition need to be tractable in the real world in which people live, not only in the small world of an experiment ... This eliminates NP-hard models that lead to computational explosion.” (Gigerenzer et al., 2008)

Page 10: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Why NP-hard is considered intractable

NP-hard functions cannot be computed in polynomial time (assuming P NP). Instead they require exponential time (or worse) for their computation, which is why they are considered intractable (in other words, unrealistic to compute for all but small inputs).

n 2n

5 0.19 msec20 1.75 min50 8.4 x 102 yrs

100 9.4 x 1017 yrs1000 7.9 x 10288 yrs

n n2

5 0.15 msec20 0.04 sec50 0.25 sec

100 1.00 sec1000 1.67 min

Page 11: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

n 2n

5 0.19 msec20 1.75 min50 8.4 x 102 yrs

100 9.4 x 1017 yrs1000 7.9 x 10288 yrs

n n2

5 0.15 msec20 0.04 sec50 0.25 sec

100 1.00 sec1000 1.67 min

polynomial exponential

Page 12: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Bayesian Inference is Intractable (NP-hard)

Bayesian inference

Page 13: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Bayesian Inference is Intractable (NP-hard)

Bayesian inference

Bayesian inference

Page 14: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

input i intractable function f

output o = f(i)

input i tractable algorithm A

output A(i) f(i)

Algorithmic-level

Computational-levelconsistent

Can approximation help us?

Page 15: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

input i intractable function f

output o = f(i)

input i tractable algorithm A

output A(i) f(i)

Algorithmic-level

Computational-level

For Bayesian computations: in general, NO

Page 16: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

It is NP-hard to:

• compute posteriors approximately (Dagum & Luby, 1994)

• fixate beliefs approximately (Abdelbar & Hedetniemi, 1998) or with a non-zero probability (Kwisthout, 2011)

• fixate a belief that resembles the most probable belief (Kwisthout, 2012)

• fixate a belief that is likely the most probable belief (Kwisthout & van Rooij, 2012)

• fixate a belief that ranks within the k most probable beliefs (Kwisthout, 2014)

Classical Bayesian approximation results

Page 17: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

• Isn’t this all just a property of discrete distributions?

• No, not in general. Any discrete probability distribution can be approximated by a (multivariate) Gaussian mixture distribution and vice versa

• But shouldn’t we then assume that the distributions used in computational cognitive models are constrained to simple Gaussian distributions?

• We might. But then, can we then reasonable scale models up to, e.g., intention recognition? Or even as far as counterfactual reasoning?

Blokpoel, Kwisthout, and Van Rooij (2012). When can predictive brains be truly Bayesian? Commentary to Andy’s BBS paper – see also Andy’s reply

But wait…

Page 18: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Intractability as “bad news”intractability is a theoretical problem

"zeer indrukwekkend, collega... maar werkt het ook in theorie? "

Fokke & Sukke: “Very impressive, colleague … but does it also work in theory?”

If it is tractable in practice, then it should also be tractable in theory. If not, then our theories fail to explain what happens in practice.

Page 19: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Tractability as a theoretical constraint helps cognitive modelers by ruling out unrealistic models.

Intractability as “good news”tractability is a model constraint

Cognitive functions

Tractable functions

All possible functions

Inference

Belief fixation

constrained

constrained

Page 20: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Step 1. Identify parameters of the model that can be proven to be sources of intractability

• In general, NP-hard problems take exponential time in the worst case to solve some instances are easy, some are hard

• Identify what makes these instances hard (or easy)

How to constrain Bayesian inferences?

Page 21: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Step 1. Identify parameters of the model that can be proven to be sources of intractability

exp(n) exp(k1, k2, …km)poly(n)

For example, k1 : max out degreek2 : max # unknowns

k3 : etc.

Bayesian inference

How to constrain Bayesian inferences?

Page 22: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Step 1. Identify parameters of the model that can be proven to be sources of intractability

exp(n) exp(k1, k2, …km)poly(n)

Step 2. Constrain the model to small values for the parameters k1, k2, …km. (Note: n can still be large!)

Step 3. Verify that the constraints hold for humans in real-life situations, and test in the lab if performance breaks down when parameters are large

How to constrain Bayesian inferences?

Page 23: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

What makes Bayesian inferences tractable?

Exact inferences

degree of network?

cardinality of variables?

length of paths/chains?

structure of dependences

posterior probability

Approximate inferences

degree of network?

cardinality of variables?

length of paths/chains?

structure of dependences

posterior probability characteristics of theprobability distribution?

Page 24: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

• Local search techniques, MC sampling, etc., are dependent on the landscape of the probability distribution

• For some Bayesian inference problems, this landscape can be parameterized – we can prove bounds on the success of the approximation algorithm relative to this parameter

• Kwisthout & Van Rooij (2013), Bridging the gap between theory and practice of approximate Bayesian inference. Cognitive Systems Research, 24, 2–8.

• Kwisthout (under review), Treewidth and the Computational Complexity of MAP Approximations.

CPDs and approximate inference

Page 25: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Our version of the Tractability Constraint

“The computations postulated by a model of cognition need to be tractable in the real world in which people live, not only in the small world of an experiment ... This eliminates NP-hard models that lead to computational explosion.” (Gigerenzer et al., 2008) This poses the need for a thorough analysis of the sources of complexity underlying NP-hard models, and eliminates NP-hard models expect those that can be proven to be fixed-parameter tractable for parameters that may safely be assumed to be small in the real world.

Page 26: Lorentz workshop on Human Probabilistic Inferences A complexity-theoretic perspective on the preconditions for Bayesian tractability Johan Kwisthout (joint

Informal

Formal

Tractability testing

Empirical testing

Computational level theory

Tractable-design cycle