up2date january 30th, 2009parallel.vub.ac.be/~jan/papers/introduction to bayesian...up2date...

26
Slide 1 www.mastersininnovation.com Confidential 30.01.2009 Bayesian Networks Jan Lemeire Department of Electronics and Informatics Vrije Universiteit Brussel [email protected] www.vub.ac.be UP2DATE January 30 th , 2009 www.mastersininnovation.com Commercially confidence This presentation contains ideas and information which are proprietary of VERHAERT, Masters in Innovation*, it is given in confidence. You are authorized to open and view the electronic copy of this document and to print a single copy. Otherwise, the material may not in whole or in part be copied, stored electronically or communicated to third parties without prior agreement of VERHAERT, Masters in Innovation*. * VERHAERT, Masters in Innovation is a registered trade name of Verhaert Consultancies N.V.

Upload: others

Post on 28-Feb-2021

2 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 1

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Bayesian Networks

Jan Lemeire

Department of Electronics and Informatics

Vrije Universiteit Brussel

[email protected]

www.vub.ac.be

UP2DATE – January 30th, 2009

www.mastersininnovation.com

Commercially confidence – This presentation contains ideas and information which are proprietary of VERHAERT, Masters in Innovation*, it is given in confidence. You are authorized to

open and view the electronic copy of this document and to print a single copy. Otherwise, the material may not in whole or in part be copied, stored electronically or communicated to third

parties without prior agreement of VERHAERT, Masters in Innovation*.

* VERHAERT, Masters in Innovation is a registered trade name of Verhaert Consultancies N.V.

Page 2: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 2

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Introduction: From Bayes’ rule to

Bayesian networks

The use of Bayesian networks

Learning from data

Comparison with other techniques

Conclusions

Overview

Page 3: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 3

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Belief Update during Medical Diagnosis

Information P(cancer)

Prior belief 0.001

Short of breath (symptom) 0.002

Smoker 0.008

X-rays (clinical test) 0.04

It is not bronchitis 0.5

Page 4: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 4

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Bayes’ rule

Thomas Bayes

P(cancer | smoker) = P(cancer). P(smoker |cancer)

P(smoker)

Prior belief New knowledgePosterior belief

Page 5: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 5

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Types of questions

Probabilistic queries

"I've got a temperature of 101, I'm a 37-year-old

Male and my tongue feels kind of funny but I have

no headache. What's the chance that I've got

bubonic plague?"

Take decisions about tests

Utility of tests: which tests give me maximal

information

Take decisions about interventions

Explain things in terms of causal relations

How to represent this knowledge?

expert

Page 6: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 6

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Bayesian Networks

Intuitive graphical representation expressing the relations among the

variables. Can be causal relations.

Plus probabilities attached to each node.

Visited

AsiaPollution Smoker

Tuberculosis Cancer Bronchitis

positive X-ray

resultsincreased total

serum calciumshort of breath

(Dyspnoea)

P(visited Asia)P(pollution)P(Smoker)P(tuberculosis | visited Asia)P(cancer |smoker , pollution)P(Bronchitis|smoker)…

Page 7: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 7

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Overview

30.01.2009

Introduction: From Bayes’ rule to

Bayesian networks

The use of Bayesian networks

Learning from data

Comparison with other techniques

Conclusions

Page 8: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 8

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Typical use of Bayesian networks

to model and explain a domain.

to update beliefs about states of certain variables when some other variables were observed,

e.g.: P(car breaks down | age of car = 16, changed oil = no).

≈ prediction

to find most probable configurations of variables

to support decision making under uncertainty (a Bayesian Network is a probabilistic model)

to find good strategies for solving tasks in a domain with uncertainty

Page 9: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 9

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Typical use of Bayesian networks II

30.01.2009

Smoking Cancer

Smoking Cancer

Smoking &

Cancer Gene

Explain things in terms of causal relations

E.g.: smoking causes lung cancer?

To answer qualitative questions

E.g.: if the national bank would lower interest rates, but the confidence remains low, would it help the economy?

or

Page 10: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 10

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Dynamic Bayesian Networks

Models a dynamic system: the state at time t is affected by the state

at time t-1

Used in reliability analysis

Graphical Duration Models

Rail defect

Page 11: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 11

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Examples

Norsys (http://www.norsys.com/ )

http://www.norsys.com/netlibrary/index.htm

Coronary Risk: a Bayesian Network to predict risk of Coronary Heart

Disease

Agricultural Yield

Car Diagnosis

Chest Clinic Decision: A graphical method for solving a decision

analysis problem

Oil Wildcatter Extended: decision network

Win95pts: An expert system for printer troubleshooting in Windows 95.

Page 12: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 12

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Overview

30.01.2009

Introduction: From Bayes’ rule to

Bayesian networks

The use of Bayesian networks

Learning from data

Comparison with other techniques

Conclusions

Page 13: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 13

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Learning Bayesian networks

From data and expert knowledge

I. Parameterization (‘the probabilities’)

Based on assumptions on the relations

e.g. linear with Gaussian errors

When the structure is known

Page 14: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 14

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Learning Bayesian networks II

II. The structure of the graph

1. Find minimal model that best explains the data

Trade-off between goodness-of-fit and model complexity

2. Find model that explains the conditional independencies

A causal structure implies conditional independencies

30.01.2009

overfitting in regression

Causal structure is learned from data!!

Smoking Cancer positive X-ray

results

Page 15: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 15

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Introduction: From Bayes’ rule to

Bayesian networks

The use of Bayesian networks

Learning from data

Comparison with other techniques

Conclusions

Overview

30.01.2009

Page 16: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 16

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Statistical Learning

Goal: learning from data, try to understand the underlying system

that generated the data

Supervised learning

30.01.2009

Classification of galaxies by Hubble telescope

learn to predict E from A, B, C & D

Page 17: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 17

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Techniques for supervised learning

RegressionSupport Vector Machines

Neural network

Page 18: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 18

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Statistical Learning II

Unsupervised learning

there is no outcome measure

the goal is to describe the associations and patterns of the data

30.01.2009

Example: find types of

consumers

Cluster Analysis

Page 19: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 19

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Provide ‘black box’ models for the relation between the variables

that we know and the ones we want to predict.

They are usually better in that task.

Bayesian networks model the relations among all variables.

Bayesian networks provide insight.

Bayesian networks are not good in pattern recognition (e.g.

recognizing characters from a printed text).

30.01.2009

Techniques for statistical learning compared

Page 20: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 20

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Knowledge representation

30.01.2009

Expert Systems

Rule-based systems

Decision trees

A => BB & C => D

Example: Evaluating car insurance risks.

Page 21: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 21

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Knowledge representation II

30.01.2009

Fuzzy Logic Fuzzy Rules

Page 22: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 22

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Knowledge representation techniques compared

Bayesian networks can be regarded as the underlying (causal)

structure, from which (fuzzy) rules can be extracted

While the graph describes the relations among the variables, other

techniques describe the type and structure of the relations better .

30.01.2009

Digital Circuit

Page 23: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 23

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential

Overview

30.01.2009

Introduction: From Bayes’ rule to

Bayesian networks

The use of Bayesian networks

Learning from data

Comparison with other techniques

Conclusions

Page 24: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 24

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Conclusions

A lot of data is available nowadays + a lot computing power

Data mining and statistical learning becomes interesting

Academic researchers are in need of real-world data and problems!

A Bayesian network provides an intuitive graphical representation

Knowledge representation

Learning algorithms exist that can learn the models from data

Extract knowledge from data

Useful when the relations among the variables matter

Not when the (causal) relations are trivial

Page 25: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 25

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

References

My research: http://parallel.vub.ac.be => research => causal

inference

Norsys: http://www.norsys.com/

http://www.norsys.com/netlibrary/index.htm (examples)

tutorials

Bayesia: http://www.bayesia.com

Examples:

http://www.bayesia.com/en/products/bayesialab/resources.php

Bayesian Network Repository

http://compbio.cs.huji.ac.il/Repository/

Statistical Data Mining Tutorials:

http://www.autonlab.org/tutorials/

Page 26: UP2DATE January 30th, 2009parallel.vub.ac.be/~jan/papers/Introduction to Bayesian...UP2DATE –January 30th, 2009 Commercially confidence –This presentation contains ideas and information

Bayesian networks

Slide 26

ww

w.m

aste

rsin

inn

ova

tio

n.c

om

Confidential 30.01.2009

Verhaert New Products & Services nvHogenakkerhoekstraat 21

9150 Kruibeke

Belgium

Tel +32 (0)3 250 19 00

Fax +32 (0)3 254 10 08

www.verhaert.com

[email protected]

www.mastersininnovation.com