open universes and nuclear weapons

92
1 Open universes and nuclear weapons Stuart Russell Computer Science Division, UC Berkeley

Upload: rupert

Post on 11-Jan-2016

43 views

Category:

Documents


0 download

DESCRIPTION

Open universes and nuclear weapons. Outline. Why we need expressive probabilistic languages BLOG combines probability and first-order logic Application to global seismic monitoring for the Comprehensive Nuclear-Test-Ban Treaty (CTBT). The world has things in it!!. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Open universes and  nuclear weapons

1

Open universes and nuclear weapons

Stuart RussellComputer Science Division, UC Berkeley

Page 2: Open universes and  nuclear weapons

2

Outline

Why we need expressive probabilistic languages BLOG combines probability and first-order logic Application to global seismic monitoring for the

Comprehensive Nuclear-Test-Ban Treaty (CTBT)

Page 3: Open universes and  nuclear weapons

3

The world has things in it!!

Expressive language => concise models => fast learning, sometimes fast reasoning E.g., rules of chess:

1 page in first-order logicOn(color,piece,x,y,t)

~100000 pages in propositional logicWhiteKingOnC4Move12

~100000000000000000000000000000000000000 pages as atomic-state modelR.B.KB.RPPP..PPP..N..N…..PP….q.pp..Q..n..n..ppp..pppr.b.kb.r

[Note: chess is a tiny problem compared to the real world]

Page 4: Open universes and  nuclear weapons

4

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

Page 5: Open universes and  nuclear weapons

5

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

5th C B.C.

Page 6: Open universes and  nuclear weapons

6

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

5th C B.C.

17th C

Page 7: Open universes and  nuclear weapons

7

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

5th C B.C. 19th C

17th C

Page 8: Open universes and  nuclear weapons

8

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

5th C B.C. 19th C

17th C 20th C

Page 9: Open universes and  nuclear weapons

9

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

5th C B.C. 19th C

17th C 20th C 21st C

Page 10: Open universes and  nuclear weapons

10

Brief history of expressiveness

atomic propositional first-order/relational

logic

probability

5th C B.C. 19th C

17th C 20th C 21st C

(be patient!)

Page 11: Open universes and  nuclear weapons

11

First-order probabilistic languages

Gaifman [1964]: Possible worlds with objects and relations,

probabilities attached to (infinitely many) sentences Halpern [1990]:

Probabilities within sentences, constraints on distributions over first-order possible worlds

Poole [1993], Sato [1997], Koller & Pfeffer [1998], various others: KB defines distribution exactly (cf. Bayes nets) assumes unique names and domain closure like Prolog, databases (Herbrand semantics)

Page 12: Open universes and  nuclear weapons

12

Herbrand vs full first-order

GivenFather(Bill,William) and Father(Bill,Junior)How many children does Bill have?

Page 13: Open universes and  nuclear weapons

13

Herbrand vs full first-order

GivenFather(Bill,William) and Father(Bill,Junior)How many children does Bill have?

Herbrand semantics:2

Page 14: Open universes and  nuclear weapons

14

Herbrand vs full first-order

GivenFather(Bill,William) and Father(Bill,Junior)How many children does Bill have?

Herbrand semantics:2First-order logical semantics:

Between 1 and ∞

Page 15: Open universes and  nuclear weapons

15

Possible worlds Propositional

Page 16: Open universes and  nuclear weapons

16

Possible worlds Propositional

First-order + unique names, domain closureA B

C D

A B

C D

A B

C D

A B

C D

A B

C D

Page 17: Open universes and  nuclear weapons

17

Possible worlds Propositional

First-order + unique names, domain closure

First-order open-universe

A B

C D

A B

C D

A B

C D

A B

C D

A B

C D

A B C D A B C D A B C D A B C D A B C D A B C D

Page 18: Open universes and  nuclear weapons

18

Open-universe models

Essential for learning about what exists, e.g., vision, NLP, information integration, tracking, life

[Note the GOFAI Gap: logic-based systems going back to Shakey assumed that perceived objects would be named correctly]

Key question: how to define distributions over an infinite, heterogeneous set of worlds?

Page 19: Open universes and  nuclear weapons

Bayes nets build propositional worlds

19

BurglaryBurglary

AlarmAlarm

EarthquakeEarthquake

Page 20: Open universes and  nuclear weapons

Bayes nets build propositional worlds

20

BurglaryBurglary

AlarmAlarm

EarthquakeEarthquake

Burglary

Page 21: Open universes and  nuclear weapons

Bayes nets build propositional worlds

21

BurglaryBurglary

AlarmAlarm

EarthquakeEarthquake

Burglarynot Earthquake

Page 22: Open universes and  nuclear weapons

Bayes nets build propositional worlds

22

BurglaryBurglary

AlarmAlarm

EarthquakeEarthquake

Burglarynot EarthquakeAlarm

Page 23: Open universes and  nuclear weapons

23

Open-universe models in BLOG

Construct worlds using two kinds of steps, proceeding in topological order: Dependency statements: Set the value of a

function or relation on a tuple of (quantified) arguments, conditioned on parent values

Page 24: Open universes and  nuclear weapons

24

Open-universe models in BLOG

Construct worlds using two kinds of steps, proceeding in topological order: Dependency statements: Set the value of a

function or relation on a tuple of (quantified) arguments, conditioned on parent values

Number statements: Add some objects to the world, conditioned on what objects and relations exist so far

Page 25: Open universes and  nuclear weapons

25

Semantics

Every well-formed* BLOG model specifies a unique proper probability distribution over open-universe possible worlds; equivalent to an infinite contingent Bayes net

* No infinite receding ancestor chains, no conditioned cycles, all expressions finitely evaluable

Page 26: Open universes and  nuclear weapons

26

Example: Citation Matching[Lashkari et al 94] Collaborative Interface Agents,

Yezdi Lashkari, Max Metral, and Pattie Maes, Proceedings of the Twelfth National Conference on Articial Intelligence, MIT Press, Cambridge, MA, 1994.

Metral M. Lashkari, Y. and P. Maes. Collaborative interface agents. In Conference of the American Association for Artificial Intelligence, Seattle, WA, August 1994.

Are these descriptions of the same object?

Core task in CiteSeer, Google Scholar, over 300 companies in the record linkage industry

Page 27: Open universes and  nuclear weapons

27

(Simplified) BLOG model

#Researcher ~ NumResearchersPrior();

Name(r) ~ NamePrior();

#Paper(FirstAuthor = r) ~ NumPapersPrior(Position(r));

Title(p) ~ TitlePrior();

PubCited(c) ~ Uniform({Paper p});

Text(c) ~ NoisyCitationGrammar (Name(FirstAuthor(PubCited(c))), Title(PubCited(c)));

Page 28: Open universes and  nuclear weapons

28

(Simplified) BLOG model

#Researcher ~ NumResearchersPrior();

Name(r) ~ NamePrior();

#Paper(FirstAuthor = r) ~ NumPapersPrior(Position(r));

Title(p) ~ TitlePrior();

PubCited(c) ~ Uniform({Paper p});

Text(c) ~ NoisyCitationGrammar (Name(FirstAuthor(PubCited(c))), Title(PubCited(c)));

Page 29: Open universes and  nuclear weapons

29

(Simplified) BLOG model

#Researcher ~ NumResearchersPrior();

Name(r) ~ NamePrior();

#Paper(FirstAuthor = r) ~ NumPapersPrior(Position(r));

Title(p) ~ TitlePrior();

PubCited(c) ~ Uniform({Paper p});

Text(c) ~ NoisyCitationGrammar (Name(FirstAuthor(PubCited(c))), Title(PubCited(c)));

Page 30: Open universes and  nuclear weapons

30

(Simplified) BLOG model

#Researcher ~ NumResearchersPrior();

Name(r) ~ NamePrior();

#Paper(FirstAuthor = r) ~ NumPapersPrior(Position(r));

Title(p) ~ TitlePrior();

PubCited(c) ~ Uniform({Paper p});

Text(c) ~ NoisyCitationGrammar (Name(FirstAuthor(PubCited(c))), Title(PubCited(c)));

Page 31: Open universes and  nuclear weapons

31

(Simplified) BLOG model

#Researcher ~ NumResearchersPrior();

Name(r) ~ NamePrior();

#Paper(FirstAuthor = r) ~ NumPapersPrior(Position(r));

Title(p) ~ TitlePrior();

PubCited(c) ~ Uniform({Paper p});

Text(c) ~ NoisyCitationGrammar (Name(FirstAuthor(PubCited(c))), Title(PubCited(c)));

Page 32: Open universes and  nuclear weapons

32

(Simplified) BLOG model

#Researcher ~ NumResearchersPrior();

Name(r) ~ NamePrior();

#Paper(FirstAuthor = r) ~ NumPapersPrior(Position(r));

Title(p) ~ TitlePrior();

PubCited(c) ~ Uniform({Paper p});

Text(c) ~ NoisyCitationGrammar (Name(FirstAuthor(PubCited(c))), Title(PubCited(c)));

Page 33: Open universes and  nuclear weapons

33

Citation Matching Results

Four data sets of ~300-500 citations, referring to ~150-300 papers

0

0.05

0.1

0.15

0.2

0.25

Reinforce Face Reason Constraint

Error

(Fraction of Clusters Not Recovered Correctly)

Phrase Matching[Lawrence et al. 1999]

Generative Model + MCMC[Pasula et al. 2002]

Conditional Random Field[Wellner et al. 2004]

Page 34: Open universes and  nuclear weapons

34

Example: Sibyl attacks Typically between 100 and 10,000 real people About 90% are honest, have one login ID Dishonest people own between 10 and 1000 logins. Transactions may occur between logins

If two logins are owned by the same person (sibyls), then a transaction is highly likely;

Otherwise, transaction is less likely (depending on honesty of each login’s owner).

A login may recommend another after a transaction: Sibyls with the same owner usually recommend each other; Otherwise, probability of recommendation depends on the honesty of the

two owners.

Page 35: Open universes and  nuclear weapons

35

#Person ~ LogNormal[6.9, 2.3]();Honest(x) ~ Boolean[0.9]();#Login(Owner = x) ~ if Honest(x) then 1 else LogNormal[4.6,2.3]();Transaction(x,y) ~ if Owner(x) = Owner(y) then SibylPrior() else TransactionPrior(Honest(Owner(x)), Honest(Owner(y)));Recommends(x,y) ~ if Transaction(x,y) then if Owner(x) = Owner(y) then Boolean[0.99]() else RecPrior(Honest(Owner(x)), Honest(Owner(y)));

Evidence: lots of transactions and recommendations, maybe some Honest(.) assertionsQuery: Honest(x)

Page 36: Open universes and  nuclear weapons

36

#Person ~ LogNormal[6.9, 2.3]();Honest(x) ~ Boolean[0.9]();#Login(Owner = x) ~ if Honest(x) then 1 else LogNormal[4.6,2.3]();Transaction(x,y) ~ if Owner(x) = Owner(y) then SibylPrior() else TransactionPrior(Honest(Owner(x)), Honest(Owner(y)));Recommends(x,y) ~ if Transaction(x,y) then if Owner(x) = Owner(y) then Boolean[0.99]() else RecPrior(Honest(Owner(x)), Honest(Owner(y)));

Evidence: lots of transactions and recommendations, maybe some Honest(.) assertionsQuery: Honest(x)

Page 37: Open universes and  nuclear weapons

37

#Person ~ LogNormal[6.9, 2.3]();Honest(x) ~ Boolean[0.9]();#Login(Owner = x) ~ if Honest(x) then 1 else LogNormal[4.6,2.3]();Transaction(x,y) ~ if Owner(x) = Owner(y) then SibylPrior() else TransactionPrior(Honest(Owner(x)), Honest(Owner(y)));Recommends(x,y) ~ if Transaction(x,y) then if Owner(x) = Owner(y) then Boolean[0.99]() else RecPrior(Honest(Owner(x)), Honest(Owner(y)));

Evidence: lots of transactions and recommendations, maybe some Honest(.) assertionsQuery: Honest(x)

Page 38: Open universes and  nuclear weapons

38

#Person ~ LogNormal[6.9, 2.3]();Honest(x) ~ Boolean[0.9]();#Login(Owner = x) ~ if Honest(x) then 1 else LogNormal[4.6,2.3]();Transaction(x,y) ~ if Owner(x) = Owner(y) then SibylPrior() else TransactionPrior(Honest(Owner(x)), Honest(Owner(y)));Recommends(x,y) ~ if Transaction(x,y) then if Owner(x) = Owner(y) then Boolean[0.99]() else RecPrior(Honest(Owner(x)), Honest(Owner(y)));

Evidence: lots of transactions and recommendations, maybe some Honest(.) assertionsQuery: Honest(x)

Page 39: Open universes and  nuclear weapons

39

#Person ~ LogNormal[6.9, 2.3]();Honest(x) ~ Boolean[0.9]();#Login(Owner = x) ~ if Honest(x) then 1 else LogNormal[4.6,2.3]();Transaction(x,y) ~ if Owner(x) = Owner(y) then SibylPrior() else TransactionPrior(Honest(Owner(x)), Honest(Owner(y)));Recommends(x,y) ~ if Transaction(x,y) then if Owner(x) = Owner(y) then Boolean[0.99]() else RecPrior(Honest(Owner(x)), Honest(Owner(y)));

Evidence: lots of transactions and recommendations, maybe some Honest(.) assertionsQuery: Honest(x)

Page 40: Open universes and  nuclear weapons

40

Example: classical data association

Page 41: Open universes and  nuclear weapons

41

Example: classical data association

Page 42: Open universes and  nuclear weapons

42

Example: classical data association

Page 43: Open universes and  nuclear weapons

43

Example: classical data association

Page 44: Open universes and  nuclear weapons

44

Example: classical data association

Page 45: Open universes and  nuclear weapons

45

Example: classical data association

Page 46: Open universes and  nuclear weapons

46

#Aircraft(EntryTime = t) ~ NumAircraftPrior();

Exits(a, t) if InFlight(a, t) then ~ Bernoulli(0.1);

InFlight(a, t)if t < EntryTime(a) then = falseelseif t = EntryTime(a) then = trueelse = (InFlight(a, t-1) & !Exits(a, t-1));

State(a, t)if t = EntryTime(a) then ~ InitState() elseif InFlight(a, t) then ~ StateTransition(State(a, t-1));

#Blip(Source = a, Time = t) if InFlight(a, t) then

~ NumDetectionsCPD(State(a, t));

#Blip(Time = t) ~ NumFalseAlarmsPrior();

ApparentPos(r)if (Source(r) = null) then ~ FalseAlarmDistrib()else ~ ObsCPD(State(Source(r), Time(r)));

Page 47: Open universes and  nuclear weapons

47

Inference

Theorem: BLOG inference algorithms (rejection sampling, importance sampling, MCMC) converge to correct posteriors for any well-formed* model, for any first-order query

Current generic MCMC engine is quite slow Applying compiler technology Developing user-friendly methods for specifying

piecemeal MCMC proposals

Page 48: Open universes and  nuclear weapons

48

CTBT

Bans testing of nuclear weapons on earth Allows for outside inspection of 1000km2

182/195 states have signed 153/195 have ratified Need 9 more ratifications including US, China US Senate refused to ratify in 1998

“too hard to monitor”

Page 49: Open universes and  nuclear weapons

49

2053 nuclear explosions

Page 50: Open universes and  nuclear weapons

50

Page 51: Open universes and  nuclear weapons

51

254 monitoring stations

Page 52: Open universes and  nuclear weapons

52

Page 53: Open universes and  nuclear weapons

53

Vertically Integrated Seismic Analysis

The problem is hard: ~10000 “detections” per day, 90% false CTBT system (SEL3) finds 69% of significant events plus

about twice as many spurious (nonexistent) events 16 human analysts find more events, correct existing ones,

throw out spurious events, generate LEB (“ground truth”) Unreliable below magnitude 4 (1kT)

Solve it by global probabilistic inference NET-VISA finds around 88% of significant events

Page 54: Open universes and  nuclear weapons

54

Page 55: Open universes and  nuclear weapons

55

Page 56: Open universes and  nuclear weapons

56

Page 57: Open universes and  nuclear weapons

57

Page 58: Open universes and  nuclear weapons

58

Page 59: Open universes and  nuclear weapons

59

Page 60: Open universes and  nuclear weapons

60

Page 61: Open universes and  nuclear weapons

61

Page 62: Open universes and  nuclear weapons

62

Page 63: Open universes and  nuclear weapons

63

Page 64: Open universes and  nuclear weapons

64

Generative model for IDC arrival data

Events occur in time and space with magnitude Natural spatial distribution a mixture of Fisher-Binghams Man-made spatial distribution uniform Time distribution Poisson with given spatial intensity Magnitude distribution Gutenberg-Richter (exponential) Aftershock distribution (not yet implemented)

Travel time according to IASPEI91 model plus Laplacian error distribution for each of 14 phases

Detection depends on magnitude, distance, station* Detected azimuth, slowness plus Laplacian error False detections with station-dependent distribution

Page 65: Open universes and  nuclear weapons

65

# SeismicEvents ~ Poisson[TIME_DURATION*EVENT_RATE];IsEarthQuake(e) ~ Bernoulli(.999);EventLocation(e) ~ If IsEarthQuake(e) then EarthQuakeDistribution()

Else UniformEarthDistribution();Magnitude(e) ~ Exponential(log(10)) + MIN_MAG;Distance(e,s) = GeographicalDistance(EventLocation(e), SiteLocation(s));IsDetected(e,p,s) ~ Logistic[SITE_COEFFS(s,p)](Magnitude(e), Distance(e,s);#Arrivals(site = s) ~ Poisson[TIME_DURATION*FALSE_RATE(s)];#Arrivals(event=e, site) = If IsDetected(e,s) then 1 else 0;Time(a) ~ If (event(a) = null) then Uniform(0,TIME_DURATION)

else IASPEI(EventLocation(event(a)),SiteLocation(site(a)),Phase(a)) + TimeRes(a);TimeRes(a) ~ Laplace(TIMLOC(site(a)), TIMSCALE(site(a)));Azimuth(a) ~ If (event(a) = null) then Uniform(0, 360)

else GeoAzimuth(EventLocation(event(a)),SiteLocation(site(a)) + AzRes(a);AzRes(a) ~ Laplace(0, AZSCALE(site(a)));Slow(a) ~ If (event(a) = null) then Uniform(0,20)

else IASPEI-SLOW(EventLocation(event(a)),SiteLocation(site(a)) + SlowRes(site(a));

Page 66: Open universes and  nuclear weapons

66

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Page 67: Open universes and  nuclear weapons

67

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

TypeTime

LocationDepth

MagnitudePhase

Page 68: Open universes and  nuclear weapons

68

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Travel timeAmplitude decay

Page 69: Open universes and  nuclear weapons

69

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Arrival time*Amplitude*Azimuth*

Slowness*Phase*

Page 70: Open universes and  nuclear weapons

70

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Page 71: Open universes and  nuclear weapons

71

Page 72: Open universes and  nuclear weapons

72

Page 73: Open universes and  nuclear weapons

73

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Page 74: Open universes and  nuclear weapons

74

Travel-time residual (station 6)

Page 75: Open universes and  nuclear weapons

75

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Page 76: Open universes and  nuclear weapons

76

Detection probability as a function of distance (station 6, mb 3.5)

P phase S phase

Page 77: Open universes and  nuclear weapons

77

Seismic event

Propagation

Seismic event

Propagation

Station 1picks

Station 2picks

Forward model structure

Detected atStation 1?

Detected atStation 2?

Station 1noise

Station 2 noise

Page 78: Open universes and  nuclear weapons

78

Overall Pick Error

Page 79: Open universes and  nuclear weapons

79

Overall Azimuth Error

Page 80: Open universes and  nuclear weapons

80

Phase confusion matrix

Page 81: Open universes and  nuclear weapons

81

Fraction of LEB events missed

Page 82: Open universes and  nuclear weapons

82

Fraction of LEB events missed

Page 83: Open universes and  nuclear weapons

83

Event distribution: LEB vs SEL3

Page 84: Open universes and  nuclear weapons

84

Event distribution: LEB vs NET-VISA

Page 85: Open universes and  nuclear weapons

85

Why does NET-VISA work?

Multiple empirically calibrated seismological models Improving model structure and quality improves the results

Sound Bayesian combination of evidence Measured arrival times, phase labels, azimuths, etc.,

NOT taken literally Absence of detections provides negative evidence More detections per event than SEL3 or LEB

Page 86: Open universes and  nuclear weapons

86

Example of using extra detections

Page 87: Open universes and  nuclear weapons

87

NEIC event (3.0) missed by LEB

Page 88: Open universes and  nuclear weapons

88

NEIC event (3.7) missed by LEB

Page 89: Open universes and  nuclear weapons

89

NEIC event (2.6) missed by LEB

Page 90: Open universes and  nuclear weapons

90

Why does NET-VISA not work (perfectly)?

Needs hydroacoustic for mid-ocean events Weaknesses in model:

Travel time residuals for all phases along a single path are assumed to be uncorrelated

Each phase arrival is assumed to generate at most one detection; in fact, multiple detections occur

Arrival detectors use high SNR thresholds, look only at local signal to make hard decisions

Page 91: Open universes and  nuclear weapons

91

Detection-based and signal-based monitoring

events

detections

waveform signals

SEL3 NET-VISA SIG-VISA

Page 92: Open universes and  nuclear weapons

92

Summary

Expressive probability models are very useful BLOG provides a generative language for defining

first-order, open-universe models Inference via MCMC over possible worlds

Other methods welcome! CTBT application is typical of multi-sensor monitoring

applications that need vertical integration and involve data association

Scaling up inference is the next step