Statistical Methods in Particle Physics
Day 2: Multivariate Methods (I)

Center for High Energy Physics, Tsinghua University
12-16 April 2010

Glen Cowan
Physics Department
Royal Holloway, University of London
[email protected]
www.pp.rhul.ac.uk/~cowan

Outline of lectures

Day #1: Introduction
    Review of probability and Monte Carlo
    Review of statistics: parameter estimation

Day #2: Multivariate methods (I)
    Event selection as a statistical test
    Cut-based, linear discriminant, neural networks

Day #3: Multivariate methods (II)
    More multivariate classifiers: BDT, SVM, ...

Day #4: Significance tests for discovery and limits
    Including systematics using profile likelihood

Day #5: Bayesian methods
    Bayesian parameter estimation and model selection

Day #2: outline

Multivariate methods for HEP

Event selection as a statistical test

Neyman-Pearson lemma and likelihood ratio test

Some multivariate classifiers

Cut-based event selection

Linear classifiers

Neural networks

Probability density estimation methods

The Large Hadron Collider

Counter-rotating proton beams in 27 km circumference ring

pp centre-of-mass energy 14 TeV

Detectors at 4 pp collision points:
    ATLAS (general purpose)
    CMS (general purpose)
    LHCb (b physics)
    ALICE (heavy ion physics)

The ATLAS detector

2100 physicists
37 countries
167 universities/labs

25 m diameter
46 m length
7000 tonnes
~$10^8$ electronic channels

LHC event production rates

[Figure: LHC event production rates, with annotations: most events (boring); mildly interesting; interesting; very interesting (~1 out of every $10^{11}$)]

LHC data

At LHC, ~$10^9$ pp collision events per second, mostly uninteresting:
    do quick sifting, record ~200 events/sec
    single event ~ 1 Mbyte
    1 "year" ≈ $10^7$ s, $10^{16}$ pp collisions / year
    $2 \times 10^9$ events recorded / year (~2 Pbyte / year)

For new/rare processes, rates at LHC can be vanishingly small,
e.g. Higgs bosons detectable per year could be ~$10^3$

→ 'needle in a haystack'

For Standard Model and (many) non-SM processes we can generate simulated data with Monte Carlo programs (including simulation of the detector).
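The yearly totals quoted on this slide follow from simple arithmetic. A minimal sketch, taking the slide's round numbers as inputs (the variable names are illustrative and not part of the lecture):

```python
# Back-of-envelope LHC data rates, using the round numbers quoted on the slide.
collision_rate = 1e9      # pp collisions per second
recorded_rate = 200       # events written to storage per second
event_size_bytes = 1e6    # about 1 Mbyte per event
seconds_per_year = 1e7    # one accelerator "year" of running

collisions_per_year = collision_rate * seconds_per_year
recorded_per_year = recorded_rate * seconds_per_year
storage_pbyte = recorded_per_year * event_size_bytes / 1e15
kept_fraction = recorded_rate / collision_rate

print(f"collisions per year: {collisions_per_year:.0e}")
print(f"recorded per year:   {recorded_per_year:.0e}")
print(f"storage per year:    {storage_pbyte:.0f} Pbyte")
print(f"fraction kept:       {kept_fraction:.0e}")
```

Only about two events in every ten million are kept, which is why the quick sifting step matters so much.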

A simulated SUSY event in ATLAS

[Figure: event display of a simulated pp collision showing high $p_T$ muons, high $p_T$ jets of hadrons, and missing transverse energy]

Background events

This event from Standard Model ttbar production also has high $p_T$ jets and muons, and some missing transverse energy.

→ can easily mimic a SUSY event.

A simulated event

PYTHIA Monte Carlo
pp → gluino-gluino
...

Event selection as a statistical test

For each event we measure a set of numbers: $\vec{x} = (x_1, \ldots, x_n)$

    $x_1$ = jet $p_T$
    $x_2$ = missing energy
    $x_3$ = particle i.d. measure, ...

$\vec{x}$ follows some n-dimensional joint probability density, which depends on the type of event produced, i.e., was it $pp \to t\bar{t}$, $pp \to \tilde{g}\tilde{g}$, ...

E.g. hypotheses $H_0$, $H_1$, ...
Often simply "signal", "background"

[Figure: events in the $(x_i, x_j)$ plane distributed according to $p(\vec{x}|H_0)$ and $p(\vec{x}|H_1)$]

Finding an optimal decision boundary

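In particle physics usually start by making simple "cuts":

    $x_i < c_i$
    $x_j < c_j$

Maybe later try some other type of decision boundary:

[Figure: three panels in the $(x_i, x_j)$ plane, each divided by a decision boundary into an $H_0$ region and an $H_1$ region]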
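A cut-based selection can be sketched in a few lines. The example below is only an illustration: the Gaussian toy distributions, their means, and the cut values c_i and c_j are assumptions made here, not numbers from the lecture.

```python
import random

random.seed(1)

def generate(n, mean_i, mean_j, sigma=1.0):
    """Toy events (x_i, x_j) drawn from independent Gaussians (an assumption)."""
    return [(random.gauss(mean_i, sigma), random.gauss(mean_j, sigma))
            for _ in range(n)]

def passes_cuts(event, c_i, c_j):
    """Rectangular selection: accept the event if x_i < c_i and x_j < c_j."""
    x_i, x_j = event
    return x_i < c_i and x_j < c_j

def efficiency(events, c_i, c_j):
    """Fraction of events accepted by the cuts."""
    return sum(passes_cuts(e, c_i, c_j) for e in events) / len(events)

# Hypothetical toy samples: signal at low (x_i, x_j), background at higher values.
signal = generate(10000, mean_i=0.0, mean_j=0.0)
background = generate(10000, mean_i=2.0, mean_j=2.0)

c_i, c_j = 1.0, 1.0  # illustrative cut values, not tuned
eff_s = efficiency(signal, c_i, c_j)
eff_b = efficiency(background, c_i, c_j)
print(f"signal efficiency:     {eff_s:.3f}")
print(f"background efficiency: {eff_b:.3f}")
```

Scanning c_i and c_j and recording the two efficiencies traces out the trade-off between keeping signal and rejecting background.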

Two distinct event selection problems

In some cases, the event types in question are both known to exist.

    Example: separation of different particle types (electron vs muon)
    Use the selected sample for further study.

In other cases, the null hypothesis $H_0$ means "Standard Model" events, and the alternative $H_1$ means "events of a type whose existence is not yet established" (to do so is the goal of the analysis).

    Many subtle issues here, mainly related to the heavy burden of proof required to establish presence of a new phenomenon.

    Typically require p-value of background-only hypothesis below ~ $10^{-7}$ (a 5 sigma effect) to claim discovery of "New Physics".
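The link between a "5 sigma effect" and the p-value threshold can be checked numerically. This sketch assumes the one-sided Gaussian tail convention for converting a significance Z into a p-value; the function name is illustrative.

```python
import math

def p_value_from_significance(z):
    """One-sided upper-tail probability of a standard Gaussian beyond z."""
    return 0.5 * math.erfc(z / math.sqrt(2.0))

p5 = p_value_from_significance(5.0)
print(f"p-value for Z = 5: {p5:.2e}")  # prints about 2.87e-07
```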

Using classifier output for discovery

[Figure: left, distributions f(y) of the classifier output y for signal and background, normalized to unity; right, N(y) normalized to the expected number of events, showing the background and a possible excess in the search region defined by $y_{\rm cut}$]

Discovery = number of events found in search region incompatible with background-only hypothesis.

p-value of background-only hypothesis can depend crucially on the distribution f(y|b) in the "search region".
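For a simple counting experiment in the search region, the p-value of the background-only hypothesis is a Poisson tail probability. The sketch below assumes that counting model; the expected backgrounds and the observed count are hypothetical numbers chosen for illustration.

```python
import math

def poisson_p_value(n_obs, b):
    """P(n >= n_obs | b): probability that background alone, with Poisson
    mean b, fluctuates up to at least the observed count."""
    term = math.exp(-b)      # P(0 | b)
    cumulative = 0.0
    for n in range(n_obs):   # sum P(n | b) for n = 0 .. n_obs - 1
        cumulative += term
        term *= b / (n + 1)  # P(n + 1 | b) from P(n | b)
    return 1.0 - cumulative

n_obs = 5                    # hypothetical observed count in the search region
for b in (0.5, 1.0):         # hypothetical expected background
    print(f"b = {b}: p-value = {poisson_p_value(n_obs, b):.2e}")
```

Doubling the assumed background changes the p-value by more than an order of magnitude here, which illustrates why the background distribution in the search region must be known well. For very small p-values the subtraction from one loses precision, so a direct sum of the upper tail would be a better choice.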

Example of a "cut-based" study

In the 1990s, the CDF experiment at Fermilab (Chicago) measured the number of hadron jets produced in proton-antiproton collisions as a function of their momentum perpendicular to the beam direction:

[Figure: "jet" of particles]

Prediction low relative to data for very high transverse momentum.

High $p_T$ jets = quark substructure?

Although the data agree remarkably well with the Standard Model (QCD) prediction overall, the excess at high $p_T$ appears significant:

The fact that the variable is "understandable" leads directly to a plausible explanation for the discrepancy, namely, that quarks could possess an internal substructure.

This would not have been the case if the variable plotted were a complicated combination of many inputs.

High $p_T$ jets from parton model uncertainty

Furthermore, the physical understanding of the variable led one to a more plausible explanation, namely, an uncertain modeling of the quark (and gluon) momentum distributions inside the proton.

When model adjusted, discrepancy largely disappears:

Can be regarded as a "success" of the cut-based approach. Physical understanding of output variable led to solution of apparent discrepancy.

Neural network example from LEP II

Signal: $e^+e^- \to W^+W^-$ (often 4 well separated hadron jets)

Background: $e^+e^- \to q\bar{q}gg$ (4 less well separated hadron jets)

[Figure: distributions of the input variables] Input variables based on jet structure, event shape, ...; none by itself gives much separation.

Neural network output:

(Garrido, Juste and Martinez, ALEPH 96-144)

Some issues with neural networks

In the example with WW events, goal was to select these events so as to study properties of the W boson.

Needed to avoid using input variables correlated to the properties we eventually wanted to study (not trivial).

In principle a single hidden layer with a sufficiently large number of nodes can approximate arbitrarily well the optimal test variable (likelihood ratio).

Usually start with relatively small number of nodes and increase until misclassification rate on validation data sample ceases to decrease.

Often MC training data is cheap, so problems with getting stuck in local minima, overtraining, etc., are less important than concerns of systematic differences between the training data and Nature, and concerns about the ease of interpretation of the output.
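To make the "single hidden layer" concrete, here is a minimal sketch of the forward pass of such a network with one output node. The logistic activation and all weight values are assumptions chosen for illustration; they are not trained values and are not taken from the lecture.

```python
import math

def sigmoid(u):
    """Logistic activation function."""
    return 1.0 / (1.0 + math.exp(-u))

def network_output(x, hidden_weights, hidden_biases, out_weights, out_bias):
    """Forward pass of a network with one hidden layer and a single output."""
    # Each hidden node applies the activation to a weighted sum of the inputs.
    hidden = [sigmoid(b + sum(w * xj for w, xj in zip(ws, x)))
              for ws, b in zip(hidden_weights, hidden_biases)]
    # The output node does the same with the hidden-node values.
    return sigmoid(out_bias + sum(a * h for a, h in zip(out_weights, hidden)))

# Hypothetical weights for 2 inputs and 3 hidden nodes (not trained values).
hidden_weights = [[1.0, -0.5], [0.3, 0.8], [-1.2, 0.4]]
hidden_biases = [0.1, -0.2, 0.0]
out_weights = [1.5, -1.0, 0.7]
out_bias = -0.3

y = network_output([0.5, 1.0], hidden_weights, hidden_biases, out_weights, out_bias)
print(f"network output y = {y:.3f}")
```

Adding rows to hidden_weights (with matching entries in hidden_biases and out_weights) increases the number of hidden nodes and hence the flexibility of the decision boundary.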

Overtraining

[Figure: decision boundary shown on the training sample and on an independent test sample]

If decision boundary is too flexible it will conform too closely to the training points → overtraining.

Monitor by applying classifier to independent test sample.

Monitoring overtraining

We can monitor the misclassification rate (or value of the error function) as a function of some parameter related to the level of flexibility of the decision boundary, such as the number of nodes in the hidden layer.

For the data sample used to train the network, the error rate continues to decrease, but for an independent validation sample, it will level off and even increase.

[Figure: error rate versus number of nodes, for the training sample and the validation sample]
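The stopping rule described here (increase the number of nodes until the validation error no longer decreases) can be sketched as follows. The error rates are invented for illustration only, and the helper name choose_n_nodes is hypothetical.

```python
def choose_n_nodes(validation_errors, tolerance=0.0):
    """Return the number of hidden nodes at which the validation error
    rate stops decreasing.

    validation_errors maps number of hidden nodes -> error rate on the
    validation sample; sizes are scanned in increasing order.
    """
    sizes = sorted(validation_errors)
    best = sizes[0]
    for n in sizes[1:]:
        if validation_errors[n] < validation_errors[best] - tolerance:
            best = n   # still improving: accept the larger network
        else:
            break      # no further decrease: stop here
    return best

# Hypothetical error rates, invented for illustration only.
training_errors = {2: 0.30, 4: 0.22, 6: 0.17, 8: 0.13, 10: 0.10, 12: 0.08}
validation_errors = {2: 0.31, 4: 0.24, 6: 0.20, 8: 0.19, 10: 0.21, 12: 0.24}

for n in sorted(training_errors):
    print(f"{n:2d} nodes: training {training_errors[n]:.2f}, "
          f"validation {validation_errors[n]:.2f}")

n_best = choose_n_nodes(validation_errors)
print(f"chosen number of hidden nodes: {n_best}")  # prints 8
```

The training error keeps falling as nodes are added, while the validation error turns upward after 8 nodes; a nonzero tolerance guards against stopping or continuing on statistical noise.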

Summary

Information from many variables can be used to distinguish between event types.

    Try to exploit as much information as possible.
    Try to keep method as simple as possible.
    Often start with: cuts, linear classifiers
    And then try less simple methods: neural networks

Tomorrow we will see some more multivariate classifiers:
    Probability density estimation methods
    Boosted Decision Trees
    Support Vector Machines