1 a general introduction to tomography & link delay inference with em algorithm presented by...

54
1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

Post on 20-Dec-2015

225 views

Category:

Documents


4 download

TRANSCRIPT

Page 1: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

1

A General Introduction to Tomography & Link Delay

Inference with EM Algorithm

Presented by Joe, Wenjie Jiang

21/02/2004

Page 2: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

2

Outline of Talk

Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 3: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

3

Terminology “Tomography”

Brain Tomography

Access is difficult!

Network Tomography

Access is difficult!

Vardi 1996

Page 4: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

4

Why tomography?

What is the: Bandwidth? Loss rate? Link Delay? Traffic demands? Connectivity of links

in the network? (Topology Inference)

Path: a connection between two end nodes, each consisting of several links.

Link: a direct connection with no intermediate routes/hosts.

Page 5: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

5

Motivation

Identify congestion points and performance bottlenecks

Dynamic routing Optimized service providing Security: detection of

anomalous/malicious behavior Capacity planning

Page 6: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

6

Why tomography - Difficulty

Decentralized, heterogeneous and unregulated nature of the internal network.

No incentive for individuals to collect and distribute these info freely.

Collecting all statistics impose an impracticable overhead expense

ISP regards the statistics highly confidential Relaying measurements to decision-making

point consumes bandwidth.

Page 7: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

7

Why tomography - Solution

Widespread internal network monitoring is expensive and infeasible

Edge-based measurement and statistical analysis is practical and scalable

Page 8: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

8

Brain Tomography

Page 9: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

9

Network Tomography

Page 10: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

10

Where are you?

Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 11: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

11

Introduction to tomography Use a limited number of measurements to

infer network (link) performance parameters, using:-- Maximum Likelihood Estimator -- Estimation Maximization-- Bayesian Inference

and assuming a prior model. Categories of problems:

-- Link level parameter estimation-- Sender-Receiver traffic intensity.-- Topology Inference

Page 12: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

12

Introduction to tomography (2) Two forms of network tomography:

-- link-level metric estimation based on end-to-end, traffic measurements (counts of sent/received packets, time delays between sent/received packets)-- path level (sender-receiver path) traffic intensity estimation based on link-level measurements (counts of packets through nodes)

Passive or Active measurements? Multicast or Unicast?

Page 13: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

13

Problem Description

To solve the linear system:

A, ө and εhave special structures. Goal: to maximize the likelihood function

Page 14: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

14

Problem Description (2)

A = routing matrix (graph) ө = packet queuing delays

for each link y = packet delays measu

red at the edge ε= noise, inherent rando

mness in traffic measurements

Statistical likelihood function

Page 15: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

15

Problem Description (3)

An virtual multicast tree with four receivers

l1

l2 l3

l4 l5 l6 l7

l1 l2 l3 l4 l5 l6 l7

Y1 Y2 Y3 Y4

Y1=X1+X2+X4

Page 16: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

16

Where are you?

Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link del

ay using EM algorithm Conclusion

Page 17: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

17

Physical Topology

Measure end-to-end (from sender to receiver) delays

Page 18: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

18

Logical Topology

Logical topology is formed by considering only the branching points in the physical topology

Infer the logical link-level queuing delay distributions!

Page 19: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

19

The basic idea of internal link delay tomography

Send a back-to-back packet pair from a sender, each packet heading to a different receiver

Use the fact that delays are highly correlated on shared links

Queuing delay difference between these two end can be attributed to the unshared links

Page 20: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

20

Delay Estimation

Measure end-to-end delay of packet pairs

Packets experience the same delay on link1

d2=dmin=0 d3>0 Extra delay on link 3!

Page 21: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

21

Packet-pair measurements

)()2( nx

)()2( ny )()1( ny

Key Assumptions

• Fixed known routes

• Temporal independence

• Spatial independence

• Packet-pair delays are identical on share links.

N delay measurements in all

Page 22: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

22

Parameters

α1

α2α3

α4 α5

α6α7

α8α9

αi = parameter of delay pmf on link i

Page 23: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

23

Link delay model• αi = delay pmf on link i

• Link delay model could be multinomial

• quantized delay model: delay= {0, 1, 2, 3,…,L,∞}

• αi= {αi0,αi1,αi2,...,αiL,αi ∞ }

• αij=P{ delay(link i) = j }

• αi0+αi1+αi2,...,αiL+αi ∞=10

0. 020. 040. 060. 080. 10. 120. 140. 160. 180. 2

0 1 2 3 4 … … L

probabi l i ty

Page 24: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

24

Goal

);( YL

N

n

nypYL1

)( );();(

);( )( nyp is the probability of the event of n-th measurement

is the probability of the event of all measurements

Our goal: find );(maxarg

YL

Page 25: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

25

Where are you?

Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 26: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

26

Review of MLE (Maximum Likelihood Estimation)

Page 27: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

27

Review of MLE (Maximum Likelihood Estimation)

The basic idea of MLE: God always let the event with the biggest probability happen the most likely -- The MLE of ө is to make the sample occur the most likely

Note we assume X={x1,…xN} to be i.i.d The solution could be easy or hard depending o

n the form of p(ө|X) e.g. p(ө|X) is a single Gaussian ө=(μ, σ2), we ca

n set the derivative of logL(ө|X) to zero and solve it directly.

Page 28: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

28

Complete Data

The sample X={x1,…xN} together with the missing (or latent) data Y is called complete data.

The complete likelihood is

where p(x, y|ө) is the joint density of X and Y given the parameter ө.

The complete log-likelihood is

Page 29: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

29

Complete MLE

By the definition of conditional density,

where p(y|x,ө) is the conditional density of Y given X=x and ө

The complete MLE

Page 30: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

30

Basic idea of EM

Given X=x and ө= өt-1, where өt-1 is the current estimates the unknown parameters

log p(x,Y| ө) is a function of Y whose unique best Me

an Squared Error (MSE) predicator is

Page 31: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

31

EM steps

Page 32: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

32

The magic of EM

the direct MLE of is relatively hard to solve

But the MLE of complete log-likelihood is relatively easier to obtain

since is a function of x and y, (y is hidden), we use the expectation of y under x and

So E-step

M-step

Page 33: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

33

Where are you?

Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link del

ay using EM algorithm Conclusion

Page 34: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

34

EM in link delay inference

α1

α2α3

α4 α5

α6α7

α8α9

x1

x2 x3

x4 x5x6 x7

x8

x9

Note that here notation x and y have opposite meaning of x, y stated in previous EM algorithm

Page 35: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

35

EM in link delay inference (2)

Complete data Z=(X,Y) the complete data log-likelihood:

Pα[Y|X] has nothing to do with α

mi,j is the total number of packets experience a delay j on link i over N measurements.

][log]|[log][]|[log],[log);,( XPXYPXPXYPYXPYXL

Liii mi

mi

mii

M

ii

LXPXPXPXP

XPXPYXL

,1,0, ][]1[]0[][

][log][log);,(1

Page 36: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

36

EM in link delay inference (3)

Liii

Liii

mLi

mi

mi

mi

mi

mii LXPXPXPXP

,1,0,

,1,0,

,1,0,

][]1[]0[][

The MLE of αwould be

L

jji

jiji

m

m

1,

,,

Page 37: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

37

EM in link delay inference (4)

nmn

mmmp 210

210

n

ii mmm

m

21

MLE

which is the frequency of event mi1

ii

A simple example is that we toss a die, P( the result i)=αi

(i=1,2…6) mi= how many times we see result i

Page 38: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

38

EM in link delay inference (5)

We notice that is similar to

only different that should be replaced by

So the MLE

);,( YXL

jim ,

],|[ )1(,,

ijiji YmEm

L

jji

jiji

m

m

1,

,,

Page 39: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

39

EM in link delay inference (6)

],|)([

],|1[],|1[

1

],|[

)1(1

)1()(11

)1()(,

1 )(,

)1(,,

iNn

ijidelay

Nn

Nn

ijidelayji

Nn jidelayji

ijiji

YjidelayP

YEYEm

m

YmEm

Probability Propagation

Page 40: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

40

A simple example

delay on each link fall into {0,1,2,3}

y1 y2

x1

x2 x3

0

1

2 3

}4

1,

4

1,

4

1,

4

1{},,,{

}4

1,

4

1,

4

1,

4

1{},,,{

}4

1,

4

1,

4

1,

4

1{},,,{

333231303

232221202

131211101

αij=P{ delay (link i) = j }

Page 41: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

41

A simple example (2)Suppose there are 5 measurements:

{ (3,2), (4,2), (6,5), (0,0), (4,1)}

y1 y2

x1

x2 x3

0

1

2 3

)](),(|0[

],|)([

2115

10,1

)1(1,

)0( nynyxPm

YjidelayPm

n

iNnji

Page 42: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

42

A simple example (3)

y1 y2

x1

x2 x3

0

1

2 3

4

1

4

1

4

1

]0[]2[]3[

]0[]2,3[

]0[]0|2,3[

][]|2,3[

]0[]0|2,3[

]2,3|0[)]1(),1(|0[

132

132

1121

3

01121

1121

211211 )0()0(

xPxPxP

xPxxP

xPxyyP

jxPjxyyP

xPxyyP

yyxPyyxP

j

Bayes Formula

Page 43: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

43

A simple example (4)

y1 y2

x1

x2 x3

0

1

2 3

3

1

64/164/164/1

64/1]2,3|0[

04

10

4

1

]3[]3|2,3[4

1

4

1

4

1

]2[]2|2,3[4

1

4

1

4

1

]1[]1|2,3[

211

1121

1121

1121

yyxP

xPxyyP

xPxyyP

xPxyyP

Page 44: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

44

A simple example (5)

y1 y2

x1

x2 x3

0

1

2 3

3

40100

3

1

0]1,4|0[

1]0,0|0[

0]5,6|0[

0]2,4|0[3

1]2,3|0[

0,1

211

211

211

211

211

m

yyxP

yyxP

yyxP

yyxP

yyxPsimilarly:

Page 45: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

45

A simple example (6)

j

i0 1 2 3

1 4/3 11/6 5/6 1

2 1 1/3 5/6 17/6

3 17/6 5/6 4/3 0

jim ,

mi,j computed in the first iteration.

Page 46: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

46

A simple example (7)

15

4

16/56/113/4

3/4

3,12,11,10,1

0,10,1

1,

,,

mmmm

m

m

mL

jji

jiji

the physical meaning of α1,0 is that: the number of packets that experience delay 0 on link i divided by the total number of packets that travel through link i

Page 47: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

47

A simple example (8)

j

i0 1 2 3

1 4/15 11/30 1/6 1/5

2 1/5 1/15 1/6 17/30

3 17/30 1/6 4/15 0

ji ,

αi,j computed in the first iteration

Page 48: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

48

A simple example (9)

j

i0 1 2 3

1 0.4 0.4 0 0.2

2 0.2 0 0 0.8

3 0.4 0.2 0.4 0

ji ,

Iteration: iterate E-step and M-step, until some termination criteria is satisfied!

After 6 iterations, αi,j converges to a fixed value.

Page 49: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

49

A simple example (9)

{ (3,2), (4,2), (6,5), (0,0), (4,1)}

y1 y2

x1

x2 x3

0

1

2 3

0

0. 1

0. 2

0. 3

0. 4

0. 5

0. 6

0. 7

0. 8

0 1 2 3

l i nk1l i nk2l i nk3

Page 50: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

50

Complexity

Page 51: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

51

Where are you?

Why tomography? Introduction to tomography Internal Link Delay Inference Basic EM A simple example to infer internal link

delay using EM algorithm Conclusion

Page 52: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

52

Conclusion

+

The field is just emerging. Deploying measurement/probing schemes and inference

algorithms in larger networks is the next key step.

Page 53: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

53

Problems

The spatial-temporally stationary and independent traffic model has limitations, especially in heavily loaded networks.

A trend for highly uncooperative environment for active probing – passive traffic monitoring techniques, for example based on sampling TCP traffic streams

Page 54: 1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004

54

Thank you!

The End