at&t labs - research internet measurement conference 2003 27-29 of october, 2003 miami, florida,...
TRANSCRIPT
![Page 1: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/1.jpg)
AT&T Labs - Research
Internet Measurement Conference 2003
27-29 Of October, 2003
Miami, Florida, USA
http://www.icir.org/vern/imc-2003/
Date for student travel grant applications: Sept 5th
![Page 2: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/2.jpg)
AT&T Labs - Research
An Information-Theoretic Approach to Traffic Matrix
Estimation
Yin Zhang, Matthew Roughan, Carsten Lund – AT&T ResearchDavid Donoho – Stanford
Shannon LabShannon Lab
![Page 3: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/3.jpg)
AT&T Labs - Research
Want to know demands from source to destination
Problem
Have link traffic measurements
A
B
C
...
...
...,, CABA xx
TM
![Page 4: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/4.jpg)
AT&T Labs - Research
Example App: reliability analysis
Under a link failure, routes changewant to find an traffic invariant
A
B
C
![Page 5: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/5.jpg)
AT&T Labs - Research
Approach
Principle *“Don’t try to estimate something
if you don’t have any information about it”
Maximum Entropy Entropy is a measure of uncertainty
More information = less entropy To include measurements, maximize entropy subject to
the constraints imposed by the data Impose the fewest assumptions on the results
Instantiation: Maximize “relative entropy” Minimum Mutual Information
![Page 6: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/6.jpg)
AT&T Labs - Research
Mathematical Formalism
Only measure traffic at links
1
3
2router
link 1
link 2
link 3
Traffic y1
![Page 7: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/7.jpg)
AT&T Labs - Research
Mathematical Formalism
1
3
2router
route 2
route 1
route 3
Problem: Estimate traffic matrix (x’s) from the link measurements (y’s)
Traffic y1
Traffic matrix element x1
![Page 8: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/8.jpg)
AT&T Labs - Research
Mathematical Formalism
1
3
2router
route 2
route 1
route 3
311 xxy
Problem: Estimate traffic matrix (x’s) from the link measurements (y’s)
![Page 9: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/9.jpg)
AT&T Labs - Research
Mathematical Formalism
1
3
2router
route 2
route 1
route 3
311 xxy
Problem: Estimate traffic matrix (x’s) from the link measurements (y’s)
3
2
1
3
2
1
110
011
101
x
x
x
y
y
y
![Page 10: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/10.jpg)
AT&T Labs - Research
Mathematical Formalism
1
3
2router
route 2
route 1
route 3
311 xxy
For non-trivial networkUNDERCONSTRAINED
y = Ax
Routing matrix
![Page 11: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/11.jpg)
AT&T Labs - Research
Regularization
Want a solution that satisfies constraints: y = Ax Many more unknowns than measurement: O(N2) vs O(N) Underconstrained system Many solutions satisfy the equations Must somehow choose the “best” solution
Such (ill-posed linear inverse) problems occur in Medical imaging: e.g CAT scans Seismology Astronomy
Statistical intuition => Regularization Penalty function J(x) solution:
xJAxyx
22minarg
![Page 12: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/12.jpg)
AT&T Labs - Research
How does this relate to other methods?
Previous methods are just particular cases of J(x)
Tomogravity (Zhang, Roughan, Greenberg and Duffield) J(x) is a weighted quadratic distance from a gravity model
A very natural alternative Start from a penalty function that satisfies the
“maximum entropy” principle Minimum Mutual Information
![Page 13: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/13.jpg)
AT&T Labs - Research
Minimum Mutual Information (MMI)
Mutual Information I(S,D) Information gained about Source from Destination I(S,D) = -relative entropy with respect to independent S
and DI(S,D) = 0S and D are independentp(D|S) = p(D)gravity model
Natural application of principle * Assume independence in the absence of other information Aggregates have similar behavior to network overall
When we get additional information (e.g. y = Ax) Maximize entropy Minimize I(S,D) (subject to
constraints) J(x) = I(S,D)
equivalent
![Page 14: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/14.jpg)
AT&T Labs - Research
MMI in practice
In general there aren’t enough constraints Constraints give a subspace of possible solutions
y = Ax
![Page 15: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/15.jpg)
AT&T Labs - Research
MMI in practice
Independence gives us a starting point
y = Ax
independent solution
![Page 16: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/16.jpg)
AT&T Labs - Research
MMI in practice
Find a solution which Satisfies the constraint Is closest to the independent solution
solution
Distance measure is the Kullback-Lieber divergence
![Page 17: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/17.jpg)
AT&T Labs - Research
Is that it?
Not quite that simple Need to do some networking specific things e.g. conditional independence to model hot-potato
routing
Can be solved using standard optimization toolkits Taking advantage of sparseness of routing matrix A
Back to tomogravity Conditional independence = generalized gravity model Quadratic distance function is a first order
approximation to the Kullback-Leibler divergence Tomogravity is a first-order approximation to MMI
![Page 18: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/18.jpg)
AT&T Labs - Research
Results – Single example
±20% bounds for larger flows Average error ~11% Fast (< 5 seconds) Scales:
O(100) nodes
![Page 19: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/19.jpg)
AT&T Labs - Research
More results
tomogravitymethod
simpleapproximation
>80% of demands have <20% error
Large errors are in small flows
![Page 20: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/20.jpg)
AT&T Labs - Research
Other experiments
Sensitivity Very insensitive to lambda Simple approximations work well
Robustness Missing data Erroneous link data Erroneous routing data
Dependence on network topology Via Rocketfuel network topologies
Additional information Netflow Local traffic matrices
![Page 21: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/21.jpg)
AT&T Labs - Research
Dependence on Topology
0
5
10
15
20
25
30
0 1 2 3 4 5 6 7 8 9 10 11unknowns per measurement
rela
tive
err
ors
(%)
randomgeographicLinear (geographic)
clique
star (20 nodes)
![Page 22: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/22.jpg)
AT&T Labs - Research
Additional information – Netflow
![Page 23: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/23.jpg)
AT&T Labs - Research
Local traffic matrix (George Varghese)
for referenceprevious case
0%1%5%10%
![Page 24: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/24.jpg)
AT&T Labs - Research
Conclusion
We have a good estimation method Robust, fast, and scales to required size Accuracy depends on ratio of unknowns to
measurements Derived from principle
Approach gives some insight into other methods Why they work – regularization Should provide better idea of the way forward
Additional insights about the network and traffic Traffic and network are connected
Implemented Used in AT&T’s NA backbone Accurate enough in practice
![Page 25: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/25.jpg)
AT&T Labs - Research
Results
Methodology Use netflow based partial (~80%) traffic matrix Simulate SNMP measurements using routing sim, and
y = Ax Compare estimates, and true traffic matrix
Advantage Realistic network, routing, and traffic Comparison is direct, we know errors are due to
algorithm not errors in the data Can do controlled experiments (e.g. introduce known
errors)
Data One hour traffic matrices (don’t need fine grained data) 506 data sets, comprising the majority of June 2002 Includes all times of day, and days of week
![Page 26: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/26.jpg)
AT&T Labs - Research
Robustness (input errors)
![Page 27: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/27.jpg)
AT&T Labs - Research
Robustness (missing data)
![Page 28: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/28.jpg)
AT&T Labs - Research
Point-to-multipoint
We don’t see whole Internet – What if an edge link fails?Point-to-point traffic matrix isn’t invariant
![Page 29: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/29.jpg)
AT&T Labs - Research
Point-to-multipoint
Included in this approach Implicit in results above Explicit results worse
Ambiguity in demands in increased
More demands use exactly the same sets of routes
use in applications is better
Point-to-point Point-to-multipoint
Link failure analysis
![Page 30: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/30.jpg)
AT&T Labs - Research
Independent model
![Page 31: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/31.jpg)
AT&T Labs - Research
Conditional independence
Internet routing is asymmetric A provider can control exit points for traffic going
to peer networks
peer links
access links
![Page 32: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/32.jpg)
AT&T Labs - Research
Conditional independence
peer links
access links
Internet routing is asymmetric A provider can control exit points for traffic going
to peer networks Have much less control of where traffic enters
![Page 33: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/33.jpg)
AT&T Labs - Research
Conditional independence
![Page 34: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/34.jpg)
AT&T Labs - Research
Minimum Mutual Information (MMI)
Mutual Information I(S,D)=0 Information gained about S from D
I(S,D) = relative entropy with respect to independence Can also be given by Kullback-Leibler information
divergence
Why this model In the absence of information, let’s assume no
information Minimal assumption about the traffic Large aggregates tend to behave like overall network?
ds dDpsSp
dDsSpdDsSpDSI
, )()(
),(log),(),(
![Page 35: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/35.jpg)
AT&T Labs - Research
Dependence on Topology
Unknowns per Relative
Errors (%)
NetworkPoPs Links
measurement
Geographic Random
Exodus* 17 58 4.69 12.6 20.0
Sprint* 19 100 3.42 8.0 18.9
Abovenet*
11 48 2.29 3.8 11.7
Star N 2(N-1) N/2=10 24.0 24.0
Clique N N(N-1) 1 0.2 0.2
AT&T - - 3.54-3.97 10.6* These are not the actual networks, but only estimates made by Rocketfuel
![Page 36: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/36.jpg)
AT&T Labs - Research
Bayesian (e.g. Tebaldi and West) J(x) = -log(x), where (x) is the prior model
MLE (e.g. Vardi, Cao et al, …) In their thinking the prior model generates extra
constraints Equally, can be modeled as a (complicated) penalty
function• Uses deviations from higher order moments predicted by
model
![Page 37: AT&T Labs - Research Internet Measurement Conference 2003 27-29 Of October, 2003 Miami, Florida, USA Date for student](https://reader036.vdocuments.mx/reader036/viewer/2022062801/56649e7b5503460f94b7d372/html5/thumbnails/37.jpg)
AT&T Labs - Research
Acknowledgements
Local traffic matrix measurements George Varghese
PDSCO optimization toolkit for Matlab Michael Saunders
Data collection Fred True, Joel Gottlieb
Tomogravity Albert Greenberg and Nick Duffield