measurement-based inter-domain traffic engineering · • a. akella, b. maggs, s. seshan, and a....

36
Measurement-based Inter-domain Traffic Engineering W. Shao, J.L. Rougier, L. Iannone Telecom ParisTech (not an operator) member of University of Paris-Saclay F. Devienne, M. Viste Border 6 NANOG 67 June 13-15, 2016

Upload: others

Post on 22-Jul-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Measurement-based Inter-domain

Traffic EngineeringW. Shao, J.L. Rougier, L. Iannone

Telecom ParisTech (not an operator) member of University of Paris-Saclay

F. Devienne, M. Viste Border 6

NANOG 67 June 13-15, 2016

Page 2: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Outline

• Measurement-based inter-domain traffic engineering

• A problem of scalability

Page 3: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

1. Local_Pref 2. AS_Path 3. MED 4. eBGP > iBGP 5. tie-breaking

1. IGP costs 2. oldest path 3. etc

3

A scenario: Out-bound TE for multi-homed stub AS

Remote Networks

Internet

Remote Networks

Transit N

Remote Network

Transit 2Transit 1

BGP router jBGP router i

Local Network

Page 4: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

• N. Feamster, D. G. Andersen, H. Balakrishnan, and M. F. Kaashoek, “Measuring the effects of internet path faults on reactive routing,” ACM SIGMETRICS, vol. 31, no. 1, p. 126, 2003.

• A. Akella, B. Maggs, S. Seshan, A. Shaikh, and R. Sitaraman, “A measurement-based analysis of multihoming,” SIGCOMM, 2003.

• D. K. Goldenberg, L. Qiu, H. Xie, Y. R. Yang, and Y. Zhang, “Optimizing cost and performance for multihoming,” CCR, vol. 34, no. 4, p. 79, Oct. 2004.

• A. Akella, B. Maggs, S. Seshan, and A. Shaikh, “On the Performance Benefits of Multihoming Route Control,” IEEE/ACM Trans. Netw., vol. 16, no. 1, pp. 91–104, Feb. 2008.

4

An old scenario

Page 5: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

5

Remote Networks

Internet

Prefix selection

Traffic Statistics Collection Route DecisionPath Performance

Measurement

Remote Networks

Transit N

Remote Network

Transit 2Transit 1

BGP router jBGP router i

Internet traffic

Path measurement

Traffic sampling

Internal exchange

Configuration

Legend

Local Network

Page 6: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

6

Traffic statistics collection

Purpose: to select a set of ‘managed prefix’, i.e. destinations of importance;

Collector: netflow/sflow collector, PMACCT;

Storage: RAM, PostgreSQL.

Page 7: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Active measurementsTarget: probes discovered in ‘managed’ prefixes

Method: TCP SYN -> RTT and loss;

Path: via all available BGP next-hops;

Steering: source-based routing, SDN, etc;

Storage: RAM, PostgreSQL.

7

Page 8: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Route decision

8

Objective: performance, availability & transit cost;

Metrics: RTT, loss and BW w.r.t. CDR;

Algorithm: depends on user’s needs;

Steering: BGP as SDN southbound interface.

Page 9: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Algo example—CostO

utgo

ing

BW

Total Transit1 Transit2 Transit3

9

Transit1 has the lowest BW price.

horizontal lines 95th percentile

Cost optimisation activated

Transit1 Transit2Transit3

Page 10: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Algo example—Availability

Transit 1Transit 2BGP

Packet loss due to consistent congestion can be avoided

by simply change a BGP next-hop.

day 1 day 2 day 3

10

Loss

rate

10 %

0 %

Page 11: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Algo example—smaller RTT

11

Transit 1 Transit 2 BGP

RTT optimisation activated

RTT

Page 12: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

For Inbound as well…

12

Route Preference Protocol (RPP) for Inbound TE

http://rpp.border6.com/https://github.com/border6/rpp

1. Tell traffic sourcing AS what is you favourite ingress point; 2. Traffic sourcing AS then does its best.

source destination

Page 13: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

the scalability problem ~600K BGP prefixes

How to select prefixes of importance?

13

W. Shao, L. Ianonne, J.L. Rougier, F. Devienne, and M. Viste, “Scalable BGP Prefix Selection for Effective Inter-domain Traffic Engineering,” IEEE/IFIP NOMS, 2016.

Page 14: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

100 101 102 103 104 105

Number of top prefix

10�9

10�8

10�7

10�6

10�5

10�4

10�3

10�2

10�1

100

101

102

Traf

ficVo

lum

e

SASBSCSDSE

SFSGSHSIZipf ref.

A large portion traffic is concentrated on a small number of prefixes

(log-log graph not CDF)

14Prefix volume ranking

Traf

fic v

olum

e pe

rcen

tage

Page 15: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

However… traffic value associated to each prefix evolves over time.

100 101 102 103 104 105

Number of top prefix

0

20

40

60

80

100

Traf

ficVo

lum

e

2AM10AM

June 3rdWeek

15

Traf

fic v

olum

e pe

rcen

tage

Page 16: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

However… we have some ten thousands of time-series/prefixes to predict.

16

Page 17: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Predictively select BGP destination prefixes that stand for a large portion of traffic, with a simple and efficient method.

17

Wisely save sources for measurement and optimization.

Page 18: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Save resources of data plane as well.

18

full BGP routing table

one optimized route for each selected prefix

+ one default route for the rest

expensive edge router cheaper DC switch/ white-box SDN switch

Page 19: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

FIB caching/prediction

• W. Zhang, J. Bi, J. Wu, and B. Zhang, “Catching popular prefixes at AS border routers with a prediction based method,” Comput. Networks, vol. 56, no. 4, pp. 1486–1502, Mar. 2012. • GM(1,1) outperforms LFU, LRU

David Barroso, Spotify Building an extensible SDN Internet Router with commodity hardware https://youtu.be/o1njanXhQqM

19

Page 20: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Difference from FIB caching/predictionBasically, a matter of time scales

20

FIB caching, 5min interval or less, memory of hours; Prefix selection, 1 hour interval, memory of days till weeks.

Why?Not only data-plane is involved. Probes discovery, probe selection, long term pattern and bursty-traffic, prefix churn, etc.

And more than that….

Page 21: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Traffic dynamism

What is case for traffic aggregated by BGP prefix over longer interval?

21

“… shows that there is no clear correlation between the mean and the coefficient of variation of the bandwidth of a network prefix flow.”

K. Papagiannaki, N. Taft, and C. Diot, “Impact of Flow Dynamics on Traffic Engineering Design Principles,” INFOCOM, 2004.

Page 22: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

100.010.010.10.010.0010.0001Traffic Volume % Range

0

2

4

6

8

10

12

14

c vof

hour

volu

me

SASBSCSDSESFSGSHSI

Volume importance vs predictability Prefix volume share — Cv

Cv = std/mean

22

pred

icta

ble

prefix of big volume shareprefix of small volume share

unst

able

Page 23: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

A solution

as simple as Moving Average

23

Page 24: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Volume coverage of MV compared to Grey Model GM(1,1)

24

good coverage

Poor coverage

long memoryshort memory

1 12 24 168Historical length (hour)

0

20

40

60

80

100

Traf

ficVo

lum

e%

MV Grey

Page 25: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Summary

• An old scenario with many remaining challenges;

• A possible approach realizing it.

25

Page 26: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Many other challenges…

26

Page 27: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Destination network

Local network

Same AS Path

Measure a same AS path with multiple probes

Are these measurements different? How?

Where does the difference take place?

Which probe should we use?

W. Shao, J.L. Rougier, F. Devienne, and M. Viste, “Improve RTT Measurement Quality via Clustering in Inter-Domain TE ,” IEEE/IFIP AnNet, 2016.

27

Page 28: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

28

end2end RTT Group 1

Page 29: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

29

end2end RTT Group 2

Page 30: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

30

To measure is to see. To see is to understand.

Understanding allows automation.

Page 31: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

31

Q&A

Page 32: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Appendix Not all references are given.

The listed ones could be a good starting point.

https://github.com/WenqinSHAO/NANOG67_prez.git

32

Page 33: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Appendix-I

• N. Feamster, D. G. Andersen, H. Balakrishnan, and M. F. Kaashoek, “Measuring the effects of internet path faults on reactive routing,” ACM SIGMETRICS Perform. Eval. Rev., vol. 31, no. 1, p. 126, 2003.

• A. Akella, B. Maggs, S. Seshan, A. Shaikh, and R. Sitaraman, “A measurement-based analysis of multihoming,” SIGCOMM, 2003.

• D. K. Goldenberg, L. Qiu, H. Xie, Y. R. Yang, and Y. Zhang, “Optimizing cost and performance for multihoming,” CCR, vol. 34, no. 4, p. 79, Oct. 2004.

• A. Akella, B. Maggs, S. Seshan, and A. Shaikh, “On the Performance Benefits of Multihoming Route Control,” IEEE/ACM Trans. Netw., vol. 16, no. 1, pp. 91–104, Feb. 2008.

• W. Shao, L. Ianonne, J.L. Rougier, F. Devienne, and M. Viste, “Scalable BGP Prefix Selection for Effective Inter-domain Traffic Engineering,” IEEE/IFIP NOMS, 2016.

Measurement-based Inter-domain TE

33

Page 34: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Appendix-II

• ARMA family, e.g. ARIMA, SARIMA, FARIMA, O(L^2), pre-process needed for each individual trace, L being historical record length.

• Artificial Neural Network family, e.g. TLFN, O(L*M), M for number of hidden nodes, usually bigger than L.

• Wavelet,O(L), pre-process needed, less accurate than FARIMA and ANN. H. Feng and Y. Shu, “Study on network traffic prediction techniques,” Proceedings. Int. Conf. Wirel. Commun. Netw. Mob. Comput., vol. 2, no. 3, pp. 995-998, 2005.

• Grey model GM(1,1) predicts the accumulated value of a time series, O(L). D. Julong, “Introduction to Grey System Theory,'' J. Grey Syst., vol. 1, pp. 1-24, 1989.

Traffic volume forecasting

34

Page 35: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

Appendix-III

• K. Papagiannaki, N. Taft, and C. Diot, “Impact of Flow Dynamics on Traffic Engineering Design Principles,” INFOCOM, 2004.

• no clear correlation between throughput and its stability.

• J. J. Wallerich and A. Feldmann, “Capturing the variability of internet flows across time,” INFOCOM, 2006.

• throughput ranking of flows can change drastically over time.

• W. Zhang, J. Bi, J. Wu, and B. Zhang, “Catching popular prefixes at AS border routers with a prediction based method,” Comput. Networks, vol. 56, no. 4, pp. 1486–1502, Mar. 2012.

• assumed positive correlation between popularity and stability.

35

Traffic dynamism

Page 36: Measurement-based Inter-domain Traffic Engineering · • A. Akella, B. Maggs, S. Seshan, and A. Shaikh, ... c v of hour volume SA SB SC SD SE SF SG SH SI Volume importance vs predictability

• L. Iannone and O. Bonaventure, “On the cost of caching locator/ID mappings,” CoNEXT, 2007.

• C. Kim, M. Caesar, A. Gerber, and J. Rexford, “Revisiting route caching: The world should be flat,” PAM, 2009.

• H. Ballani, P. Francis, T. Cao, and J. Wang, “Making routers last longer with ViAggre,” NSDI, pp. 453–466, 2009.

• N. Sarrar, S. Uhlig, A. Feldmann, R. Sherwood, and X. Huang, “Leveraging Zipf’s law for traffic offloading,” CCR, vol. 42, no. 1, p. 16, Jan. 2012.

• W. Zhang, J. Bi, J. Wu, and B. Zhang, “Catching popular prefixes at AS border routers with a prediction based method,” Comput. Networks, vol. 56, no. 4, pp. 1486–1502, Mar. 2012.

• GM(1,1) outperforms LFU, LRU

Appendix-IV

36

FIB caching