cheetah (circuit-switched high-speed end-to-end transport architecture)
DESCRIPTION
CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture). Outline Research goals Topology & equipment Schedule & Status Extension of work. Malathi Veeraraghavan Univ. of Virginia [email protected]. Talk at the JET meeting, April 20, 2005. Team & Acknowledgment. - PowerPoint PPT PresentationTRANSCRIPT
![Page 1: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/1.jpg)
1
CHEETAH (Circuit-Switched High-speed
End-to-End Transport Architecture)
• Outline• Research goals• Topology & equipment• Schedule & Status• Extension of work
Malathi Veeraraghavan Univ. of Virginia
Talk at the JET meeting, April 20, 2005
![Page 2: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/2.jpg)
2
Team & Acknowledgment• Team (PI/Co-PIs):
– Malathi Veeraraghavan, Univ. of Virginia– Nagi Rao, Bill Wing, Tony Mezzacappa, ORNL– John Blondin, NCSU– Ibrahim Habib, CUNY
• UVA funding sources:– NSF EIN grant ANI-0335190 – NSF ITR small grant ANI-0312376– DOE FG02-04ER25640
![Page 3: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/3.jpg)
3
UVA team members• Postdoc: Xuan Zheng• Graduate students
– Xiuduan Fang– Zhanxiang Huang– Anant P. Mudambi– Xiangfei Zhu
![Page 4: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/4.jpg)
4
Research goals• Deploy a shared circuit-switched network to complement
connectionless IP service– establish, use & release, rate-guaranteed high-bandwidth
host-to-host circuits– support TSI (Terascale Supernova Initiative) applications
• large file transfers• remote visualization• remote computational steering
– leverage equipment that supports protocols for the creation of distributed (large-scale) shared circuit-switched networks
• Extend solution:– to a connection-oriented internet – to partial connections (impt.: setup triggered from end hosts)
![Page 5: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/5.jpg)
5
Primary research goal: Highly dynamic circuits(held for only for ms-to-sec/min for file transfers)(requests for circuits generated by file transfer
applications)
• Gateways available that can crossconnect a Gigabit Ethernet port to an equivalent-rate time-division or wavelength-division multiplexed signal dynamically
Control
GigabitEthernetinterfacecard
Time-divisionor wavelength-divisionmultiplexing optical interfacecard
Circuitbased
gateway
Circuitbased
gateway
Circuitbased
gateway
Circuitbased
gateway
Circuitbased
gateway
signaling engine: dynamic call setup/release
GigabitEthernetinterfacesto hosts
GigabitEthernetinterfacesto hosts
Setup connection(make reservation)
Release connection(release resources)
Transfer file
![Page 6: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/6.jpg)
6
Strategy• Leverage the presence of connectionless IP
services (host-to-host IP connectivity; DNS)• Use off-the-shelf circuit-based gateways
– that support GMPLS routing and signaling protocols for such dynamic circuit setup/release
• Implement cheetah software to run on end hosts
• Integrate with host applications that will create a large offered load for circuits
![Page 7: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/7.jpg)
7
Cheetah software on end hosts
TCP NIC I
NIC II
FRTPPrimary TCP/IP path
End-to-end CHEETAH circuit
GridFTP Web server
Remote viz.
Signalingclient
End-host CHEETAH software
Routing decision
DNS lookupApplications
DNS query(to check if far end
host is also on cheetah)Routing decision
to check whether to use the TCP/IP path or
a cheetah circuitSignaling clientto request a circuit
Fixed Rate Transport Protocol (FRTP)
designed for circuits
![Page 8: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/8.jpg)
8
Demo #1 (at SC2004): Web Application Web serverWeb client
• At the web server side– Hyperlink to file is a CGI script (download.cgi); filename embedded in hyperlink– Download.cgi is started automatically at server when user clicks hyperlink,
which triggers CHEETAH FT sender– CHEETAH FT sender initiates CHEETAH circuit setup by calling RSVP-TE client.
• At the web client side– A CHEETAH FT receiver is running as daemon to receive the user data
Web Browser(e.g. Mozilla)
Web Server (e.g. Apache)
download.cgi
Data transfer
URL
Response
RSVP-TEMessages CHEETAH FT
senderFRTP +RSVP-TE
TCP
CHEETAH FT receiver
FRTP +RSVP-TE TCP
Goal: Make usage of circuit-switched network seamless to user
![Page 9: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/9.jpg)
9
Tactics: Lower the crossover file size above which circuits make sense
• Use second NICs at hosts for circuit connectivity leaving primary NIC for Internet access
Connectionless Internet
End host I
End host II
Circuit-Switched Network
• Attempt circuit setup• If rejected, fall back to
using TCP/IP
Should we attempt a circuit setup for ALL file transfers?
Two paths available
Or is there a crossover file size below which we use the TCP/IP network and above which we attempt a circuit setup?
![Page 10: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/10.jpg)
10
Two metrics to decide the crossover file size
• Delay• Utilization
![Page 11: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/11.jpg)
11
Delay metric• Need measurements for both paths:
– TCP/IP path: packet loss rate, bottleneck link rate and RTT
– circuit: call-blocking prob. and setup delay• For most regions of operation,
– in wide-area scenarios• attempt a circuit setup even for very small files
– in local-area scenarios• if link rate is 100Mbps
– crossover file size: 100’s of KBs to ones of 1MBs• if link rate is 1Gbps
– crossover file size: 1s to 10s of MBs
![Page 12: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/12.jpg)
12
Utilization metric• Two opposing factors
– If the crossover file size is increased• per-circuit utilization increases
– transfer time should exceed call setup delay• traffic load decreases because fewer files
will exceed – aggregate utilization decreases
![Page 13: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/13.jpg)
13
Aggregate utilization ua
(simple ErlangB formula)
m
k
k
m
bb
ak
mPmPu
0!/
!/ where,)1(
: traffic loadm: number of circuitsPb: call blocking probability
m ua
For a 1% call blocking probability Pb = 0.01
24.8%58.2%84.6%
110100
417117
Assuming file size follows Pareto distribution– Define fractional offered load
1' ( ) [ ] | ] ( )( )
kP X E X XE X
(fraction of )40KB 81%330KB 71%80MB 51%
![Page 14: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/14.jpg)
14
In other words• To make the circuit-switched network a truly
shared network (“dynamic”)– many ongoing short-lived calls– ErlangB model works best with large loads and high m
• Why share?– a solution to offer rate-guaranteed connections w/o
intense sharing on a call-by-call basis will be expensive • GMPLS signaling implementations at the switches
– manage bandwidth of the switch interfaces by implementing a simple “complete sharing” model
• dole out bandwidth until you run out and then reject calls– works well when the number of circuits that can be
doled out per interface is large AND holding times are short
![Page 15: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/15.jpg)
15
Other apps besides file transfers for such shared circuit-switched
networks• Delay/jitter-sensitive applications
like high-quality video-telephony, Internet games
• lower-rate but longer-lived (still in minutes)– average telephone call– multiple cameras in a casual office/home setting
• chose SONET switches to get low granularity (MPLS is a fine choice too)
![Page 16: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/16.jpg)
16
Problems with this vision• Problem 1:
– Fear on the part of network operators to enable user-application triggered call setups
• Problem 2:– eScience apps, the driver for these
networks, pose clashing requirements
![Page 17: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/17.jpg)
17
Problem 1: the operator’s fear
• Of allowing end host applications to generate signaling requests for rate-guaranteed connections– Some strange host sends RSVP-TE messages and ties up
bandwidth• Have IPsec devices on control-plane ports• Enforce authentication and integrity of RSVP-TE messages
– An application could set up a circuit and hold forever• One answer: billing mechanism (commercial)• Research and education context:
– Write RSVP-TE client software at end hosts to limit holding time» similar to the concept of MTU» disallow holding circuits for very long durations» rejoin queue and compete for bandwidth
• Ideal: add holding time parameter to RSVP-TE and enforce at switches
![Page 18: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/18.jpg)
18
Problem 2: clashing eScience application needs
• Long-lived vs. short-lived calls– A scientist would like to hold a high-
bandwidth circuit for 2-3 hours• for remote visualization• a 1TB file move on an end-to-end 1Gbps
circuit takes ~2.3 hours
File transfers
Suitability for high-speed circuits:- higher the bandwidth, better the service- can keep the pipe full (bits from one disk to another)
eScience
GMPLS
![Page 19: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/19.jpg)
19
Limits sharing• For long-lived calls
– Cannot use complete-sharing bandwidth management engines
– Mean waiting time related to mean holding time• Example:
– The first 10 calls, each for 1Gbps, fill up a 10GbE link and everyone else waits
• Need software to support advance reservations
![Page 20: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/20.jpg)
20
Pushing us backwards into non-scalable networks (centralized
contol)• Current GMPLS standards/implementations do
not support bandwidth management into the future
• Hence, external schedulers being written to manage bandwidth into the future
• Negates goals of the triumvirate of GMPLS protocols– LMP to discover neighbors– OSPF-TE for routing– RSVP-TE for signaling
which is to create large-scale networks by having each switch have control-plane software to manage its own bandwidth and respond to requests for reservations
![Page 21: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/21.jpg)
21
Steps executed by a GMPLS signaling engine at a switch for call
setup
GMPLS-equipped
switch
Path message(requesting BW)
Route lookup
Check BW availability (CAC)
Select
timeslots//ports/labels
Program switch fabric
Path message(requesting BW)(to next switchon path)
These 4 functions are distributed:
i.e, they are executed at each switch
GMPLS-equipped
switch
![Page 22: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/22.jpg)
22
Takes us partially “backwards” towards a centralized solution
GMPLS-equipped
switch
GMPLS-equipped
switch
GMPLS-equipped
switch
SchedulerA new protocol (BW requested + time)
Route lookup/computatio
n
Check BW availability (CAC)(across multiple links)
Select timeslots//ports/
labels
Program switch fabric
Third-party Path message with ERO(just before scheduled time)
Answer
Path message
![Page 23: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/23.jpg)
23
Reconcile these differences
• Partitioning bandwidth managed by external scheduler vs. bandwidth managed by GMPLS on-switch engines– will lead to inefficiencies
• Important to really connect this main driver (eScience) with GMPLS scalable solution
• Extend GMPLS to manage bandwidth over a period into the future (like airline reservation systems)
![Page 24: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/24.jpg)
24
Status check
• Outline• Research goalsTopology & equipment• Schedule & status• Extension of work
![Page 25: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/25.jpg)
25
CHEETAHTopology & equipment (May
1)
Raleigh PoP(MCNC)
(Sycamore SN16000)
Atlanta PoP(SoX/SLR)
(Sycamore SN16000)
Ethernetswitch
Hosts5 GbEs
Enterprise networks
NCSU
Ethernetswitch
HostsGbE
OC192(NLR, SLR)
G. Tech
Ethernetswitch
Hosts
3 GbEs (near-term)OC192 or 10GbE (long-term)
ORNL
![Page 26: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/26.jpg)
26
CHEETAHTopology & equipment (year-
end)
Raleigh PoP(MCNC)
(Sycamore SN16000)Atlanta PoP(SoX/SLR)
(Sycamore SN16000)
5 GbEsNCSU
DukeGbE
GbE
OC192
G. Tech
OC192
Connectivity to USN
ORNLDragon
UVAGbE
CUNY
McLean PoP
HOPI andNYSERnet
![Page 27: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/27.jpg)
27
Status• SN16000 switches purchased and installed
(tested)• NCSU – MCNC and MCNC – SOX/SLR
connections up (will be lit May 1)• SOX/SLR – ORNL MPLS tunnels up• Cheetah software
– RSVP-TE client, FRTP, Cheetah FT sender/receiver ready
• Applications– Web integration done– Testing PVFS/GridFTP– Soon to test Ensight remote viz. tool with cheetah
![Page 28: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/28.jpg)
28
Need to extend CHEETAH concept and apps to connection-oriented internet
• As plainly evident in the costs of creating a pure cheetah network – used VLANs in one segment– used MPLS tunnels in another
• Large numbers of engineers, salespeople and network admins – should expect multiple connection-oriented network
solutions in the near-term• Packet-switched:
– MPLS – VLAN capability in Ethernet switches
• Circuit-switched:– WDM – SONET
![Page 29: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/29.jpg)
29
Enabling not building• ALREADY DEPLOYED; Just flick the switch!
– Let user apps. peel out MPLS tunnel bandwidth as needed and put back to IP path when done
• Protocol interworking– User (data) plane - DONE– Signaling protocols (for connection
setup/release) – Routing protocols (for reachability, topology,
loading data dissemination)
![Page 30: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/30.jpg)
30
Summary• Rich new set of research problems• Experimental challenges a plenty!• Real opportunity to deploy a
connection-oriented internetwork!• Web site:
http://cheetah.cs.virginia.edu
![Page 31: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/31.jpg)
31
Backup slides
![Page 32: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/32.jpg)
32
Mean TCP delays(using U. Mass model)
Cases Packet loss rate Bottleneck link rate Round- trip prop. delay Mean delay for a 1GB file (s)
Case 1 0.0001 100 Mbps 0.1ms 82.25
Case 2 5ms 89.45
Case 3 50ms 396.5
Case 4 0.0001 1Gbps 0.1ms 8.25
Case 5 5ms 39.6
Case 6 50ms 395.7
Case 7 0.001 100Mbps 0.1ms 82.93
Case 8 5ms 135.4
Case 9 50ms 1293
Case 10 0.001 1Gbps 0.1ms 8.64
Case 11 5ms 129.4
Case 12 50ms 1287
Case 13 0.01 100 Mbps 0.1ms 92.41
Case 14 5ms 471.7
Case 15 50ms 4417
Case 16 0.01 1Gbps 0.1ms 12.43
Case 17 5ms 441.7
Case 18 50ms 4387
Low impactof bottlenecklink ratein wide-areanetworks
Impactof packetloss rate
Impactof propagationdelay
![Page 33: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/33.jpg)
33
Should the application attempt a circuit setup or
not?• Mean delay if a circuit setup is attempted
])[][()][)(1(][ tcpfailbtransfersetupbcheetah TETEPTTEPTE
Pb: call blocking probability in the circuit-switched network
If circuit setup fails, fall back to Internet path
![Page 34: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/34.jpg)
34
Routing decision
setupcircuit attempt ][1
][ if
path TCP/IP theuse ][1
][ if
transfertcpb
setup
transfertcpb
setup
TTEP
TE
TTEP
TE
setupcircuit attempt ][][ if
path TCP/IP theuse ][][ if
tcpcheetah
tcpcheetah
TETE
TETE
![Page 35: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/35.jpg)
35
Numerical resultslink rate = 1Gbps
Tprop = 0.1ms Tprop = 50ms
![Page 36: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/36.jpg)
36
When rc = 100Mbps and Tprop = 0.1ms
Crossover file sizes
rc = 1Gbps, Tprop = 0.1ms
Measure of loading on
ckt. sw.network
TCP/IP path
Pb 0.01= Pb 0.1= Pb 0.3=
Ploss 0.0001= 22MB 24MB 30MB Ploss 0.001= 9MB 10MB 12MB Ploss 0.01= 1.2MB 1.4MB 1.8MB
When Tprop = 50ms, in most regions of operation
of the two networks, attempt a circuit
![Page 37: CHEETAH (Circuit-Switched High-speed End-to-End Transport Architecture)](https://reader035.vdocuments.mx/reader035/viewer/2022062501/56816331550346895dd3b484/html5/thumbnails/37.jpg)
37
Plot of utilization u withrc= 100Mbps, k=20
Pb=0.3
Pb=0.01