lhcopn perfsonar mdm status

39
Connect. Communicate. Collaborate LHCOPN perfSONAR MDM status Otto Kreiter, DANTE LHCOPN, Copenhagen, 17.10.2008

Upload: halee-rocha

Post on 30-Dec-2015

40 views

Category:

Documents


0 download

DESCRIPTION

LHCOPN perfSONAR MDM status. Otto Kreiter, DANTE LHCOPN, Copenhagen, 17.10.2008. Agenda. MoU status Deployment status Training programme Next steps. MoU status - I. Three MoU’s signed by all parties: SURFnet/SARA RAL DFN/DE-KIT Two MoUs partially signed: RENATER/IN2P3 TW-ASGC - PowerPoint PPT Presentation

TRANSCRIPT

Connect. Communicate. Collaborate

LHCOPN perfSONAR MDM status

Otto Kreiter, DANTE

LHCOPN, Copenhagen, 17.10.2008

Connect. Communicate. CollaborateAgenda

• MoU status• Deployment status• Training programme• Next steps

Connect. Communicate. CollaborateMoU status - I

• Three MoU’s signed by all parties:– SURFnet/SARA– RAL– DFN/DE-KIT

• Two MoUs partially signed:– RENATER/IN2P3– TW-ASGC

• Gentlemen's agreement:– TRIUMF

Connect. Communicate. CollaborateMoU status - II

• Waiting for last approvals:– CERN– RedIRIS / PIC

• In development:– GARR/ CNAF– NORDUNET / NDGF

• Gentlemen’s agreement expected after CERN signs MoU:– FERMI– BNL

Connect. Communicate. CollaborateDeployment status

• Only two servers will be deployed per site

• Deployment split in three phases (3-4 sites / phase)

• Deployment started for three Tier-1 sites (Phase – 1):1. SARA2. RAL3. DE-KIT

• The servers were ordered

• Deployment initiation meetings with the Tier-1 sites

Connect. Communicate. CollaborateDeployment Roll Out Steps

1. Site Survey & Purchase

2. Site Preparation

3. Boxes Delivery (Tier-1 Site + GN2 Partner)

4. HADES (Bee) Server Deployment

5. MDM (Sun) Server Deployment

Connect. Communicate. CollaborateRoll Out Schedule

1 2 3 4 5 6 7 8 9 10 11 12

Site Preparation

Boxes Delivery(GN2 Partner + Site)

13 14

GN2 Partner Configuration (HADES Box)

Shipment to Site

Physical Installation andConfiguration Validation

Preparation, Deployment andConfiguration (MDM Box)

weeks

ConfigurationValidation

Physical Installation

Site Survey &Purchase

Optional + 3 Weeks

Depends Site Order

Connect. Communicate. CollaborateperfSONAR Training (1)

• A one-day course.

• Will focus on the perfSONAR visualisation tools.

• Will provide examples specific to the LHCOPN.

• Will include hands-on exercises.

Connect. Communicate. CollaborateperfSONAR Training (2)

• Objectives:

– Understand the role of perfSONAR in network monitoring.

– Describe the perfSONAR visualisation tools, explaining the purpose of each.

– Use the perfSONAR visualisation tools to investigate network issues.

• Audience:

– Network administrators from the LHCOPN sites.

Connect. Communicate. CollaborateNext steps

• Finalise remaining MoUs/agreements

• Deployment stepped up at full speed

• Further visualisation and monitoring requirements gathering + enhancements (more later)

Connect. Communicate. Collaborate

perfSONAR Visualisation demo

Connect. Communicate. CollaborateMonitoring demo

• The alarming and visualisation tools presented are not necessarily in final shape

• Your help will be highly valued to shape them

• Context of a potential debugging process

• Some demos are live other are off-line

Connect. Communicate. CollaborateThe call !

Researcher reports to the network operator that his/her data-flow can not exceed more than 400Mbps between Tier-1 and Tier-0

1. He/she can do the next procedure herself

1. The NOC of the Tier-1 will start investigating – they can use the example procedure

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Alarm function - EPCC

Florian Scharinger - EPCC University of Edinburgh

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. CollaborateCase #1

Connect. Communicate. Collaborate

Connect. Communicate. CollaborateE2EMon display

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. CollaborateUser input

Hop DNS IP

0 LHC T-1 10.10.10.2

1 iucc.rt1.fra.de.geant2.net 62.40.125.121

2 so-6-2-0.rt1.gen.ch.geant2.net 62.40.112.21

3 LHC T-0 11.11.11.2

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

Connect. Communicate. Collaborate

LHCOPN weather-map

Connect. Communicate. CollaborateWeather map questions• 15/10: -1- OK to enquire with LHC-OPN about providing circuit information to the E2EMon for the to Taiwan (no E2E link) and two links in the US (CBF) (DFN to provide the

exact list of circuits). • 09/10: -2- OK to enquire wiht LHC-OPN if the current plans for the weathermap are acceptable. • xx/10: -1- OK to enquire wiht LHC-OPN if the current plans for the alarming are acceptable. • 15/10: -1- OK to enquire with LHC-OPN about providing circuit information to the E2EMon for the to Taiwan (no E2E link) and two links in the US (CBF) (DFN to provide the

exact list of circuits). • 15/10: -2- OK to enquire with LHC-OPN about: as the e2e circuit is terminated on the first site equipment (that can be a switch and not a router), would it be possible to get

from the OPN a topology for each site showing everything between the terminiation of the E2E circuit and the IP router. – We need the router(s), its IP addresses relevant for the OPN and interface names, the E2E circiut ID. We need to be able to associate the E2E circuit ID with the

interface of the router it is connected to. – If VLAN or multiple logical interfaces are used over the circuits between two sites, the VLAN IDs and the their mapping to the router interface should be indicated. – If there are switches between the routers and the logical E2E circuits termination, they should be indicated. The mapping between the E2E circuit ID, the router

interface and the site local L2 channel used to link the router to the E2E circuit should be indicated. (e.g. if a VLAN is used, indicate the VLAN ID, the E2E circuit ID it links to which router interface).

– We need that information to design our weathermap and the mapping between circuits. • 15/10: -3- OK to enquire with LHC-OPN if the OPN is OK to show the status of the E2E circuit and not of the whole path to the router (e.g. in the case the circuit terminates on

a switch and not on the router - see -3-). – Note: We could provide more the information up to the router, but that requires work on our side and on theirs to provide the status of the L2 circuits between the

termination of the E2E circuit and the router interface. • 15/10: -4- OK to enquire with LHC-OPN if they plan to do some trunking (multiple 10Gbps circuit put together as a logical circuit with a single IP address on both side).

– We would prefer not to as is complexify greatly our task. • 15/10: -5- OK to enquire with LHC-OPN how frequently the mapping between an e2e circuit and a router interface would change. This can involved any of the following

changes: move the e2e circuit to another equipment, change the router interface, change the router interface IP address. – In the case there are switches between the termination of the E2E circuit and the router (see -3-), indicates estimate of the change frequency of the mapping

between E2E ID, the site L2 circuit ID and the router interfaces. • 15/10: -6- OK to enquire with LHC-OPN if the operational people can update the service desk in a timely fashion if any changes related to their the E2E circuit or the router

interface to which the circuit is linked (e.g. circuit terminated on a different interface, new circuit, etc). The SD will update the file accordingly, so that the change can be taken into consideration by the weathermap. – OPN to be notified that as the process is manual, there would be some mismatch for few hours between the infrastructure and the weathermap.

• 15/10: -7- Only if -3- indicates that there are some active equipment in between the E2E circuit termination and the router interface. OK to enquire with LHC-OPN if they plan to use internally statis L2 circuits or dynamic ones (e.g. GFRP). We are expecting static methods of circuit provisionning.

• 15/10: -8- OK to enquire with OPN if the following is acceptable Decouple the e2e active measurements from the L3 and L2 measuremements. The weathermap would be presented with a single link between two sites, even if there are multiple ones. When clicking on it, the status of each circuit would be presented as well as the utilisation of the IP interfaces, etc. The delay and throughput would as well be presented, but not mapped to a particular circuit.

• 15/10: -9- OK to enquire with OPN if they got multiple circuits between two sites, if they plan to have a primary and an unused backup or if they plan to perform load ballancing (and which one: per packet or per flow).

Connect. Communicate. CollaborateNext steps

• General monitoring feedback mechanism– LHCOPN monitoring feedback group– 3-4 tech. people involved in the LHCOPN operations

• LHCOPN Portal– Central secure access for all the tools and monitoring

information