application redundancy tool a.r.t. cs 495 fall 2005 kristi olson

16
Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Upload: eric-may

Post on 29-Dec-2015

212 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Application Redundancy Tool

A.R.T.

CS 495 Fall 2005

Kristi Olson

Page 2: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Description

A.R.T. is an system of hot standby for applications.

Designed for applications which need to run continuously.

Intended for use with applications which require socket connections.

Page 3: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Internship

Internship with GCI’s Network Support Group, Operations System Support.

NSG OSS responsibilities: Provision phone service, calling cards,

internet services. Internal Support (data collection) Network Monitoring

Page 4: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Applications supporting these services

Homegrown. Most run around the clock. Some applications are mission critical. Loss of productivity when applications

are down. No formal system of redundancy.

Lack of existing product to buy. Lack of funds.

Page 5: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Applications continued

Methods of transmission vary widely. Fiber optic

Reliable, fast Satellite

Prone to weather related outages, inherent 600 ms delay.

Microwave Even more prone to weather: fog, rain, or

even a hot day causes outages.

Page 6: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Applications continued

Socket connections: Some applications establish one or more

“permanent” socket connections. Others repeatedly establish multiple

“temporary” socket connections.

Page 7: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

How to provide redundancy?

V.R.R.P. Virtual Routing Redundancy Protocol

System of dynamic redundancy. One router is designated master. Other routers are backups. Uses multicasting.

Page 8: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

V.R.R.P. and A.R.T.

Establish “Master” and “backup” instances of the application.

Identical except for a configuration file. The backups loop continuously listening for

status polls from the Master. If the Master stops sending polls, the

backup comes online. Put the application instances on the same

multicast group.

Page 9: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Requirements

Minimal modifications to existing code. Configurable for any type of application and

transmission protocol. Reliability:

A backup can not come online prematurely, nor can it come online too late.

The actual switch over should be minimally service affecting.

Applications should take control or release sockets accordingly.

Page 10: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Configuration

Priority: The instance with the highest priority becomes

the master. Broadcast Interval:

How often the application sends and listens for status polls.

Allowable Missed Polls: Multicasting uses UDP. Less reliable transmission technologies.

Page 11: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Status Polls

Application Name Each application only reads messages

pertaining to itself. Timestamp:

Latency is possible. A.R.T. ignores old messages.

Priority

Page 12: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

Putting it all together

Configuration file: Contains instance information.

A.R.T. Perl module: Defines multicast groups. Evaluates status polls. Other related overhead

Modifications to existing code: Where to poll.

Don’t want to poll too often, or not often enough. Encapsulate socket connections.

Re-factoring opportunity

Page 13: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

High level data flow

Page 14: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

A.R.T. Monitoring

Email Alerts: Notify admins should a switch over occur.

Web Page: Traffic Light:

Green Light - Master is working. Yellow Light – Backup is listening. Red Light - application is down.

Page 15: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

A.R.T. Monitoring Web Page

Page 16: Application Redundancy Tool A.R.T. CS 495 Fall 2005 Kristi Olson

A.R.T.

Questions?