uptime institutetier standard for data centers facts vs. … · uptime institute tier standard for...

38
Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”) Brill, Founder Uptime Institute [email protected]

Upload: lamquynh

Post on 10-Jun-2018

229 views

Category:

Documents


3 download

TRANSCRIPT

Page 1: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

Uptime InstituteTier Standard for Data Centers

Facts vs. Myths and Misconceptions

Presented byKenneth (“Ken”) Brill, Founder

Uptime [email protected]

Page 2: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

2 © 2011 Uptime Institute

Agenda Introduction to Tiers Benefits and misconceptions Impact of IT availability trends on required

facility Tier level Certifications Training More Tier information

Page 3: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

3 © 2011 Uptime Institute

Uptime Institute Is the ONLY Authority for Tier Standard Certification

Page 4: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

4 © 2011 Uptime Institute4

Uptime’s International Tier Standard Has Two Parts Topology Tier I Tier II Tier III Tier IV

Operational Sustainability (new 2010) Gold Silver Bronze

Chronology Topology introduced in 1996 with update in 2008 Operational Sustainability introduced July, 2010

Page 5: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

5 © 2011 Uptime Institute

This Table Was Eliminated in 2008 to Reduce Confusion & Misconceptions

TIER I TIER II TIER III TIER IVBuilding Type Tenant Tenant Standalone StandaloneStaffing None 1 Shift 1+ Shifts 24 by

ForeverUsable for critical load 100%N 100%N 90%N 90%NInitial build-out UPS output watts/ft² (typ)

20-30 40-50 40-60 50-80

Ultimate UPS output watts/ft² (typical)

20-30 40-50 100-150 150+

Class A uninterruptible cooling

No No Maybe Yes

Support space to raised floor ratio

20% 30% 80-90+% 100+%

Raised floor height (typical) 12” 18” 30-36” 30-36”Floor loading lbs/ft² (typical) 85 100 150 150+Utility voltage (typical) 208,480 208,480 12-15 kV 12-15 kVSingle points-of-failure Many +

human errorMany + human error

Some + human error

Fire+ EPO + human error

Annual site caused IT downtime (actual field data)

28.8 hours 22.0 hours 1.6 hours 0.8 hours

Representative site availability

99.67% 99.75% 99.98% 99.99%

Months to implement 3 3 to 6 15 to 20 15 to 20Year first deployed 1965 1970 1985 1995

Page 6: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

6 © 2011 Uptime Institute

All Drawings Have Been Eliminated

Page 7: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

7 © 2011 Uptime Institute

Common Myths & Misconceptions Two utility sources are required for Tier IV Tier III has no single points-of-failure Fault Tolerance of Tier III is higher than Tier II Site location can preclude a topology rating MTBF calculation determines Tier Tier is determined by equipment count Tier III+, Tier III.6, etc., have meaning Tier IV eliminates human error

Page 8: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

8 © 2011 Uptime Institute

Common Myths & Misconceptions Two utility sources are required for Tier IV Tier III has no single points-of-failure Fault Tolerance of Tier III is higher than Tier II Site location can preclude a Tier rating MTBF calculation determines Tier Tier is determined by equipment count Tier III+, Tier 111.6, etc. have meaning Tier IV eliminates human error

Page 9: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

9 © 2011 Uptime Institute

Data Center Site Infrastructure Tier Standard: Topology

Page 10: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

10 © 2011 Uptime Institute

Tier Topology Certifications Tier IV – Fault Tolerance Constructed facility 5 Complete 0 In-progress

Design documents 14 Complete 3 In-progress

Tier III – Concurrent Maintenance Constructed facility 10 Complete 8 In-progress

Design documents 33 Complete 23 In-progress

As of March 1, 2011

Page 11: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

11 © 2011 Uptime Institute

Data Center Site Infrastructure Tier Standard: Operational Sustainability

Page 12: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

12 © 2011 Uptime Institute

Site Certification:Topology + Operational Sustainability Tier III Gold Certifications awarded UnitedHealth Care, Next Technology Center Target Corporation, Target Technology Center

Brooklyn Park, MN Target Corporation, Target Technology Center

Elk River, MN Fujitsu Services, London North Data Centre

London, UK Others are in progress

Page 13: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

13 © 2011 Uptime Institute

Uptime’s Tier Standard System Provides an unbiased review of a single site or

a comparison of two sites or a portfolio of sites Assists owners in maximizing the life-cycle value of

their investment Assists purchasers of third-party services in

quantifying what they are getting Encourages operations to maximize uptime,

minimize risks, and achieve efficiency Incents and measures processes optimizing energy

efficiency Quantifies site infrastructure management

effectiveness

Page 14: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

14 © 2011 Uptime Institute

Tier Standard Does NOT Compete With or Replace Other Standards & Codes Tier Standard exclusions: Construction and building codes Environmental regulations Safety policies and practices Security (physical and data) Personnel policies Energy efficiency International Standards Organization® (ISO ®)

Page 15: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

15 © 2011 Uptime Institute

Tier Standard Was Developed By and For “Owners” with “Skin in the Game” BMO Financial Group Boeing Delta Airlines Fidelity Investments Fujitsu Services GE HSBC Johnson and Johnson JP Morgan Chase Kaiser Permanente Lowe’s

Morgan Stanley Nationwide Mutual

Insurance Progressive Insurance Saudi ARAMCO Social Security

Administration State Farm Insurance Target United Health US Bank VISA

Page 16: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

16 © 2011 Uptime Institute16

Tier Standard Topology DefinitionsTier I Basic CapacityTier II Redundant ComponentsTier III Concurrent Maintenance - applies to

Each and Every component and path on a planned basis*

Tier IV Fault Tolerance - anticipates a single, mechanical or electrical system or equipment event and subsequent consequential impact(s)** Excludes EPO (North America only), fire, single-corded IT equipment, incorrectly installed dual-corded IT equipment, sabotage, rules violations, and/or mistakes. See Standard for additional details and exclusions.

Page 17: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

17 © 2011 Uptime Institute

Tier Standard Topology Ratings Are Vendor and Solution Agnostic Encourages innovative engineering Allows for client equipment preferences Not a proscriptive checklist, design menu, or a

recipe

Page 18: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

18 © 2011 Uptime Institute

Availability Trends - Today and Future Do you expect prime time and/or availability

requirements to go up or down in the future? Is your compensation tied to an Availability

Service Level Agreement? Is planned and unplanned Facilities downtime

included or excluded from your SLA? Do you have a financially enforceable SLA with your

Facility provider (internal or external) that reflects IT’s commitments to users?

Page 19: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

19 © 2011 Uptime Institute19

Tier I & II Sites Require IT Downtime Tier I and II sites require total shutdown of all IT

hardware during facility maintenance, i.e., all IT hardware dependent upon UPS power Critical power distribution Cooling backbones

Some capacity units (Tier II) MAY be maintained without impacting IT (but often with considerable risk of unplanned downtime)

Page 20: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

20 © 2011 Uptime Institute

UtilityBatteryGenerator

THREE POWER PATHS

ONE POWER PATH

COMPUTER HARDWARE

SINGLE POINTS–OF–FAILURE• UPS system level failure• Major circuit breakers (2-20)• Minor circuit breakers (20-500)• Plugs and receptacles (21-505)• Electrical connections (258-6180)• Human error• False EPO

Single Power Path

Page 21: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

21 © 2011 Uptime Institute

Utility UtilityBatteryGenerator

THREE POWER PATHS

TWO POWER PATHS

COMPUTER HARDWARE

Generator Battery

23

1

SINGLE POINTS–OF–FAILURE• Go to UPS failure• Human error

•False EPO•A/B circuiting errors

Dual Power Path

45

Page 22: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

22 © 2011 Uptime Institute

Planned or Unplanned Site Downtime Can Trigger a Corporate Heart Attack!

Customers/UsersNetwork Nodes

ApplicationsOperating System

HardwareSite

10,000s100s100s10s to 100s10s to 100s1

Page 23: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

23 © 2011 Uptime Institute

Fault Tolerance Is Justified By Eliminating Unplanned User Downtime

Site Restoration = ¼ to 1hr

Hardware Restore = ½ to 1hr

Software Restore = ¾ to 2hr

Telecom Restore = 0 to ¾hr

TOTAL OUTAGE = 1½ to 4¾hr

Page 24: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

24 © 2011 Uptime Institute24

Concurrent Maintainability Is Justified by Eliminating Planned User Downtime Tier I sites require 2 total shut downs per year 2 x 14 hours = 28 hours of IT downtime per year Plus the impact of any unplanned failures

Tier II sites require 3 total shut downs every two years 3 x 14 hours/2 = 21 hours of IT downtime per year Plus the impact of any unplanned failures

Tier III and Tier IV sites do NOT require planned shut downs**Be aware that many “self proclaimed” Tier III Concurrently Maintainable) and Tier IV (Fault Tolerant) sites are not!

Page 25: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

25 © 2011 Uptime Institute25

Tier Standard: Topology Requirements Are Deceptively Simple Active capacity components Distribution paths Compartmentalization Concurrent Maintainability Fault Tolerance

Page 26: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

26 © 2011 Uptime Institute

Topology Requirements by Tier Level Tier I Tier II Tier III Tier IV

Active Capacity Components to Support IT Load

N N+1 N+1 N after any

failureDistribution Paths

1 1 1 active, 1 inactive

2 active

Compartmentalization No No No YesConcurrent Maintainability No No Yes YesFault Tolerance (1 event) No No No YesContinuous Cooling,Continuous Duty EnginesAmbient Temperature

LoadYes

YesYes

Worst

Page 27: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

27 © 2011 Uptime Institute

Accredited Tier Designer (ATD) Intense, three-day curriculum

with concluding test Exclusively for professional or

chartered engineers responsible for infrastructure center design

Focuses on practical applicationof Tier topology

Examples drawn from real-life designs submitted for Design Certification

Addresses, in technical detail, the misconceptions that lead to needless cost and complexity

Page 28: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

28 © 2011 Uptime Institute

A Worldwide Growing Community of ATD ProfessionalsAustralia 2Belgium 1Brazil 5Canada 3China 2Costa Rica 7Czech Republic 1Egypt 1England 6France 1Greece 2India 1Hong Kong 1Kenya 1Luxembourg 1Mexico 3Malaysia 1

Netherlands 1Russia 15Saudi Arabia 2Slovakia 1Spain 3South Africa 1Taiwan 6Thailand 1Turkey 1United Emirates 3United States 30Venezuela 1

As of September 31, 2010

Page 29: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

29 © 2011 Uptime Institute

Site Certification Process:Tier Certified Design Documents Uptime, in-office, Principal-

level review of 30%, 60%, 85%, 100% design document package

Topology deficiencies and potential design enhancements

Conference call with owner and design team to discuss deficiencies report

Compliance review of revised drawings

Award letter and foil

Page 30: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

30 © 2011 Uptime Institute

Site Certification Process: Tier Certified Constructed Facility On-site inspection by Uptime

team during commissioning Identify discrepancies between

certified design drawings and as-built construction

Observe delivered functionality Topology deficiencies and

other enhancements Conference call with owner

team Award letter, foil, and plaque

Page 31: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

31 © 2011 Uptime Institute

Tier Standard:Operational Sustainability (New – 2010) Operational Sustainability is defined as the

behaviors and risks beyond design Topology that impact the ability of a data center to meet business objectives over the long term.

Page 32: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

32 © 2011 Uptime Institute

Abnormal Incident Reports (AIRs) By: Root Cause

2009 Failures

(Previous Years)

2009 Incidents

(Previous Years)

Manufacturer 5 19% (25%) 211 52% (43%)

Design 1 4% (3%) 20 5% (5%)

External Environment - 0% (14%) 10 2% (4%)

Installation 4 15% (7%) 25 6% (7%)

Operations 16 62% (51%) 144 35% (41%)

Total Root Cause 26 100% (100%) 410 100% (100%)

Page 33: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

33 © 2011 Uptime Institute

“Gold” Operational Sustainability Behaviors Prevent 70% of AIRs Failures Failures are never the result of one factor Between 5 and 10 things (typically 7)

contribute to a failure Management is the leading cause (4 out of 7)

A failure will consistently occur every time the 7 things line up

Swiss cheese or dominos analogy

Page 34: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

34 © 2011 Uptime Institute

Operational Sustainability: Categories Management & operations (immediate ability to improve)

Staffing and organization Maintenance Training Planning, coordination, and management

Building characteristics (limited ability to improve) Features Infrastructure Operating conditions Pre-operational

Site location (limited ability to eliminate or even mitigate) Natural disasters Man-made disasters Utility robustness and redundancy

Page 35: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

35 © 2011 Uptime Institute

Accredited Tier Specialist (ATS) (New 2010) Intense three-day curriculum with concluding

test For data center professionals (i.e., for non-

engineers) Includes both Topology and Sustainability

concepts ATS accredited individuals now in 12

countriesAs of March 1, 2011

Page 36: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

36 © 2011 Uptime Institute

Conclusions Tier Standards for Data Centers Topology + Operational Sustainability For owner/operators and buyers of hosting services Together, assure life-cycle business value Downtime frequency, duration, and recoverability CapEx and OpEx effectiveness Life-cycle (resale) value

Operational Sustainability practices often have greater impact on actual IT availability than Topology

Page 37: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

37 © 2011 Uptime Institute

More Information Tier Standard: Topology at

http://uptimeinstitute.org/content/view/302/281/#Tier-Classification

Tier Standard: Operational Sustainability athttp://uptimeinstitute.org/content/view/302/281/#Operational-Sustainability

Current world-wide listing of ATD engineers at http://atd.uptimeinstitute.com/award_list.htm

Certified sites or certifications in process at http://professionalservices.uptimeinstitute.com/tiercert.htm

Upcoming ATD and ATS seminars at http://atd.uptimeinstitute.com/schedule.htm

Tier Myths and Misconceptions at http://professionalservices.uptimeinstitute.com/myths.htm

Page 38: Uptime InstituteTier Standard for Data Centers Facts vs. … · Uptime Institute Tier Standard for Data Centers Facts vs. Myths and Misconceptions Presented by Kenneth (“Ken”)

38 © 2011 Uptime Institute

Questions