emc cloud-enabled infrastructure for · pdf filewhite paper emc solutions abstract this white...

83
White Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises Cloud-enabled Infrastructure for SAP. It explains the transformation of a single datacenter into a mission-critical business continuity solution with active/active datacenters. The solution is enabled by EMC ® VPLEX ® Metro, EMC next-generation VNX ® , VMware vCloud Suite, and VMware vSphere Metro Storage Cluster with vSphere High Availability and vSphere Distributed Resource Scheduler. December 2013 EMC CLOUD-ENABLED INFRASTRUCTURE FOR SAP—BUSINESS CONTINUITY SERIES: HIGH AVAILABILITY AND APPLICATION MOBILITY BUNDLE—VNX EMC VPLEX, EMC Next-Generation VNX, VMware vCloud Suite, and VMware vSphere Metro Storage Cluster with vSphere HA and DRS Resilient mission-critical SAP deployment in a private cloud Application mobility across datacenters Active/active datacenters

Upload: tranque

Post on 18-Mar-2018

223 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

White Paper

EMC Solutions

Abstract

This white paper focuses on high availability and application mobility add-on bundle of the on-premises Cloud-enabled Infrastructure for SAP. It explains the transformation of a single datacenter into a mission-critical business continuity solution with active/active datacenters. The solution is enabled by EMC® VPLEX® Metro, EMC next-generation VNX®, VMware vCloud Suite, and VMware vSphere Metro Storage Cluster with vSphere High Availability and vSphere Distributed Resource Scheduler.

December 2013

EMC CLOUD-ENABLED INFRASTRUCTURE FOR SAP—BUSINESS CONTINUITY SERIES: HIGH AVAILABILITY AND APPLICATION MOBILITY BUNDLE—VNX EMC VPLEX, EMC Next-Generation VNX, VMware vCloud Suite, and VMware vSphere Metro Storage Cluster with vSphere HA and DRS

Resilient mission-critical SAP deployment in a private cloud Application mobility across datacenters Active/active datacenters

Page 2: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

2

Copyright © 2013 EMC Corporation. All Rights Reserved.

EMC believes the information in this publication is accurate as of its publication date. The information is subject to change without notice.

The information in this publication is provided as is. EMC Corporation makes no representations or warranties of any kind with respect to the information in this publication, and specifically disclaims implied warranties of merchantability or fitness for a particular purpose.

All test results contained in this report were obtained in a rigorously controlled environment. Results obtained in other operating environments may vary significantly. EMC Corporation does not warrant or represent that a customer can or will achieve similar results.

Use, copying, and distribution of any EMC software described in this publication requires an applicable software license.

For the most up-to-date listing of EMC product names, see EMC Corporation Trademarks on EMC.com.

All trademarks used herein are the property of their respective owners.

Part Number H12587

Page 3: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

3 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Table of contents

Executive summary ............................................................................................................................. 5

Business case ................................................................................................................................... 5

Solution overview ............................................................................................................................. 5

Key results ........................................................................................................................................ 6

Introduction ....................................................................................................................................... 7

Purpose ............................................................................................................................................ 7

Objectives ......................................................................................................................................... 8

Audience .......................................................................................................................................... 8

Terminology ...................................................................................................................................... 9

Technology overview ........................................................................................................................ 10

Introduction ................................................................................................................................... 10

Solution architecture ..................................................................................................................... 13

Physical architecture ...................................................................................................................... 15

Logical architecture ........................................................................................................................ 17

Protection layers ............................................................................................................................ 18

Hardware resources ....................................................................................................................... 19

Software resources ........................................................................................................................ 19

Key components ............................................................................................................................... 20

Overview ........................................................................................................................................ 20

EMC VPLEX ..................................................................................................................................... 20

EMC next-generation VNX ............................................................................................................... 21

EMC PowerPath/VE ........................................................................................................................ 21

EMC VSI ......................................................................................................................................... 21

VMware vCloud Suite ..................................................................................................................... 21

Symantec ApplicationHA ................................................................................................................ 22

EMC VPLEX Metro infrastructure ....................................................................................................... 23

Introduction ................................................................................................................................... 23

VPLEX Metro solution configuration ............................................................................................... 26

VPLEX Witness configuration .......................................................................................................... 29

VMware virtual datacenter ................................................................................................................ 30

Introduction ................................................................................................................................... 30

Configuring VMware vCloud Director .............................................................................................. 32

Configuring vCenter Chargeback Manager ...................................................................................... 36

VMware deployments on VPLEX Metro ........................................................................................... 36

VMware stretched cluster configuration ......................................................................................... 39

Page 4: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

4

VMware vSphere HA configuration ................................................................................................. 41

Configuring VMware vSphere DRS .................................................................................................. 46

EMC VSI and VPLEX ........................................................................................................................ 49

Symantec ApplicationHA ................................................................................................................ 50

SAP system architecture ................................................................................................................... 54

Introduction ................................................................................................................................... 54

Configuring SAP system ................................................................................................................. 54

Outage impact ............................................................................................................................... 55

Key design considerations ............................................................................................................. 56

EMC storage infrastructure ............................................................................................................... 58

Introduction ................................................................................................................................... 58

EMC VNX HA ................................................................................................................................... 58

Configuring VNX ............................................................................................................................. 58

Workload generation ........................................................................................................................ 60

SAP standard SD Benchmark ......................................................................................................... 60

Testing and validation ...................................................................................................................... 61

Introduction ................................................................................................................................... 61

Planned downtime ......................................................................................................................... 61

Unplanned downtime ..................................................................................................................... 64

Conclusion ....................................................................................................................................... 79

Summary ....................................................................................................................................... 79

Findings ......................................................................................................................................... 80

References ....................................................................................................................................... 81

EMC ............................................................................................................................................... 81

VMware .......................................................................................................................................... 81

SAP ................................................................................................................................................ 82

Symantec ....................................................................................................................................... 83

Page 5: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

5 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Executive summary

Application and infrastructure availability is a key consideration in today’s modern business continuity strategies for SAP cloud infrastructures stretched across distances.

In traditional infrastructure designs for SAP high availability, workloads across physical datacenters are disrupted when any of the infrastructure components fails at the same time. These traditional high availability designs are limited and usually require special combinations of software and hardware that are operating system-specific, database-specific, or both, making them complex, inflexible, costly, and difficult to implement and maintain. Often these designs require more effort to keep them running than actually providing the required protection and availability promised in the first place.

This white paper introduces an EMC solution for mission-critical SAP high availability with the following objectives:

High availability, which manages risks in an IT environment and plans for recovery across distances resulting from unplanned outages.

Application mobility, which enables non-disruptive movement of running SAP workloads seamlessly across servers or datacenters.

The main business challenges addressed by this solution include:

Protect against single points of failure (SPOFs)

Minimize the impact caused by planned and unplanned downtimes

Reduce the infrastructure complexity and operational costs

Provide an end-to-end automated resilience model

Improve the utilization of resources across datacenters

This solution is an add-on to an existing EMC Cloud-enabled Infrastructure for SAP foundation bundle, and it adds the vSphere Metro Storage Cluster (vMSC) configuration commonly referred to as a stretched cluster.

VMware vMSC is a VMware certified solution, which is a combination of storage array based clustering and synchronous replication provided by EMC® VPLEX® Metro.

The main goal of this VMware vMSC solution is to extend the high availability capability provided by the local clusters within a local datacenter to a geographically separated model with two datacenters in different locations.

The architecture extends what is defined as “local” in terms of network and storage. This enables the network and storage subsystems to span a metropolitan area, presenting a single and common base infrastructure set of resources to the VMware vSphere cluster at both datacenters.

This solution provides a new level of availability with an end-to-end, out-of-the-box, integrated, simplified, and cost-effective solution that is also application-aware.

Business case

Solution overview

Page 6: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

6

The white paper demonstrates how the following technologies create this innovative solution:

EMC VPLEX Metro—Provides the virtual storage layer that enables an active/active Metro datacenter and supports continuous availability, even in the event of disruption at one of the datacenters.

VMware vCloud Director—Applies the principles of pooling, abstraction, and automation to all datacenter services such as compute, storage, networking, and security. It enables the multi-tenancy business model for IT operation and provides the portal for self-service provisioning.

EMC next-generation VNX® flash-optimized unified storage platform—Delivers innovation and enterprise capabilities for file, block, and object storage in a single, scalable, and easy-to-use solution.

VMware vSphere—Transforms datacenters into simplified cloud computing infrastructures and enables IT organizations to deliver flexible and reliable IT services. Provides high availability independent of hardware, operating system, and applications. Reduces planned downtime for common maintenance operations and prevents extended downtime with automatic, rapid recovery in case of failures.

VMware vSphere Distributed Resource Scheduler (DRS)—Aligns compute resources with business priorities and affinity rules by automatically balancing virtual machines across hosts.

Symantec ApplicationHA—Simplifies and centralizes application administration and management through integration with VMware vSphere and guest applications.

The solution builds the HA and application mobility add-on bundle by providing the following functions:

Mission-critical application protection with the automatic restart of virtual machines and SAP application-specific processes in the event of a datacenter, server, or process outages

Simple and automated HA protection for complete SAP cloud datacenters provided by vSphere HA

Additional benefits include the following:

Increased use of hardware and software assets:

Improved resource utilization across datacenters

Automatic load balancing between datacenters

Zero downtime on planned infrastructure maintenance

Reduced resources required with single clustered systems that eliminate the need for additional standby systems.

Reduced management, maintenance costs and risks with full end-to-end application restart automation, minimizing human intervention.

Key results

Page 7: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

7 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Introduction

This white paper describes the HA and application mobility add-on bundle for the EMC Cloud-enabled Infrastructure for the SAP foundation bundle.

This solution is intended to provide enhanced application availability and application mobility for workloads across physical datacenter boundaries with EMC VPLEX, VMware vSphere HA, vSphere DRS, and Symantec ApplicationHA in a vMSC configuration.

Business continuity for SAP

Because SAP is a major part of the core business operations in many organizations, it is critical that these organizations have a business continuity management plan to:

Safeguard the continuity of their business operations

Protect revenue

Recover to at least a minimum level of operation if an outage occurs

A business continuity management plan:

Helps to manage risks in an IT environment

Details contingency strategies of business processes following a disruption of operations

Business continuity for IT comprises three components:

High availability (HA)

Data protection (DP)

Disaster recovery (DR)

One way to provide high availability is to minimize the effect of an unplanned outage by masking the outage from the end users. This requires an availability automation solution to manage application failover within the same server, or between servers, within or across datacenters to ensure that high availability measures are achieved.

The IT Infrastructure Library (ITIL) is a public framework of best practices for IT service management. For more information, refer to the ITIL publication ITSC – IT Service Continuity Management. The SAP Solution Management Business Continuity Best Practices document refers to ITIL, which was considered in this white paper.

Types of failures

Three main types of failures should be considered in IT business continuity management:

Technical failures—Range from crashes of individual hardware components to entire datacenters. They disrupt normal operations and require different solutions based on their severity to resume the business operations.

Logical failures—Can be caused by faulty or malicious software, or the incorrect use of software such as corrupt data that can disrupt business processes.

Logistical failures—Are operational or logistical business operations such as unavailable staff or facilities.

Purpose

Page 8: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

8

This paper focuses on the high availability component to address technical failures and increase the availability of a cloud-enabled SAP landscape.

Recovery

The following are two major types of recovery after an outage:

System recovery—Technical availability of failed systems must be reestablished. Technical availability through high availability is the main focus of this paper. Disaster recovery is another form of system recovery after a disaster event. The EMC Cloud-enabled Infrastructure for SAP disaster recovery add-on bundle addresses this kind of recovery type.

Business recovery—Logical or data inconsistencies must be corrected. (Logical errors may sometimes be the result of technical failures). The EMC Cloud-enabled Infrastructure for SAP data protection add-on bundle addresses the technical aspect of this recovery type.

Simplifying SAP high availability

This white paper describes a solution that addresses technical failures and provides high availability protection in a simplified and cost-effective manner compared to traditional HA solutions in the market. This solution was implemented in an EMC lab environment to validate the protection of an SAP system or landscape after minor and major technical failures.

The objectives of the white paper are to:

Introduce the key enabling technologies

Describe the solution architecture and design

Describe how the key components are configured

Present the results of the tests and validation performed

Identify the key business benefits of the solution

This white paper is intended for SAP Basis Administrators, storage administrators, IT architects, and technical managers responsible for designing, creating, and managing mission-critical SAP applications in 24/7 landscapes. Previous technical knowledge of cloud, virtualization, server, networking, and storage solutions is required to fully comprehend the concepts and benefits described in this paper.

Objectives

Audience

Page 9: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

9 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Table 1 defines the terms and abbreviations used in this white paper.

Table 1. Terminology

Term Description

AAS SAP Additional Application Server (previously known as Dialog instance)

ABAP SAP Advanced Business Application Programming, the programming language used by SAP, describes the type of SAP architecture used in this solution.

ASCS ABAP SAP Central Services (runs the central enqueue and message server services)

dvSwitch vSphere distributed switch

EHP SAP Enhancement Package

ERP Enterprise resource planning, powered by the SAP NetWeaver technology platform, is a fully-integrated enterprise resource planning (ERP) application that fulfills the core business needs of midsize companies and large enterprises across all industries and market sectors.

SAP ERP DB SAP Database Server that runs the database instance

Terminology

Page 10: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

10

Technology overview

This section introduces the concept of the Cloud-enabled Infrastructure as a whole and how its key components are integrated. This section also describes the hardware and software components part of this solution.

The Cloud-enabled Infrastructure for SAP solution is a result of the preferred three-way partnership between EMC, SAP, and VMware. The infrastructure is divided into several different functionalities, which are delivered as separate add-on bundles, as illustrated in Figure 1. The EMC Cloud-enabled Infrastructure for SAP solution is designed to offer flexibility so that customers can choose the required cloud functionalities (hereon referred to as add-on bundles), which they can enable without losing sight of their end goal—an efficient and reliable private cloud.

Figure 1. Cloud-enabled Infrastructure for SAP

A journey to the private cloud starts with the foundation bundle, which is described in EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper.

Introduction

Page 11: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

11 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

The EMC Cloud-enabled Infrastructure for SAP foundation bundle is a mandatory component and it is the base for all add-on bundles. Table 2 outlines the functions and benefits of the EMC Cloud-enabled Infrastructure foundation bundle.

Table 2. On-premises EMC Cloud-enabled Infrastructure for SAP solution: Foundation bundle

Function Benefits Technology

Virtual datacenter for SAP

Autonomy of business units and application operations

Service catalogs

Service level agreements (SLAs)

Management of vCloud tenants

Resource pooling

VMware vCloud Suite Enterprise

Infrastructure chargeback

Cost measurement, analysis, and reporting of the use of compute, network, storage, and backup resources (in combination with data protection add-on bundle)

VMware vCenter Chargeback

Integrated cloud management and performance analysis

Manage availability, capacity, performance, and health in the SAP landscape

VMware vCenter Operations Manager

EMC Storage Resource Management

EMC Storage Analytics (ESA)

EMC Virtual Storage Integrator (VSI)

Storage tiering Automatically get the right data to the right place at the right time

EMC next-generation VNX

EMC FAST™ Suite

Cloud networking and security for SAP

Cloud-enabled Infrastructure security framework

Authorization concepts

Compliance and non-compliance tracking

VMware vCloud networking and security

For more information about the foundation bundle, refer to the EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper.

Page 12: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

12

Table 3 lists the five add-on bundles that enable more cloud functionalities and data center enhancements.

Table 3. EMC Cloud-enabled Infrastructure for SAP solution: Add-on bundles

IT strategy Bundle Benefits Technology

Business continuity

High availability (HA) and application mobility for SAP

High availability within one datacenter and across two datacenters with application awareness for the complete SAP landscape

Non-disruptive movement of applications from one datacenter to another datacenter

Improved resource utilization across datacenters

Minimize downtime for infrastructure maintenance

EMC VPLEX Metro

vSphere HA

vSphere DRS

Symantec ApplicationHA

Disaster recovery (DR) for SAP

Provide disaster recovery protection for cloud management applications and SAP systems

VMware vCenter Site Recovery Manager

EMC RecoverPoint®

Data protection (DP) for SAP Provide data protection to cover the cloud management applications and SAP systems

Backup and recovery at all datacenters with remote replication of backup sets for offsite protection

EMC Avamar®

EMC Data Domain®

EMC Data Protection Advisor

Service support/ provisioning

Automation and operations for SAP

SAP application virtualization enables any service any time on any server

Provision SAP systems on demand with automated end-to-end process

Reduce downtime window during maintenance leveraging mass operations

SAP NetWeaver Landscape Virtualization Management

Enhanced security

Enhanced security and compliance for SAP

Efficient, collaborative enterprise governance, risk and compliance (eGRC) program across IT, finance, operations, and legal domains

Data loss prevention

Secure user to network authentication

RSA Archer eGRC

RSA Data Loss Prevention

RSA SecurID

Page 13: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

13 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

The following sections describe the additional components required for the EMC Cloud-enabled Infrastructure for SAP to transition from the foundation bundle to the HA and application mobility add-on bundle, as shown in Figure 2.

Figure 2. Transition from foundation to HA/Mobility for SAP bundle

This add-on bundle combines EMC VPLEX Metro and VMware vSphere stretched cluster with the EMC Cloud-enabled Infrastructure for SAP foundation bundle, as well as providing a high level of availability. VPLEX Metro and vSphere HA/DRS in a stretched cluster provide high availability through the effective use of cluster resources at both datacenters, as shown in Figure 3.

Figure 3. High availability infrastructure layers

Solution architecture

Page 14: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

14

Storage high availability

In the EMC Cloud-enabled Infrastructure for SAP foundation bundle, all vSphere virtual machines in both the management and resource clusters are stored in VMware datastores based on LUNs in the local EMC VNX.

This bundle uses EMC VPLEX Metro federated solution across the two datacenters to provide the distributed storage to the all the vSphere ESXi hosts across datacenters, using the EMC VPLEX Metro distributed virtual volumes.

An EMC VPLEX Metro Distributed Virtual Volume is seen as only one volume (datastore). It is replicated synchronously (read/write) on the EMC VNX arrays on both datacenters, independent from where updates are being made to them.

EMC VPLEX Metro provides active copies of data at both datacenters and across a stretched Layer 2 IP network. VMware vMotion provides uninterrupted mobility of application workloads across the datacenters enabled by EMC VPLEX.

This bundle also uses EMC VPLEX Witness to monitor connectivity between the two EMC VPLEX clusters on each datacenter and ensure continued availability in the event of an inter-cluster link failure or a datacenter failure. EMC VPLEX Witness is deployed on a virtual machine at a third, separate failure domain (Datacenter C).

SAN resiliency with EMC PowerPath® /VE

EMC PowerPath/VE provides the capability to:

Automate and optimize data paths in virtual environments to ensure business process availability and performance.

Protect virtual environments from physical hardware failures to ensure uninterrupted service and the automatic failover and recovery.

Simplify load balancing to help eliminate I/O bottlenecks.

The load balancing algorithms of EMC PowerPath/VE automatically adjust the I/O path use from virtual machines to local I/O loads. With EMC VPLEX Cross-Cluster Connect, PowerPath/VE leaves the cross cluster volumes in the standby mode, if it detects a preferred path to the local VPLEX storage volumes.

Network high availability

In each datacenter, a redundant 10 GbE fabric provides the core IP Network for datacenter traffic and uplinks for user access. The same IP subnets are available at both datacenters. The Layer 2 routers extend the Layer 2 broadcast domains across the two datacenters.

Page 15: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

15 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Figure 4 shows the physical architecture of all layers of the solution, including the IP and SAN network components.

Figure 4. Physical architecture

The physical architecture depicted in Figure 4 provides redundant components and connections in all infrastructure layers. Brocade gear was used for the SAN and IP networks in the validated lab environment, but Cisco gear with the same specifications can be used instead, or a combination of both can be deployed.

IP network layer

The IP network in each datacenter has one Brocade MLXe router and two Brocade VDX 6720 switches in a Virtual Cluster Switch VCS configuration. All vSphere ESXi hosts are connected to the network with redundant 10 GbE connections provided by 10 Gb converged network adapter (CNA) cards. The two switches at each datacenter are connected to a router with a Virtual Link Aggregation Group (vLAG). The routers extend the Layer 2 network between two datacenters.

Physical architecture

Page 16: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

16

Note: A vLAG is a fabric service that enables a Link Aggregation Group (LAG) to originate from switches. As the standard LAG, a vLAG uses the Link Aggregation Control Protocol (LACP) to control the bundling of several physical ports together to form a single logical channel.

All network traffic between Datacenter A and Datacenter B are routed by using multiple ports configured as a LAG. Figure 5 shows the IP network layer.

Figure 5. IP network layer

SAN layer

The SAN in each datacenter was built with Brocade DCX 8510 Backbone switches, as shown in Figure 6. All vSphere ESXi hosts are connected to the SAN by using redundant 8 GbE connections provided by the pair of host bus adapters (HBAs) in each server. The multiple Fibre Channel (FC) connections between the Brocade DCX 8510 Backbone switches are not only used for mirroring EMC VPLEX Metro storage across datacenters, but also provide a HA Cross-Cluster Connect between datacenters.

Figure 6. SAN Layer

Page 17: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

17 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Figure 7 shows the logical architecture of all layers of the solution, including the network components.

Figure 7. Logical architecture

The logical architecture depicted in Figure 7 demonstrates that the management and resource clusters are stretched to span both datacenters.

Logical architecture

Page 18: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

18

Table 4 summarizes the high availability layers provided before and after the transition between the foundation and HA bundles. DC represents datacenter in the table.

Table 4. High availability in local and remote datacenters

ID Layer Protected components DC DC Protection

01 SAP Application SAP work processes (DIA/UPD/UPD2/SPO) A A & B Multiple SAP Appl. Servers

02 SAP Application & Database

SAP Database, SAP AAS1, SAP AAS2, SAP ASCS, and SAP shared file systems

A A & B Symantec ApplicationHA

03 Operating System Operating system services and file systems A A & B VMware vSphere HA and Symantec ApplicationHA

04 Virtual machine Management and SAP virtual machines A A & B VMware vSphere HA / DRS

05 Host Server A A, B, & C VMware vSphere ESXi

06 Fabric SAN Fabric paths management A A & B EMC PowerPath/VE

07 LAN IP Network uplinks A A & B Redundant 10GbE IP networks

08 SAN Storage area network A A & B Redundant 8Gb FC networks

09 SAN Storage area network paths A A & B EMC Cross-Cluster Connect

10a SAN Storage array A EMC VPLEX Local

10b SAN Storage array A & B EMC VPLEX Metro

11 Storage Local storage resources (drives, FAs, DAs, others) A B EMC VNX

12 Storage Local storage resources (drives, FAs, DAs, others) A A EMC VNX

Local high availability can also be provided for a single datacenter by using the components numbered 01 to 08 and 12 described in Table 4. For more information, refer to the VMware and EMC documents listed in the References section.

EMC VPLEX Local provides a higher level of local high availability with a clustering architecture that provides virtual volumes replicated between two local storage arrays. This allows the servers in the local datacenter to have read/write access to both shared block storage devices. This can be accomplished by using the components described in Table 4 except for component numbered 10a. For more information, refer to the EMC VPLEX documents listed in the References section.

EMC VPLEX Metro extends the concept of local high availability to an additional metro distance datacenter. This storage transformation takes high availability to a new level of mission-critical business continuity. The components required for this configuration are all the components described in the Protection column in Table 4, except the component with the number 10b.

All scenarios described above require the installation, configuration, and the implementation of the best practices for each component as described in the corresponding sections later in this document and in the documents listed in the References section.

Figure 7 shows the high-availability design with VPLEX Witness and Cross-Cluster Connect deployed in this bundle, providing the highest level of resilience.

Each of the components shown in Table 4 is explored in more detail in the relevant sections of this white paper.

Protection layers

Page 19: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

19 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Table 5 details the hardware resources for the solution.

Table 5. Solution hardware resources

Purpose Quantity Configuration

Storage (Datacenter A) 1 EMC VNX8000

Storage (Datacenter B) 1 EMC VNX8000

Distributed storage federation 2 EMC VPLEX Metro cluster, with2 x VS2 engines

ESXi hosts for management cluster 4 2 x four-core CPUs, 128 GB RAM

ESXi hosts for resource cluster 4 4 x eight-core CPU, 256GB RAM

ESXi hosts for VPLEX Witness 2 2 x two-core CPUs, 48 GB RAM

Network switching and routing platform

2

Brocade DCX 8510 Backbone, with: Fx8-24 FC extension card 2 x 48-port FC Blades with 16 Gb FC line speed support

Brocade MLXe Router 4 Brocade VDX 6720 in VCS mode

Table 6 details the software resources used in the solution.

Note: Other operating system and database combinations supported by the EMC Solutions Support Matrix (SSM) can also be used.

Table 6. Solution software resources

Software Version Purpose

EMC Block Operating Environment (OE) 05.33 Operating environment for VNX

EMC PowerPath/VE 5.8 Multipathing software providing continuous active paths

EMC Unisphere 1.3.0 VNX management software

EMC Unisphere for VPLEX 5.2 VPLEX management software

EMC VPLEX Witness 5.2 Handles VPLEX failures & inter-cluster communication loss

SAP ERP 6.0 EHP5 Unicode—standard system

SAP NetWeaver 7.02 Unicode x86_64

Microsoft SQL Server 2012 Used on the SAP systems

2008 R2 Used by VMware vCenter Chargeback and vCloud Director

VMware vSphere 5.1 Hypervisor hosting all virtual machines

VMware vCenter Server 5.1 Management and resource clusters

VMware vCloud Director 5.1 Orchestration Tool for Provisioning Virtual Infrastructure

VMware vCloud Networking and Security 5.1 Security software

Microsoft Windows Enterprise Server 2008 R2 Operating system for the SAP virtual machines

Symantec ApplicationHA 6.11 Application-aware clustering software for the guest OS of the virtual machines

1 For release availability information, see the Symantec website.

Hardware resources

Software resources

Page 20: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

20

Key components

This section introduces the following technology components:

EMC VPLEX

EMC next-generation VNX

EMC PowerPath/VE

EMC Virtual Storage Integrator (VSI)

VMware vCloud Suite

Symantec ApplicationHA

EMC VPLEX is a storage virtualization solution for both EMC and non-EMC storage arrays, which offers many advantages for data migrations between arrays. VPLEX performs non-disruptive copying of the underlying storage volumes between the old and new array, as shown in Figure 8. The VPLEX tunable copy process enables dynamic copying without affecting host performance. All hosts remain online during this process. VPLEX substantially reduces the service expenses and time-to-value related in moving to a new infrastructure.

Figure 8. Data migration between arrays with VPLEX

EMC VPLEX Metro is the primary technology enabled in the solution. EMC VPLEX Metro is a storage area network-based (SAN) federation solution that delivers both local and distributed storage. VPLEX Metro enables the same data to exist in two datacenters in separate geographical locations that can be accessed and updated at both datacenters at the same time. With EMC VPLEX Witness, applications continue to be available, with minimal interruption or downtime, if an EMC VPLEX outage was to occur at one of the datacenters.

EMC offers the following three VPLEX configurations for high availability and data mobility:

EMC VPLEX Local

EMC VPLEX Metro

EMC VPLEX Geo

For more information, refer to EMC VPLEX Metro infrastructure.

Overview

EMC VPLEX

Page 21: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

21 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

The EMC VNX flash-optimized unified storage series delivers innovation and enterprise capabilities for file, block, and object storage in a single, scalable, and easy-to-use solution. Ideal for mixed workloads in physical or virtual environments, VNX combines powerful and flexible hardware with advanced efficiency, management, and protection software to meet the demanding needs of virtualized application environments.

VNX includes many features and enhancements designed and built upon the first success of previous generations. The following features and enhancements support the performance and scalability demands of virtual environments:

More capacity with optimized multicore optimization (MCx) with Multicore Cache, Multicore RAID, and Multicore FAST Cache

Greater efficiency with a flash-optimized hybrid storage system

PowerPath/VE is a path management solution for VMware. It provides the highest level of dynamic load balancing, path failover, path restoration, path testing, and automated performance optimization.

EMC VSI provides multiple feature sets including Storage Viewer (SV), Path Management, and Unified Storage Management. It facilitates the discovery and identification of all EMC storage devices allocated to vSphere ESXi hosts and virtual machines. Unified Storage Management simplifies the provisioning of VNX pool-based storage for datacenters, vSphere ESXi host, and resource pools. Path Management enables you to control how users access datastores.

VMware vCloud Suite provides all components for building and running a private cloud infrastructure, based on VMware vSphere that leverages the software-defined datacenter architecture. This architectural approach delivers virtualized infrastructure services (compute, network, security, and availability) with built-in intelligence to automate the on-demand provisioning, placement, configuration and control of applications based on defined policies.

This solution includes the following vCloud Suite components:

VMware vSphere—Compute virtualization platform with policy-based automation

VMware vCenter Site Recovery Manager2—Automated disaster recovery planning, testing, and execution

VMware vCloud networking and security3—Networking and security with ecosystem integration for a virtualized compute environment.

VMware vCenter Operations Management Suite4—Integrated, proactive performance, capacity, and configuration management for dynamic cloud environments.

2 Used in the disaster recovery bundle solution. 3 Used in the foundation bundle solution. 4 Used in the foundation bundle solution.

EMC next-generation VNX

EMC PowerPath/VE

EMC VSI

VMware vCloud Suite

Page 22: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

22

VMware vCloud Director—Virtualized datacenters with multi-tenancy and public cloud extensibility

VMware vSphere

VMware vSphere virtualizes and aggregates the underlying physical hardware resources across multiple systems and provides pools of virtual resources to the datacenter. As a cloud operating system, VMware vSphere manages large collections of infrastructure (such as CPUs, storage, and networking) as a seamless and dynamic operating environment, managing the complexity of a datacenter. vSphere HA maximize uptime across your virtualized infrastructure, reducing unplanned downtime and eliminating planned downtime for server and storage maintenance.

VMware vCloud networking and security

VMware vCloud networking and security is the leading software-defined networking and security solution that enhances operational efficiency and unlocks agility, enables extensibility to rapidly respond to business needs, and provides a broad range of services in a single solution, including virtual firewall, virtual private network (VPN), load balancing, and VXLAN extended networks.

VMware vCloud Director

VMware vCloud Director orchestrates the provisioning of software-defined datacenter services as complete virtual datacenters that are ready for consumption in a matter of minutes. Virtual datacenters provide virtualized computing, networking, storage, and security. Software-defined datacenter services and the virtual datacenters fundamentally simplify infrastructure provisioning, and enable IT to move at the speed of business.

Symantec ApplicationHA is a software solution integrated on top of vSphere HA and provides application awareness to vSphere HA, so it can monitor and restart them automatically if a hardware or software failure occurs inside the VMware virtual machines.

Symantec ApplicationHA adds a layer of application awareness to the core HA functionality offered by VMware vSphere virtualization technology.

Symantec ApplicationHA is based on Veritas Cluster Server and uses similar concepts such as agents, resources, and service groups. Symantec ApplicationHA has a lightweight server footprint that allows faster installation and configuration.

Symantec ApplicationHA

Page 23: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

23 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

EMC VPLEX Metro infrastructure

This section describes the VPLEX Metro infrastructure for the solution, which is composed of the following components:

EMC VPLEX Metro cluster at each datacenter (Datacenter A and B)

EMC VPLEX Witness in a separate failure domain (Datacenter C)

EMC VPLEX

EMC VPLEX is a storage virtualization solution for both EMC and non-EMC storage arrays. EMC offers VPLEX in different configurations to address customer needs for high availability and data mobility.

For detailed descriptions of these VPLEX configurations, refer to the documents listed in the References section.

EMC VPLEX Metro

VPLEX Metro uses a unique clustering architecture to allow servers at multiple datacenters, geographically dispersed, to have read/write access to shared block storage devices. VPLEX Metro delivers active/active, block-level access to data on two datacenters within synchronous distances with a round-trip time of up to 5 milliseconds. For this bundle, a round-trip time is not to exceed 1 millisecond with the Cross-Cluster Connect configuration.

EMC VPLEX Witness

VPLEX Witness is an external server that is installed as a virtual machine in a separate failure domain to the VPLEX clusters. VPLEX Witness connects to both VPLEX clusters using a VPN over the management IP network. It requires a round-trip time (RTT) that does not exceed 1 second.

By reconciling its own observations with information reported periodically by the clusters, VPLEX Witness enables the cluster(s) to distinguish between inter-cluster network partition failures and cluster failures and automatically resume I/O at the appropriate datacenter. VPLEX Witness failure handling semantics apply only to distributed volumes within a consistency group and only when the detach rules identify a static preferred cluster for the consistency group (see VPLEX consistency groups on page 25 for further details).

EMC VPLEX management interface

You can manage and administer a VPLEX environment with the web-based Unisphere for VPLEX or you can connect directly to a management server and start a VPLEX command line interface (VPLEXcli) session.

Introduction

Page 24: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

24

EMC VPLEX HA

VPLEX Metro enables application and data mobility. When configured with VPLEX Witness, it provides a high-availability infrastructure for clustered applications. VPLEX Metro enables you to build a stretched cluster like a local cluster, and removes the datacenter as an SPOF. Furthermore, as the data and applications are active at both datacenters (active/active), this configuration provides a simple high availability business continuity solution. VPLEX Witness is part of the VPLEX Metro and enables two optional architectures as follows:

VPLEX Metro HA with Cross-Cluster Connect—Hosts are connected to the local VPLEX cluster on which they reside, and also have an alternate path to the remote VPLEX cluster. You can achieve the highest degree of availability by using a VPLEX Cross-Cluster Connect configuration. In the unlikely event that an entire VPLEX cluster, storage array, or SAN fails, with Cross-Cluster Connect configuration, hosts have an alternate path to the VPLEX Metro distributed volumes through the surviving VPLEX cluster, eliminating a short downtime associated with restarting the virtual machine on the secondary datacenter. The inter-cluster network latency should not exceed 1 millisecond RTT between VPLEX clusters.

VPLEX Metro HA without Cross-Cluster Connect—Hosts are only connected to the local VPLEX cluster on which they reside. Because host clusters are connected to the VPLEX distributed volume providing the same block data between two datacenters, when components (hosts, storage, servers and so on) fail, it minimizes the recovery time by automatically restarting the virtual machine on the secondary datacenter. The latency exceeds 1 millisecond but should be less than 5 milliseconds RTT.

VPLEX logical storage structures

VPLEX encapsulates traditional physical storage array devices and applies layers of logical abstraction to these exported LUNs, as shown in Figure 9.

Figure 9. VPLEX logical storage structures

Page 25: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

25 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

VPLEX storage volume

A storage volume is a LUN exported from an array and encapsulated by VPLEX. An extent is the mechanism VPLEX uses to divide storage volumes and may use all or part of the capacity of the underlying storage volume. A device encapsulates an extent or combines multiple extents or other devices into one large device with a specific RAID type.

At the top layer of the VPLEX storage structures are virtual volumes. These are created from a top-level device (a device or distributed device) and always use the full capacity of the top-level device. Virtual volumes are the elements that VPLEX exposes to hosts using its front-end ports. VPLEX presents a virtual volume to a host through a storage view.

VPLEX can encapsulate devices across heterogeneous storage arrays, including virtually provisioned thin devices and traditional LUNs.

VPLEX consistency groups

Consistency groups aggregate virtual volumes so that the same properties (detach rules and others) can be applied to all volumes in the group. There are two types of consistency groups:

Synchronous consistency groups—These are used in VPLEX Local and VPLEX Metro to apply the same detach rules and other properties to a group of volumes in a configuration. This simplifies configuration and administration in large systems.

With write-through caching in synchronous consistency groups, in the separated cluster environment, VPLEX Metro supports up to 5 milliseconds of latency. VPLEX Metro sends writes to the back-end storage volumes, and acknowledges a write to the application only when the back-end storage volumes in both clusters acknowledge the write.

Asynchronous consistency groups—These are used for distributed volumes in VPLEX Geo, where clusters can be separated by up to 50 milliseconds of latency.

Consistency groups are particularly important for databases and their applications. For example:

Write-order fidelity—Maintains data integrity, vSphere ESXi LUNs forming a datastore cluster should be placed together in a single consistency group.

Transactional dependency— Multiple databases often have transaction dependencies, such as when an application issues transactions to multiple databases and expects the databases to be consistent with each other. All LUNs that require I/O dependency to be preserved should reside in a single consistency group.

Application dependency—SAP stores database files within a set of datastores that must be accessible to maintain database availability. The datastore devices of database files should reside in a single consistency group.

Detach rules

Detach rules are predefined rules that determine I/O processing semantics for a consistency group when connectivity with a remote cluster is lost, for example in the case of a network partitioning or remote cluster failure.

Page 26: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

26

Synchronous consistency groups support the following detach rules to determine cluster behavior during a failure:

The static preference rule identifies a preferred cluster.

The no-automatic-winner rule suspends I/O on both clusters.

Setting a detach rule is always invoked when connectivity is lost between clusters. However, VPLEX Witness can be deployed to override the static preference rule and ensure that the non-preferred cluster remains active if the preferred cluster fails.

Storage structure

Figure 10 shows the physical and logical storage structure used by VPLEX Metro in this solution.

Figure 10. VPLEX physical and logical storage structures for this solution

A one-to-one mapping between storage volumes, extents, and devices exists at each datacenter. The devices inside both Datacenter A (cluster-1) and Datacenter B (cluster-2) are virtually provisioned thin devices.

All cluster-1 devices are mirrored remotely on cluster-2, in a distributed configuration, to create distributed devices. These distributed devices are encapsulated by virtual volumes, which are then presented to the hosts through storage views.

VPLEX Metro solution configuration

Page 27: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

27 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Configuration process

For this bundle, we used the configuration wizard provided by Unisphere for VPLEX to configure the VPLEX Metro logical storage structure.

Storage volumes—Figure 11 shows that several storage volumes were created on Datacenter A, as displayed in the VPLEX Management Console.

Figure 11. EMC VPLEX storage volumes (Datacenter A)

Extents—VPLEX divides storage volumes in extents. This bundle includes a one-to-one mapping between extents and storage volumes, as shown in Figure 12. Extents have the same size as the storage volumes from which they are created.

Figure 12. EMC VPLEX Extents

Devices—One-to-one mapping was configured between devices and extents. Figure 13 shows the option used to configure this mapping.

Page 28: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

28

Figure 13. EMC VPLEX device creation wizard: mapping extents

Distributed devices—The distributed devices were created by mirroring a device remotely in a distributed RAID 1 configuration, as shown in Figure 14.

Figure 14. EMC VPLEX device creation wizard: selecting mirrors

Virtual volumes—All top-level devices are distributed devices. These devices are encapsulated by virtual volumes, which EMC VPLEX presents to the hosts through storage views. The storage views define which hosts access which virtual volumes on which EMC VPLEX ports.

Consistency groups—In this bundle, the following two synchronous consistency groups were created, as shown in Figure 15.

Resource cluster consistency group contains the SAP virtual machines.

Management cluster consistency group contains the management virtual machines.

Figure 15. Consistency-group configuration in Unisphere for VPLEX

Page 29: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

29 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

This bundle uses EMC VPLEX Witness to monitor connectivity between the two VPLEX clusters and ensure continued availability when an inter-cluster network partition fails or a datacenter fails. This is considered a VPLEX Metro HA configuration as storage availability is ensured at the surviving datacenter.

EMC VPLEX Witness was deployed at a third, separate failure domain (Datacenter C) and was connected to the EMC VPLEX clusters at Datacenter A and Datacenter B. Datacenter C is located at a distance of less than 1 second RTT from Datacenters A and B. When a VPLEX Witness has been installed and configured, the VPLEX Management Console displays the status of cluster witness components, as shown in Figure 16.

Figure 16. EMC VPLEX Witness components and status

For additional details, refer to the Implementation and Planning Best Practices for EMC VPLEX Technical Note and the Using VPLEX Metro with VMware High Availability and Fault Tolerance for Ultimate Availability White Paper listed in the References section.

VPLEX Witness configuration

Page 30: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

30

VMware virtual datacenter

VMware virtual datacenter is the key enabling technology introduced in the EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper. This solution reuses the existing architecture in the foundation bundle and configures the HA capability as an add-on. The SAP systems are fully virtualized in vApps using VMware vSphere and vCloud Director. The same networking and security configurations in foundation bundle also apply to the HA and application mobility add-on bundle.

This section describes the following components, technologies and options:

VMware vCloud Director

VMware vSphere

VMware vCenter Server

VMware vCenter Chargeback Manager

VMware vSphere vMotion

VMware vSphere HA

VMware vSphere DRS

EMC PowerPath/VE for VMware vSphere

EMC VSI for VMware vSphere

VMware vCloud Director

VMware vCloud Director applies the principles of pooling, abstraction, and automation to all datacenter services like storage, networking, and security using virtual datacenters. A virtual datacenter is an elastic logical container that provides all infrastructure services necessary to make workloads operational in minutes. Applications provisioned into these containers are automatically placed in the most optimal VMware vCenter server cluster.

VMware vSphere

VMware vSphere is a virtualization platform that provides infrastructure services transforming IT hardware into a shared computing platform, and application services helping IT organizations deliver high levels of availability, security, and scalability. vSphere is the base component in the EMC Cloud-enabled Infrastructure for SAP on all add-on bundles. It provides an abstraction of the physical server layer that allows virtual machines to be independent of the brand, model and type of x86 server architecture where they are running.

VMware vCenter Server

VMware vCenter Server is the centralized management platform for vSphere environments, enabling control and visibility at every level of the virtual infrastructure. vCenter Servers manage vSphere HA/ DRS clusters that are created between the two datacenters using VMware vSphere 5 hosts, and are connected to the vSphere hosts at both datacenters.

Introduction

Page 31: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

31 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

VMware vCenter Chargeback Manager

VMware vCenter Chargeback Manager improves utilization of your virtual infrastructure with accurate visibility into the true costs of virtualized workloads. It enables line-of-business owners to have full cost transparency and accountability for self-service resource requests. It allows IT organizations to customize rate cards and prices to the processes and policies of different organizations.

VMware vSphere vMotion

VMware vSphere vMotion is VMware technology that supports live migration of virtual machines across servers without any interruption in the availability of the virtual machine. This allows the live relocation of virtual machines to new datastores.

VMware vSphere HA

VMware vSphere HA is a vSphere component that provides high availability for any application running in a virtual machine, regardless of its operating system or underlying hardware configuration.

vSphere HA provides uniform failover protection against hardware and operating system outages within your virtualized IT environment. vSphere HA specifically reduces unplanned downtime by using multiple VMware vSphere ESXi hosts, configured as a cluster, to provide rapid recovery from outages and cost-effective high-availability for applications running on virtual machines.

VMware vSphere DRS

VMware vSphere DRS dynamically and automatically balances load distribution and virtual machine placement across multiple ESXi hosts using vSphere vMotion.

EMC PowerPath/VE for VMware vSphere

EMC PowerPath/VE for VMware vSphere delivers multipathing features that optimize VMware vSphere virtual environments. PowerPath/VE installs as a kernel module on the vSphere ESXi host and works as a multipathing plug-in (MPP) that provides enhanced path management and load-balancing capabilities for vSphere ESXi hosts.

EMC VSI for VMware vSphere

EMC VSI for VMware vSphere is a vCenter plug-in that provides a single management interface for managing EMC storage. VSI provides a unified and flexible user experience that allows each feature to be updated independently, and allows new features to be introduced rapidly in response changing customer requirements.

When PowerPath/VE is installed on a vSphere ESXi host, VSI presents important multipathing details for devices, such as the load-balancing policy, the number of active paths, and the number of dead paths.

Page 32: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

32

This section describes how the VMware vCloud Director settings were configured to integrate this HA bundle with the EMC Cloud-enabled Infrastructure for SAP foundation bundle.

Figure 17 illustrates the cloud architecture with the HA add-on bundle.

Figure 17. Cloud architecture with the HA bundle

To enable the HA capability, the following components must be configured:

vSphere DRS cluster

Storage

Provider vDC (PvDC)

Organization vDC (OvDC)

Configuring VMware vCloud Director

Page 33: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

33 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Configuring DRS cluster

DRS clusters were configured to fully use HA capability for both management and resource clusters:

Management cluster – vSphere ESXi hosts from Datacenter B were added to the original management DRS cluster, facilitating the management cluster with cross datacenter HA capability.

Resource cluster – A new DRS cluster named Premium was created. vSphere ESXi hosts from both datacenters were added to the Premium DRS cluster, facilitating the resource cluster with cross datacenter HA capability.

Configuring storage

New datastores were created based on the EMC VPLEX Metro Distributed virtual volumes for both resource and management clusters together with storage DRS clusters and storage profiles to enable HA on the storage level.

The virtual machines on management cluster were non-disruptively migrated from the local VNX located on Datacenter A to the EMC VPLEX Metro distributed virtual volumes.

New datastores were created for both management and resource clusters, and the storage settings configured in vCloud Director are listed in Table 7.

Table 7. Storage configuration in vCloud Director

Consistency group Storage DRS cluster Storage profile Datastores

Management clusters

HA MGMT N/A VPLEX_MGMT_001 (4 TB)

VPLEX_MGMT_002 (4 TB)

Resource clusters HA Gold Tier HA Gold VPLEX_DS1 (4 TB) VPLEX_DS2 (4 TB) VPLEX_DS3 (4 TB) VPLEX_DS4 (4 TB) VPLEX_DS5 (4 TB)

HA Silver Tier HA Silver VPLEX_DS1_Silver (4 TB)

Eight 4 TB datastores were created for the management and resource clusters. For the resource cluster, two storage DRS clusters were created to group datastores with different performance levels:

HA Gold Tier—A storage DRS cluster for placing datastores set to the highest performance.

HA Silver Tier—A storage DRS cluster for placing datastores set to high performance.

Five 4 TB datastores were assigned to the HA Gold Tier storage DRS cluster and one 4 TB datastore to the HA Silver Tier storage DRS cluster. Two new storage profiles, HA Gold and HA Silver were created to characterize the datastores with HA capabilities. Afterwards, the storage profiles were assigned to the corresponding datastores in the storage DRS clusters as shown in Table 7.

Page 34: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

34

All created distributed volumes were presented to the all four vSphere ESXi hosts on both stretched clusters. Figure 18 shows the configuration details for the datastore cluster HA Gold Tier created for the resource cluster.

Figure 18. HA Gold Tier datastore cluster

Configuring PvDC

Two PvDCs, Advanced and Standard, were defined in EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper.

In this bundle a new PvDC named Premium was created based on the Premium cluster from the DRS cluster. The Premium PvDC is dedicated for SAP systems that require cross datacenter HA capability. Storage profiles HA Gold and HA Silver were added to the PvDC storage profile tab in the PvDC configuration. Figure 19 shows the storage profile configuration needed for PvDC.

Page 35: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

35 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Figure 19. PvDC configuration

For external networks in PvDC, the configuration in Advanced and Standard PvDC also applies to the Premium PvDC.

Configuring OvDC

We create new OvDCs to use cross datacenter HA capability. In this solution, a new OvDC called OrgB_Dedicated_HA for OrgB was built on top of Premium PvDC. The same organization networks configured in the foundation bundle were also created in the new OvDC. For vShield Edge gateways, we enabled high availability in the edge property as shown in Figure 20.

Figure 20. vShield Edge gateway configuration

Page 36: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

36

After this configuration, vCloud Director automatically deployed an additional standby vShield Edge gateway. Once the primary vShield Edge gateway is out of service, all the networking and security functions switch over to the standby gateway.

After successful configuration of the new OvDC, new SAP systems can be deployed. For the existing SAP systems that also require HA capability, but originally deployed in the foundation bundle, you can schedule a downtime and migrate them to the new OvDC.

vCenter Chargeback Manager provides the virtualized infrastructure metering functionality. It integrates with vCloud Director seamlessly, provides the service provider with the capability to chargeback the resources consumed by tenants, and generates cost and utilization reports periodically or on demand.

The EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper defines several pricing models to differentiate OvDC with different allocation models and storage profiles with different storage performance levels. There are several approaches to reflect the chargeback of HA capability. You can create and bind a pricing model to a new OvDC with HA capability, with reasonable charges for infrastructure including CPU, memory, and storage. In this way, you can accurately charge the infrastructure and account for the resources consumed in Datacenter B.

Another simpler approach is to set a rate factor for the storage profiles HA Gold and HA Silver to reflect a charge for the HA capability. You only need to set it once and all the vApps deployed on the storage profiles HA Gold and HA Silver will be charged accordingly with base rate times rate factor. That simplifies the management efforts needed to create pricing models for each OvDC compared to the previously described approach. Figure 21 shows how to set the rate factor in vCenter Chargeback Manager.

Figure 21. Setting the factor rate for DR storage profiles

EMC VPLEX Metro delivers concurrent access to the same set of devices at two physically separate locations and thus provides the active/active infrastructure that enables geographically stretched clusters based on VMware vSphere. The use of Layer 2 routing technology enables extension of virtual LANs, and their subnets, across different physical datacenters.

Configuring vCenter Chargeback Manager

VMware deployments on VPLEX Metro

Page 37: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

37 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Deploying VPLEX Metro with the following features and components provides the described functionality:

vMotion—Provides the ability to non-disruptively migrate virtual machines between datacenters in anticipation of planned events such as hardware maintenance or power outage of the datacenter.

vSphere DRS—Provides automatic load distribution and virtual machine placement across datacenters through the use of DRS groups and affinity rules.

vSphere HA—A VPLEX Metro environment with VPLEX Witness configured is considered as a VPLEX Metro HA configuration, as it ensures storage availability at the surviving datacenter in the event of a datacenter-level failure. Combining VPLEX Metro HA with a host failover clustering technology such as vSphere HA and Symantec ApplicationHA provides an end-to-end automatic application-aware restart for any datacenter-level failure or other outages. Figure 22 illustrates this HA architecture.

Figure 22. vSphere HA with Cross-Cluster Connect and VPLEX Witness–logical view

VPLEX Metro HA Cross-Cluster Connect—Provides additional protection of the vSphere HA cluster by adding a Cross-Cluster Connect between the local vSphere ESXi hosts and the VPLEX cluster on the remote datacenter.

EMC VPLEX distributed volumes are created across both locations. With underlying EMC VPLEX distributed volumes, vSphere HA clusters are stretched across both datacenters.

Page 38: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

38

The physical vSphere ESXi hosts are connected to the local VPLEX cluster on which they physically reside, and also have an alternate path to the remote EMC VPLEX cluster through the additional cross-connect network that physically breaks the VPLEX internal link connecting both VPLEX clusters, as show in Figure 225.

Figure 23 shows the paths for Cross-Cluster Connect as displayed by EMC VSI. EMC PowerPath/VE automatically detects and sets the cross-cluster paths and puts in auto standby mode. This minimizes cross-datacenter traffic under normal operation, and handles all paths down (APD) situations more efficiently when a datacenter fails.

Figure 23. EMC VSI PathViewer: VPLEX distributed devices and cross cluster paths

Best practices for EMC VPLEX Cross-Cluster Connect configuration

VPLEX Witness must be deployed in a third failure domain. The inter-cluster network latency is not to exceed 1 millisecond RTT between VPLEX clusters at the time of writing.

According to EMC best practice, all remote VPLEX connections should be zoned to the local host and local host initiators must be registered to the remote VPLEX. The distributed volume is then exposed from both VPLEX clusters to the same host. The host path preference should have a local path preference set, ensuring the remote path will only be used if the primary one fails so that no additional latency is incurred.

This bundle uses VPLEX Metro HA with Cross-Cluster Connect to maximize the availability of the VMware vSphere virtual machines. The key benefit of this solution is the ability to minimize any recovery time if components or even an entire VPLEX cluster fails. This is unlikely because there is no SPOF within a VPLEX engine. Since the physical host has an alternate path to the same storage actively served up by the remote VPLEX cluster, it will automatically remain online due to the VPLEX Witness, regardless of the rule set.

5 EMC VPLEX Metro Witness Technology and High Availability EMC TechBook and Using VPLEX Metro with VMware High Availability and Fault Tolerance for Ultimate Availability White Paper in the References section provides more details.

Page 39: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

39 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

VMware and EMC support a stretched cluster configuration that includes vSphere ESXi hosts from multiple datacenters. A VMware vSphere Metro Storage Cluster (vMSC)6 solution is also referred to as a stretched cluster.

In this bundle, both management and resource clusters, as described in the EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper, are stretched between Datacenter A and Datacenter B by using the EMC VPLEX Metro distributed virtual volumes with vSphere HA and vSphere DRS.

There are four vSphere ESXi hosts (physical servers) in each cluster, two at each datacenter, two for the management cluster and two for the resource cluster. VMware vCloud Director manages the resource cluster.

EMC VPLEX Metro HA Cross-Cluster Connect provides increased resiliency to the configuration as shown in Figure 22.

In VMware vCenter, you can view the vSphere Web Client configuration of the stretched cluster and the features enabled for it, as shown in Figure 24. This view also shows the memory, CPU, and storage resources available to the resource cluster.

Figure 24. vSphere cluster with HA and DRS enabled

The management cluster was configured in the same manner, with vSphere HA and DRS enabled and all virtual machines stored in VPLEX-based datastores.

6 For detailed requirements and scenarios, see the VMware Knowledge Base articles 1026692: Using VPLEX Metro with vSphere HA and 2007545: Implementing vSphere Metro Storage Cluster (vMSC) using EMC VPLEX document in the References section.

VMware stretched cluster configuration

Page 40: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

40

Configuring virtual network

Each vSphere ESXi host is configured with two 10 GbE physical adapters to provide network failover and high performance. A vSphere distributed switch (dvSwitch)7 provides a single, common switch across all hosts. The 10 GbE physical adapters (also referred to as uplink adapters) are assigned to the dvSwitch.

Four distributed port groups are assigned to the dvSwitch:

dvPG_Host—For virtual machine datacenter network traffic

dvPG_Management—For management traffic

dvPG_Corp —For uplinks to corporate IT Network

dvPG_vMotion—For vMotion traffic

Figure 25 shows the dvSwitch configuration. As both vSphere 5.1 distributed switches and physical switches support Link Layer Discovery Protocol (LLDP), the properties of the associated physical switches can also be easily identified from vSphere vCenter server.

Figure 25. dvSwitch configuration and LLDP detail

For details of the dvSwitch configurations, refer to the EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper. The configuration of the dvSwitch does not require any changes when moving to the HA bundle. 7 A dvSwitch provides a network configuration that spans all member hosts and enables virtual machines to maintain consistent network configuration for migration between hosts. For further information, see the VMware vSphere Networking ESXi 5.1 document in the References section.

Page 41: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

41 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Configuring VMware vSphere HA and VMware vSphere DRS

vSphere HA provides high availability for virtual machines by pooling the virtual machines and the vSphere ESXi hosts that they reside on into a cluster. Hosts in the cluster are monitored and in the event of a failure, the virtual machines on a failed host are restarted on alternate hosts. 8

vSphere HA uses multiple vSphere ESXi hosts, configured as a cluster, to provide rapid recovery from outages and cost-effective high availability for applications running in virtual machines. vSphere HA protects application availability in the following ways:

It protects against a server failure by restarting the virtual machines on other vSphere ESXi hosts within the cluster.

It protects against application failure by continuously monitoring a virtual machine and resetting it in the event of guest OS failure (VM monitoring).

In this bundle, both vSphere HA and DRS were enabled, as shown in Figure 26.

Figure 26. vSphere HA configuration settings

8 For further information on vSphere HA, see the VMware vSphere Availability ESXi 5.1 documentation.

VMware vSphere HA configuration

Page 42: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

42

VM monitoring

VM monitoring was configured to monitor the operating system and the application running inside the guest OS. VM monitoring was configured to restart individual virtual machines if their heartbeat is not received within the configured 60 seconds threshold. The virtual machine and application monitoring option was selected to allow Symantec ApplicationHA to communicate with vSphere HA to exchange virtual machine heartbeat status from the guest OS and application running.

Datastore heartbeating

To meet vSphere HA requirements for datastore heartbeating, additional datastores were created on VPLEX distributed volumes and presented to all the ESXi hosts on both clusters. In a production environment, vCenter automatically selects two or more datastores for this purpose, based on host visibility. The resource cluster was configured with five datastores (VPLEX_DS1, DS2, DS3, DS4, and DS5) as described earlier, and they were used for Datastore heartbeating, as shown in Figure 27.

Configuring multiple VPLEX datastores in a vMSC configuration provides higher redundancy for both datacenters. This enables vSphere HA to heartbeat to a datastore even in case of an IP link failure between datacenters, enabling vSphere HA to determine the state of a host in any failure scenario.

Figure 27. vSphere HA Cluster Status – heartbeat datastores

Virtual machine restart options

The virtual machine Restart Priority option for the SAP virtual machines was set per virtual machine. The ABAP SAP Central Instance Services (ASCS) and the SAP Database servers were set to High and the other SAP virtual machines were set to medium (default setting). This instructs the vSphere ESXi hosts to power on virtual machines first in the event of an outage. Figure 28 shows this setting. The Host Isolation Response setting was configured as Leave Powered On.

Page 43: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

43 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Figure 28. Virtual machine Restart Priority Response settings

vSphere HA monitoring

When you create a vSphere HA cluster, a single vSphere ESXi host is automatically elected as the master host. The master host monitors the state of all protected virtual machines and the slave hosts. When the master host cannot communicate with a slave host over the management network, it uses datastore heartbeating to determine whether the slave host has failed, resulting from a network partition or being isolated.

Although vSphere HA is configured by vCenter and exchanges virtual machine state information with vSphere HA, vCenter is not involved when vSphere HA responds to a failure, so while vSphere HA, by design, respond to failures without vCenter, HA relies on vCenter to be available to configure and monitor the cluster. In this bundle, both the management cluster and resource cluster vCenter virtual machines were configured with vSphere HA enabled with datastore heartbeating.

Admission Control

VMware vCenter Server uses HA Admission Control to reserve resources in the cluster to provide failover protection and ensure virtual machine resource reservations.

Admission Control was turned on for both resource and management clusters and the policy Percentage of Cluster Resources Reserved was selected and set to 50 percent. This configuration reserved cluster resources to guarantee the restart of all virtual machines in case of a simultaneous failure of two physical vSphere ESXi hosts in the same cluster.

Page 44: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

44

Best practices for HA clusters

EMC considered the following key vSphere HA cluster best practices9 in the bundle to ensure optimal performance:

Configure alarms to monitor cluster changes to expedite incident management

Configure alarms in VMware vCOPS to be triggered when vSphere HA takes action to maintain availability and send alerts, such as emails, to administrators for facilitating troubleshooting.

For further information about how to configure alarms in VMware vCOPS, refer to the EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper.

Monitor cluster validity to guarantee that reserved capacity for failover is available

A valid cluster is a cluster in which the admission control policy was not violated. A cluster enabled for vSphere HA becomes invalid when the number of virtual machines powered on exceeds the failover requirement— that is either the current failover capacity is smaller than configured failover capacity or when ESXi hosts fail, reducing the available capacity for reservation.

Enable Admission Control and select the Percentage of Cluster Resources reserved policy

This policy offers the most flexibility in terms of host and virtual machine sizing. Choose a percentage for CPU and memory that reflects the number of host failures that you want to support. This policy uses the actual reservation per virtual machine instead of taking the worst scenario like the other two policies available in Admission Control. The cluster dynamically adjusts when resources are added.

Size all the cluster hosts equally

For the Percentage of Cluster Resources Reserved policy, an unbalanced cluster results in excess capacity being reserved to handle failures, since vSphere HA reserves capacity for the largest hosts.

Mask datastores in a cluster basis (all vSphere ESXi hosts in the cluster)

Maximize the chance of restarting virtual machines after a failure by masking the datastores to all vSphere ESXi hosts that are part of the cluster. The vSphere HA master host will be able to access all the datastores to try to communicate to all vSphere ESXi hosts in the cluster to determine their status if the management network is not available after the failure.

To enable better handling of all-paths-down scenarios, the value of the advanced setting das.maskCleanShutdownEnabled was set to true. This setting allows vSphere HA to trigger a restart response for a virtual machine that has been shut down automatically due to a power outage condition. This setting is not enabled by default.

Configure vSphere DRS to support vSphere HA in highly utilized clusters

9 For further information, see VMware vSphere Availability ESXi 5.1 and VMware vSphere High Availability Deployment Best Practices.

Page 45: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

45 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Combine vSphere HA and DRS to protect against failures and to provide load balancing across hosts within a cluster. In a failure scenario, if vSphere HA cannot restart some virtual machines, it asks DRS to try to defragment resources to offer HA another opportunity to restart virtual machines. In order to achieve this, DRS needs to be enabled and configured to be fully automated.

Best practices for networking

The best practices10 for the configuration of host NICs and network topology for vSphere HA include recommendations for vSphere ESXi hosts, cabling, switches, routers, and firewalls as follows.

Suspend the Host Monitoring feature when making network changes

vSphere HA uses the management network to send/receive heartbeats to/from the clustered ESXi hosts. During network maintenance, heartbeat interruptions may happen and vSphere HA may trigger undesired attempts to failover virtual machines, reducing availability.

Reconfigure vSphere HA on all cluster hosts after ESXi network changes

vSphere HA will re-inspect the network information, and then re-enable Host Monitoring.

Notify the vSphere HA Admin in advance before any network maintenance

Networking is a vital component of vSphere HA. If any network maintenance must be performed, the vSphere HA administrator must be informed in advance to prepare the environment to ignore false positives.

Use the das.isolationaddress advanced attribute to add additional isolation addresses for additional networks

A network isolation address is an IP address that is pinged to determine whether a host is isolated from the management network (VMkernel). This IP address is pinged when a host stops receiving heartbeats from other hosts in the cluster. If the host can ping its isolation address, the host is not network isolated. If the host cannot ping the isolation address, the host has likely became isolated from the network.

Implement network redundancy at the NIC level with NIC teaming connecting to separate switches.

A single management network can result in failovers although only the network has failed. Redundant management networking allows the reliable detection of failures and prevents isolation or partition conditions from occurring, because heartbeats can be sent over multiple networks. Network heartbeating is the primary method to determine the state of a vSphere ESXi host, providing a resilient management network to enable a faster and proper host state determination, without involving datastore heartbeating.

Configure the fewest number of segments between clustered vSphere ESXi hosts

10 For further information, see VMware vSphere Availability ESXi 5.1 and VMware vSphere High Availability Deployment Best Practices.

Page 46: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

46

This recommendation is intended to limit the network SPOF. Routes with too many hops can cause networking packet delays for heartbeats, and increase the possible points of failure.

vSphere DRS host groups and virtual machine groups

DRS host groups and virtual machine groups simplify management of the vSphere ESXi host resources. In this bundle, the host groups and virtual machine groups were created for the resource cluster as shown in Figure 29.

Figure 29. Creating host groups and virtual machine groups

Table 8 and Table 9 show the DRS groups created for the solution and their assignments.

Table 8. Management cluster

DRS Group Name Group Type Type Group members

Datacenter_A_Servers Host DRS Group vSphere ESXi hosts r710a, r710b

Datacenter_B_Servers Host DRS Group vSphere ESXi hosts r710c, r710d

Datacenter_A_VMs VM DRS Group DNS virtual machine DNS server

Datacenter_B_VMs VM DRS Group DNS2 virtual machine DNS slave server

Cross_Datacenter_VMs VM DRS Group vCloud Director and vCenter virtual machines

vCloud Director, vCenterMC, vCenterRC, vChargeBack, SymantecHA, and others

Configuring VMware vSphere DRS

Page 47: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

47 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

The group members, vCenterMC and vCenterRC respectively stand for management cluster vCenter and resource cluster vCenter. The group members r710a, r710b, r710c, and r710d, respectively stand for vSphere ESXi hosts r710a and r710b (both located on Datacenter A), and vSphere ESXi hosts r710c and r710d (both located on Datacenter B).

Table 9. Resource cluster

DRS Group Name Group Type Type Group members

Datacenter_A_Servers Host DRS Group ESXi hosts c460-1 and c460-3

Datacenter_B_Servers Host DRS Group ESXi hosts c460-2 and c460-4

Datacenter_A_VMs Virtual machine DRS Group vShield virtual machine vse-Edge_OrgB_HA2-0

Datacenter_B_VMs Virtual machine DRS Group vShield virtual machine vse-Edge_OrgB_HA2-1

Cross_Datacenter_VMs Virtual machine DRS Group SAP ERP 6.0 virtual machines

SAPAB1DB, SAPAB1ASCS, SAPAB1AAS1, APAB1AAS2

Note: The DRS groups identify which vSphere ESXi hosts are in which physical datacenters.

The group members c460-1, c460-3, c460-2, and c460-4, respectively stand for vSphere ESXi hosts c460-1 and c460-3 (both physically located on Datacenter A), vSphere ESXi hosts c460-2 and c460-4 (both physically located on Datacenter B).

The group members SAPAB1DB, SAPAB1ASCS, SAPAB1AAS1, and SAPAB1AAS2 respectively stand for the SAP AB1 ERP Database server, SAP AB1 ABAP SAP Central Services, SAP AB1 ABAP Application Server 1, and SAP AB1 Additional Application Server 2.

vSphere DRS affinity rules

DRS uses affinity rules to control the placement of virtual machines on hosts within a cluster. DRS provides the following two types of affinity rules:

A VM-Host affinity rule specifies an affinity relationship between a group of virtual machines and a group of hosts. VM-Host affinity rule controls two vShield Edge gateway virtual machines running on different datacenters, as shown in Figure 30.

Figure 30. VM-Host affinity rule for resource cluster

Page 48: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

48

A VM-VM affinity rule specifies whether particular virtual machines should run on the same host or be kept on separate vSphere ESXi hosts. In this solution, both vCenter servers and SQL server virtual machines were configured to run on the same vSphere ESXi hosts in the management cluster, as shown in Figure 31.

Figure 31. vSphere DRS affinity rules for management cluster

A rule was created to keep the SAP Database and ASCS instances from running on the same host, as shown in Figure 32. This is because both systems are SPOFs and must be protected against ESXi host failures. An additional rule was created to keep the SAP Application Servers running on different ESXi hosts to provide redundancy of SAP work processes. The vShield Edge virtual machines protecting the resource cluster have two redundant virtual machines that should run respectively on Datacenter A and Datacenter B. The rules vShield_Edge_Datacenter_X, where X can be A or B, were created to instruct vSphere DRS to enforce that requirement.

Figure 32. vSphere DRS affinity rules for resource cluster

Page 49: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

49 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Table 10 lists all the vSphere DRS affinity rules consolidated for the management and resource clusters. vCenterMC and vCenterRC stand for vCenter for the management cluster and for the resource cluster respectively.

Table 10. Consolidated vSphere DRS affinity and anti-affinity rules

Cluster DRS rule names DRS Affinity Rules configured Rule type Object

Management Datacenter A and B DNS1 and DNS 2 servers Separate Datacenters

vCloud vCloud Director and vCenterRC Separate ESXi hosts

vCenter SQL, vCenterMC and vCenterRC Same ESXi hosts

Resource vShield Edge Datacenter A vShield Edge 0 Should run Datacenter A

vShield Edge Datacenter B vShield Edge 1 Should run Datacenter B

vShield Edge Anti Affinity vShield Edge VMs 0 and 1 Separate ESXi hosts

SAP AB1_Application_Servers SAP AAS1 and AAS2 Separate ESXi hosts

SAP_AB1_Central_Services SAP Database and ASCS Separate ESXi hosts

vSphere DRS rules are created to protect virtual machines against vSphere ESXi host failures on both management and resource clusters, as shown in Table 10. The SAP AAS1 and AAS2 are in the same rule, which instructs vSphere DRS to keep them running in separate vSphere ESXi hosts, giving flexibility for vSphere DRS to provide the availability and the protection level required at the same time.

EMC VSI provides enhanced visibility into VPLEX directly from the vCenter GUI. The Storage Viewer and Path Management features are accessible through the EMC VSI tab, as shown in Figure 33.

In this solution, VPLEX distributed volumes host the VPLEX_ DS1 to VPLEX _DS5 Virtual Machine File System (VMFS) datastore, and Storage Viewer provides details of the datastore’s virtual volumes, storage volumes, and paths.

LUNs that make up the datastore are 4 TB distributed RAID 1 VPLEX Metro volumes that are accessible via PowerPath, as shown in Figure 33.

EMC VSI and VPLEX

Page 50: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

50

Figure 33. VSI Storage Viewer–datastores

Symantec ApplicationHA adds a layer of application awareness to the core vSphere HA functionality offered by VMware virtualization technology.

Key benefits include the following:

Out of the box integration with the VMware vSphere vCenter and vSphere HA

Full visibility and control over applications with the ability to start, stop, and monitor applications running inside virtual machines

Standardized way to manage applications using a single interface that is integrated with VMware vSphere Client

Specialized Application Maintenance mode, in which Symantec ApplicationHA allows you to intentionally take an application out of its execution for maintenance or troubleshooting

Symantec ApplicationHA components

Symantec ApplicationHA consists of the following components in a VMware virtualization environment:

Symantec - High Availability Console

The HA Console is installed separately in the Symantec ApplicationHA monitoring environment and resides on a separate virtual machine.

Symantec - High Availability guest components for virtual machines

The Symantec ApplicationHA guest components are installed separately on the virtual machines where you want to monitor applications. The guest components include the configuration wizard and the High Availability agents that are used for configuring and monitoring applications.

The guest components also include the Veritas Storage Foundation Messaging Service. This service communicates the application monitoring status to the Symantec High Availability VM tab and dashboard tab.

Symantec ApplicationHA

Page 51: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

51 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Symantec ApplicationHA agents

Agents are application-specific modules that plug into the High Availability framework that manages applications and resources of predefined resource types in a system. The agents are installed when you install Symantec ApplicationHA guest components. These agents start, stop, and monitor the resources configured for the applications and report state changes. If an application or its components fail, Symantec ApplicationHA restarts the application and its resources inside the virtual machine.

Symantec ApplicationHA agents are classified as follows:

Infrastructure agents

Agents such as NIC, IP, and Mount are classified as infrastructure agents. Infrastructure agents are automatically installed as part of the Symantec ApplicationHA installation on virtual machines.

Application agents

The Symantec ApplicationHA agent pack is released on a quarterly basis. The agent pack includes support for new applications as well as fixes and enhancements to existing agents. You can install the agent pack on an existing HA guest components installation.

Refer to the Symantec Operations Readiness Tools (SORT) website for information on the latest agent pack availability.

Refer to the agent-specific configuration guide for more details about the application agents.

Working with vSphere vCenter Server

Symantec ApplicationHA communicates with vSphere HA. High Availability conveys the application health status in the form of an application heartbeat. This allows vSphere HA to automatically reset or restart a virtual machine if the application heartbeat is not received within a specified interval.

Figure 34 displays the sample deployment of Symantec ApplicationHA.

Figure 34. Sample deployment of Symantec ApplicationHA

Page 52: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

52

Symantec ApplicationHA provides a vCenter plug-in for integration with the vSphere Client and adds the following interfaces to perform application monitoring tasks.

The following interfaces appear in the vSphere Client after you install Symantec High Availability Console:

Symantec High Availability VM tab: As shown in Figure 35, this tab is the primary interface for performing the application monitoring operations on a virtual machine. You configure application monitoring and then monitor and control the configured application on the virtual machines. After configuring the application monitoring, the Symantec High Availability VM tab displays the state of the application and the component dependencies.

Figure 35. Symantec High Availability VM tab

Symantec High Availability dashboard: This dashboard is the primary interface for administering the configured applications on a VMware environment. After configuring application monitoring, the High Availability dashboard displays the state of the application. Figure 36 displays the High Availability dashboard with the SAP and Microsoft SQL Server applications that are configured for monitoring the SAP systems.

System administrators with the correct permissions can coordinate the start and stop of SAP services from the dashboard if required, as shown in Figure 36.

Page 53: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

53 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Figure 36. High Availability dashboard

In this solution, Symantec ApplicationHA is transparent to vCloud Director and is an optional component that adds important application-awareness to the vSphere Administrator for operations, as well as providing fast and automated restart of SAP and database services inside the virtual machines during process, virtual machine, and vSphere ESXi hosts tested outages.

Symantec ApplicationHA provides support for several OS, DB and application combinations described in the compatibility lists referred in the Symantec Operations Readiness Tools (SORT) website.

Symantec High Availability Console server was installed in a VMware virtual machine named SymantecHA, which was deployed in the management cluster and protected by vSphere HA with a high restart priority configured. According to Symantec ApplicationHA architecture, the guest agents and their updates are deployed in the Symantec High Availability Console server and distributed by pushing technology to the guest virtual machines where they are installed or updated.

The Symantec ApplicationHA guest components were installed on all SAP virtual machines in the resource cluster. SAP and Microsoft SQL Server agents were configured to manage the automatic restart of the SAP and Microsoft SQL database services detected in a running state during the initial installation and activation of the agents and in case of service, process, guest OS, or virtual machine outages.

In Windows environments, the SAP and Database Services can be restarted automatically during start up and service outages. Symantec ApplicationHA does that and also monitors the health of the database and communicates with vSphere HA. Symantec ApplicationHA also provides application health status in the Symantec High Availability VM tabs in the vSphere Client or web browser.

Page 54: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

54

SAP system architecture

This section describes the SAP system architecture deployed for the solution in the two datacenters. The SAP application layer uses these SAP components:

SAP ERP 6.0 Core IDES Enhancement Package 5

SAP NetWeaver 7.02

SAP ABAP SAP Central Services (ASCS)

SAP Database Services (DB)

SAP ABAP Application Server 1 (AAS1)

SAP Additional Application Server (AAS2)

SAP Global File systems (\sapmnt, \usr\sap\<SID>, and \usr\sap\trans)

All the SAP systems were installed and configured in a distributed architecture where we had separate virtual machines for the SAP central services, the database, and application servers. All SAP instances were installed on VMware vSphere virtual machines with SAP Applications running inside of them.

Note: Other operating systems and database combinations supported by the SAP Product Availability Matrix (PAM) and the EMC Solutions Support Matrix (SSM) can also be used with this solution.

The solution implemented a recommended distributed SAP system architecture, as shown in Figure 37.

Figure 37. SAP system architecture

Introduction

Configuring SAP system

Page 55: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

55 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

The enqueue server and message server were decoupled from the Central Instance and implemented as standalone services within the ASCS instance11. Two SAP Application Servers (AAS1 and AAS2) (formerly known as dialog instances) were installed to provide redundant work processes such as dialog (DIA), background (BGD), update (UPD), update V2 (UP2), spool (SPO), and gateway, to provide workload balance and protection against vSphere ESXi host, virtual machine or guest OS failures. Additional SAP Application Servers can be deployed to provide additional redundancy of SAP work processes.

The SAP ASCS, SAP Database, and the SAP Global File systems server are single points of failure (SPOF) components in SAP NetWeaver distributed architecture without the traditional OS/DB specific and complex clustering technologies until recently. This solution addresses these SPOFs as described in the Key design considerations section.

SAP Database Server outage

The whole SAP system is stopped in case the SAP Database server component fails. Once the database becomes available again, the SAP work processes in the SAP Application Servers reconnect automatically and the users can resume their work. All transactions currently in progress and not committed are rolled back for consistency.

SAP ASCS instance outage (enqueue and message server services)

In the case of SAP ASCS instance component failure, either or both the enqueue and the message server services fail. Either way a quick restart is required to restore these services to resume the SAP system operations.

If the enqueue service fails, the lock table stored in RAM is lost, the SAP application transaction locks stop, and the application is terminated. All open SAP transactions will be rolled back and the database stays consistent. When the enqueue server fails, the SAP functional team has to analyze the business process impact after this service is restarted.

If the message server stops working, new requests cannot be executed for dialog, update and enqueue server, but the existing connections are not affected.

SAP Global File systems (\sapmnt, \usr\sap\trans, and \usr\sap\<SID>)

In the case of SAP Global File systems (Windows file share) component failure, the file shares \sapmnt\<SID>, \usr\sap\<SID>, and the \usr\sap\trans are not available. The \sapmnt directory not being available prevents additional SAP application servers to be started, but the already active ones continue to run. If the global file system \usr\sap\<SID>\sys\global\ is not available, batch and spool processes cannot write their logs there, so batch and print job activities will be canceled. If the \usr\sap\trans transport directories are not available, the change and transport system cannot be used.

11 The enqueue server manages logical locks, also known as the lock server. The objective of this server is to minimize the duration of a database lock. Unlike database locks, an SAP lock can exist across several database logical units of work (LUWs). The message server informs all servers (instances) in an SAP system of the existence of the other servers. Other clients (for example, SAPlogon and remote function call clients with load balancing) can also contact the enqueue server for information about load balancing.

Outage impact

Page 56: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

56

The SAP system deployed for this solution implements the following key design features:

The Database and ASCS instances were installed with virtual hostnames to decouple them from the fixed virtual machine hostnames and IPs.

SAP patches, parameters, basis settings, and load balancing settings are all installed and configured according to the SAP installation guide and the SAP Notes listed in the References section.

VMware best practices for SAP were adopted in this solution12.

The SAP update processes (UPD/UP2) were configured on the primary and additional application server instances.

The SAP ASCS instance profile, start profiles, and dialog instance profiles were configured appropriately. The SAP ASCS instance also holds the SAP Global File systems \sapmnt, \usr\sap\<SID>, and \usr\sap\trans central directories, which are maintained by a central NFS server that restarts automatically in case of a virtual machine outage.

The SAP Database server running Microsoft SQL Server was configured according to the SAP notes listed in the References section.

SAP shared file systems, \sapmnt\<SID> and \usr\sap\<SID> were stored on the SAP ASCS server and shared to the all SAP AB1 virtual machines.

All virtual disks in the SAP virtual machines were created in the thick format (eagerzeroed). Multiple virtual disks were configured to separate the I/O pattern in different virtual disks (VMDKs) and in different paravirtual SCSI (PVSCSI) controllers—for example, SCSI Controller 1 Transaction Logs and SCSI Controller 2 Database Datafiles and so on.

This solution uses the encapsulated and virtualized storage for the entire SAP environment. Each SAP virtual machine was stored in one VPLEX-based datastore as shown in Table 11, as required by vCloud Director. Separate datastores were used for high availability purposes. The datastores are based on VPLEX virtual volumes distributed across the both datacenters and presented to the vSphere ESXi hosts running the SAP virtual machines through EMC VPLEX Metro.

All the SAP virtual machines in the solution were configured to be protected by Symantec ApplicationHA and VMware vSphere HA with restart priorities set, vSphere DRS with specific restart priorities, vSphere DRS with affinity and anti-affinity rules as described in Table 10.

The DRS anti-affinity rules were created to protect the SAP virtual machines against vSphere ESXi host failures that trigger a virtual machine restart, protecting the SAP systems, specially the SPOFs with fast restarts and a timely and automated recovery of the SAP services to the end users.

All SAP virtual machines had VMware tools installed and were configured with vmxnet3 network cards for best network performance.

12 For details about SAP Solutions on VMware vSphere, refer to High Availability and Best

Practices Guide.

Key design considerations

Page 57: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

57 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Because SAP allocates memory permanently and does not release it again, the memory reservation for all SAP system AB1 production-role virtual machines was set to the maximum size to avoid virtual machine paging and to optimize for best performance.

Table 11 shows the configuration of the SAP virtual machines. DC represents datacenter in the table.

Table 11. SAP virtual machines configuration

DC vSphere ESXi host

Virtual machine role

vSphere HA restart priority

Virtual CPUs

Memory

(GB) Disk (GB)

Virtual machine name

Datastore name13

A C460-1 SAP DB Medium 4 32 3700 SAPAB1DB VPLEX_DS2

B C460-2 SAP ASCS High 2 32 60 SAPAB1ASCS VPLEX_DS1

A C460-3 SAP Application Servers Low

4 32 60 SAPAB1AAS1 VPLEX_DS3

B C460-4 4 32 60 SAPAB1AAS2 VPLEX_DS4

13 VMware vCloud Director consumes storage from the configured storage profiles based on capacity requirements. For demonstration purposes, the SAP virtual machines were placed on HA Gold Tier (EFDs based) specific datastores, as shown in Table 11.

Page 58: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

58

EMC storage infrastructure

Overview

This section describes the storage infrastructure for the solution:

Next-generation VNX is the storage system at both datacenters.

Two VNX storage systems are deployed with a matching LUN configuration.

EMC VNX series provides five-nines availability in mission-critical business environments. VNX availability and redundancy features include:

Mirrored write cache, where each storage processor contains both primary cached data for its LUNs and a secondary copy of the cache for its peer storage processor

Battery backup to allow for an orderly shutdown and cache de-staging to vault disks to ensure data protection for possible power failures

RAID protection levels 0,1,1/0,3,5, and 6—all of which can co-exist in the same array simultaneously to match different protection requirements

Proactive hot sparing enhances system robustness and delivers maximum reliability and availability

Redundant data paths, power supplies, drive connections, and storage processors —all with non-disruptive field-replacement capabilities

Continuous system monitoring, call home notification, and advanced remote diagnostics

EMC VPLEX extends the continuous availability within and across datacenters.

EMC VNX Provisioning provides pool-based provisioning by implementing pool LUNs that can be either thin or thick. Thin LUNs provide on-demand storage that maximizes the utilization of storage by allocating storage as needed. In this solution, we created storage pools with different drive types and RAID configurations to take full advantage of FAST VP. We also selected Scheduled Auto-Tiering to enable FAST VP, as shown in Figure 38.

Introduction

EMC VNX HA

Configuring VNX

Page 59: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

59 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Figure 38. Creating a storage pool in Unisphere

We also created standard LUNs sizes of 4 TB that are presented from the VNX array to the VPLEX. Both VNX at datacenter A and datacenter B have similar configurations. Use Unisphere to create standard LUNs quickly and easily, as shown in Figure 39.

Figure 39. Creating LUNs in Unisphere

Page 60: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

60

Workload generation

An SAP Standard Application Benchmark, the SAP ERP Sales and Distribution (SD) Benchmark, was used to generate a sample SAP workload on the installed SAP ERP 6.0 EhP 5 AB1system, which was built for two of the test scenarios. The SAP-specific workload used in the Testing and validation scenarios demonstrates that even under a heavy workload, this solution provides consistent uptime without compromising the availability of the SAP system.

The SAP Standard SD Benchmark toolkit performed predefined standard SD transactions against the SAP AB1 system. The toolkit covers a sell-from-stock business scenario, which includes the creation of a customer sales order with five line items and its corresponding delivery with subsequent goods movement and invoicing. The scenario consists of the following transactions:

(VA01)Create a sales order with five line items

(VL01N) Create a delivery for the order

(VA03) Display the customer order

(VL02N) Change the delivery and post a goods issue

(VA05) List 40 orders for one sold-to party

(VF01) Create an invoice for the order

SAP Standard SD Benchmark configuration

SAP SD Benchmark Driver was installed on a virtual machine and ran the workload from a central location. To avoid data-locking situations, each benchmark user had its own master data. Table 12 presents the SD Benchmark configuration and workload, which is a typical customer activity.

Table 12. SD Benchmark configuration

Users per appl. server

Ramp-up time

Think time

Loops per user

High-load phase run time

Concurrent users

Sales orders/min

500 3 minutes

5 seconds

2 10 minutes 600 140

Benchmark run

A benchmark run consists of a ramp-up phase where all users log on one by one, a high-load phase where all users run their actions concurrently, and a ramp-down phase when all users log off. Figure 40 shows an example of the Benchmark run.

Figure 40. Benchmark run example

SAP standard SD Benchmark

Page 61: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

61 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Testing and validation

The EMC validation team initially installed and validated the environment without any high-availability protection scheme. We then transformed the environment to the mission-critical business continuity HA and application mobility solution described earlier in this white paper.

We carried out the following tests to validate different scenarios addressed in this solution and demonstrate the level of protection provided to the lab environment built. The testing and validation was divided between planned downtime and unplanned downtime test scenarios and from the less to the most disruptive, in order to demonstrate different benefits of this solution.

Planned downtime

a. vSphere ESXi host maintenance

Unplanned downtime

b. VMware virtual machine guest OS process failure with Symantec ApplicationHA

c. vSphere ESXi host hardware failure with Symantec ApplicationHA

d. EMC VPLEX isolation failure with SAP workload

e. Datacenter failure

Planned downtime can always be controlled. Reasons for planned downtime include hardware maintenance, hypervisor maintenance, installation of patches, upgrades of the server BIOS, and drivers or the hypervisor itself. These activities can cause downtime in your environment. However, you can plan the best time to perform these activities.

This solution provides the value of Downtime Avoidance in which SAP or other workloads can be moved across datacenters without causing interruption or foreseeable downtime from an external event, such as a power upgrade planned by the utility company.

a. Test scenario—vSphere ESXi host maintenance

The following test scenario validates that:

No disruption occurs in the SAP application running inside the VMware virtual machines, under a SAP Benchmark sample workload, running on a vSphere ESXi host that requires a planned hardware or software maintenance.

The EMC VPLEX Metro, vSphere HA, and vSphere DRS configurations described in earlier in this white paper ensure high protection levels during physical server maintenance.

The SAP workload described in Workload generation was executed in this test scenario. This test scenario demonstrates the SAP application mobility across datacenters during the planned downtime, as shown in Figure 41.

Introduction

Planned downtime

Page 62: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

62

Figure 41. SAP application mobility across datacenters

Objectives Verify that a vSphere ESXi host can be set to maintenance mode without interrupting

the SAP application running in its virtual machines.

Verify that the SAP virtual machines running under SAP workload on the vSphere ESXi host on maintenance mode located on Datacenter A, are moved across datacenters without interruption by using VMware vMotion and EMC VPLEX Metro.

Verify that the vSphere DRS affinity and anti-affinity rules are enforced by vCenter during the non-disruptive migration of the SAP virtual machines from Datacenter A to Datacenter B.

Testing procedure 1. Identify which vSphere ESXi hosts were running the SAP AB1 system virtual

machines. 2. Start the SAP Standard SD Benchmark run and wait for the High Load phase. 3. Select the vSphere ESXi host c460-1, where the SAP AB1 Database (SAPAB1DB)

and SAP Application Server 1 (SAPAB1AAS1) virtual machines were running and switched to maintenance mode.

4. Verify the non-disruptive migration of the SAP virtual machines from the vSphere ESXi host c460-1 to other vSphere ESXi hosts on Datacenter A and B.

5. Verify that the SAP system AB1system was not interrupted. 6. Verify that all vSphere DRS affinity and anti-affinity rules were followed.

Page 63: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

63 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Results Figure 42 summarizes the test environment and provides reference duration metrics in seconds obtained during the testing. The metric used was the Migration Time14.

Figure 42. Test environment

The SAP servers were moved across two datacenters that were physically separate, but the SAP end users never experienced any service interruption, as verified in the SAP system logs (SM21 transaction) of all SAP Application Servers.

Analysis The non-disruptive migration between two SAP virtual machines (SAPAB1 DB and SAPAB1 AAS1) was completed on average in 46 seconds after the c460-1 ESXi host switched to maintenance mode. The same test was repeated three times with the same conditions. The least favorable result was 59 seconds.

This test scenario demonstrates that:

A planned maintenance can be performed non-disruptively and on demand without affecting the planned downtime commitments on your current SLAs.

vSphere DRS automatically enforced the affinity rules configured to keep the SAP virtual machines SAPAB1ASCS, SAPAB1DB, and SAPAB1AAS1 and AAS2 instances in separate physical vSphere ESXi hosts, maintaining the same protection level that existed before the first server was set to enter in maintenance mode.

14 Migration Time is the total time taken for the migration to complete, beginning from the initiation of the migration. For more information refer to VMware vSphere 5.1 vMotion Architecture, Performance and Best Practices Technical White Paper.

Page 64: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

64

An administrator can proactively avoid the impact of a foreseeable outage, to preserve the availability of the business critical SAP applications, moving them live and within seconds to a remote datacenter, outside the area to be affected by the foreseeable outage.

Unplanned downtime cannot be controlled or anticipated, and can result from occurrences such as hardware failure, software failure, datacenter failure, and natural disasters. HA is a term used often connected to unplanned downtime. HA is a complex topic, which covers designing a solution to make not only your software highly available, but the whole IT infrastructure underneath in all layers (networks, routers, power supply, storage and so on) must be designed with high availability in mind.

b. Test scenario—VMware virtual machine guest OS process failure with Symantec ApplicationHA

This test scenario validates that an SAP system guest OS processes running inside virtual machines can be restarted quickly and automatically by Symantec ApplicationHA alone or in combination with VMware vSphere HA after a process failure.

No SAP workload was applied in the SAP system AB1 Application Servers during the test. This test scenario demonstrates a guest OS process failure during the unplanned downtime, as shown in Figure 43.

Figure 43. Guest OS process failure with Symantec ApplicationHA

Objectives Verify that Symantec ApplicationHA agent detects and restarts the SAP database

processes inside a virtual machine after an OS-triggered process failure.

Verify that the SAP database services (Microsoft SQL Server agent and Microsoft SQL Server database) running inside the SAP virtual machine are restarted automatically by Symantec ApplicationHA agents.

Unplanned downtime

Page 65: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

65 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Verify that Symantec ApplicationHA agent takes corrective action to restart the SAP database services (Microsoft SQL server Agent and Microsoft SQL server database) when a persistent failure affects the restart of database services.

Testing procedure The testing process below was executed three times in the same scenario to validate the results obtained. This process was executed with two variations, a single process failure and a persisting process failure with graceful shutdown, which are described as follows.

1. Identify the SAP Database instance that is specific to the running Microsoft SQL Server process and collect that process ID.

2. Verify the status of the SAP Database instance from the vSphere Client Symantec High Availability VM tab.

3. Open the Windows Task Manager on the guest OS of the virtual machine where the SAPAB1DB (SAP system AB1 Database server) is running and trigger an End Process Tree to the Microsoft SQL Server database process.

4. Verify that Symantec ApplicationHA agent detects the process outage and take actions to automatically restart the Microsoft SQL Server database processes in a timely manner without any manual intervention.

5. Verify that the SAP AB1 database server processes are restarted automatically and that the SAP AB1 system Application servers can reconnect to the SAP AB1 system Database instance.

6. Repeat step 3 to verify that Symantec ApplicationHA agents restart the SAP Database services and try to minimize the downtime15.

Results

Single process failure testing variation

Figure 44 summarizes the single SAP Database process failure set of tests performed and provides the duration metrics obtained during the three testing cycles. The metrics are shown in minutes and seconds (mm:ss).

Figure 44. Single SAP database instance (Microsoft SQL Server) process failure

Figure 44 depicts the durations of Microsoft SQL Server database restarting and SAP work processes reconnecting across the three testing cycles.

15 This step is applicable only for the variation Persisting Process Failure Testing, since the guest OS restart is only invoked after the number of process restart attempts cross the value configured in the Symantec ApplicationHA parameter App.RestartAttempts

Page 66: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

66

The values from the column Microsoft SQL Server Restart (mm:ss) were obtained from the Symantec ApplicationHA cluster log file (c:\Program Files\Veritas\cluster server\log\engine_A.log), which contains all of the activities executed by the Symantec ApplicationHA single cluster after the process failure occurrs. This log file lists the detailed actions taken and the output of database instance restart.

Symantec ApplicationHA agent automatically executed the following processes after a process failure:

1. Detects the process failure.

2. Cleans up all other dependent Microsoft SQL Server processes.

3. Restarts the Microsoft SQL Server agent and database.

The total duration of the execution of the above three processes was 24 seconds on average, considering all times obtained in all three tests cycles performed. This duration may vary based on the recovery process of the Microsoft SQL Server database at restart.

The SAP AB1 system Application Servers took on average 34 seconds, according to the SAP System Log to reconnect their work process to the SAP AB1 system database (***LOG BYY=> work process left reconnect state ) after manually initiating database failure.

The Symantec ApplicationHA agent for Microsoft SQL Server was configured to allow two restart attempts (App.RestartAttempts) to allow a single process failure testing without invoking the guest OS or vSphere HA to restart the OS or the virtual machine.

Persisting process failure testing

Figure 45 provides reference duration metrics obtained during the testing performed for the persisting SAP Database process failure with graceful shutdown. The metrics are shown in minutes and seconds (mm:ss).

Figure 45. Persisting SAP database (Microsoft SQL server) process failure

Figure 45 lists the durations of persisting process failures with the OS restart and the SAP work processes reconnection across three testing cycles.

Symantec ApplicationHA agent automatically executed the steps below after the manual trigger of the first process failure:

1. Detects the process failure.

2. Cleans up all other dependent Microsoft SQL Server processes.

3. Restarts the Microsoft SQL Server processes.

Page 67: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

67 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Symantec ApplicationHA automatically executed the following steps below after the manual trigger of the second process failure:

4. Detects the process failure.

5. Cleans up all other dependent Microsoft SQL Server processes.

6. Stops Microsoft SQL Server agent service.

7. Triggers a graceful shutdown of the guest OS.

8. Stops the virtual machine application heartbeat to notify vSphere HA and triggers a guest OS shutdown and restart.

The scenario durations of 3 minutes and 6 seconds on average were observed for the previous eight processes and the following four processes.

Restarts the guest OS.

Starts Symantec ApplicationHA cluster and agents.

Starts Microsoft SQL Server database.

Starts Microsoft SQL Server agent.

The Symantec ApplicationHA agent for Microsoft SQL Server was configured for one restart attempt (App.RestartAttempts), which forced a second process failure to invoke the restart of the guest OS.

The Symantec ApplicationHA parameter (VM.GracefullRebootPolicy) was set to enabled to execute a graceful shutdown of the operating system, before invoking a restart of the virtual machine.

Note: Symantec ApplicationHA parameter VM.GracefullRebootPolicy is used to enable or disable Symantec HA-initiated virtual machine restart policy. This option determines whether or not Symantec HA restarts the virtual machine in response to application and component failures. When a configured component application fails, Symantec HA attempts to restart the failed components. If the component fails to start, Symantec HA then takes the next corrective action. If this policy is disabled, and a component fails, then Symantec HA stops sending the heartbeat to VMware HA. VMware HA then restarts the virtual machine. If this policy is enabled, Symantec ApplicationHA invokes a native operating system command to restart the virtual machine.

For more information about the SAP NetWeaver database reconnect mechanism, refer to the SAP Note 98051—Database Reconnect: Architecture and function listed in the References section.

Page 68: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

68

Analysis Figure 46 summarizes the SAP Database instance process failures durations obtained in this test scenario performed and provides reference duration metrics obtained during the testing for comparison reasons. The metrics are shown in minutes and seconds (mm:ss).

Figure 46. Test-result summary of SAP Database instance process failures

This test scenario demonstrated the following:

Symantec ApplicationHA restarted the failed SAP Database Instance quickly, automatically and without any manual intervention of the SAP administrator or DBA, even after a persisting SAP critical database instance process failure.

Using Symantec ApplicationHA agents to recover from a single process failure inside the guest OS of a virtual machine is faster than executing a guest OS shutdown and then subsequently restarting it, due the wait required to load the guest OS and the applications services to restart correctly and resume the SAP services.

After a persistent process failure, the SAP WP reconnect time with a graceful shutdown durations obtained during the tests, indicated that the risk of a guest OS corruption is minimized when the parameter VM.GracefullRebootPolicy was set to enabled. This parameter enables a graceful shutdown and automatic restart of the guest OS and the Microsoft SQL Server services after a persistent process failure inside the guest OS.

Symantec ApplicationHA agents restarted the SAP database services (Microsoft SQL server AB1 database and Microsoft SQL server agent) to cleanly, quickly, and automatically recover from a severe incident, minimizing the total downtime of the SAP AB1 system. This significantly reduced the mission-critical SAP application downtime and the potential costs associated with the downtime.

Symantec ApplicationHA provides application monitoring, adding a new level of resiliency, integrating the virtualization layer with the application layer. Symantec ApplicationHA demonstrates that the application services were restarted in the correct order, avoiding the common problems associated with manual restarts and maintenance of SLAs.

Symantec ApplicationHA provides the automation required to start SAP application and database services in case of outages.

c. Test scenario—vSphere ESXi host hardware failure with vSphere HA and Symantec ApplicationHA

This test scenario validates that the management services and the SAP database services running inside the guest OS of a virtual machine can be restarted quickly and correctly in an automated manner using a combination of the vSphere HA, vSphere DRS,

Page 69: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

69 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

and Symantec ApplicationHA technologies after a simulated vSphere ESXi host failure, as shown in Figure 47.

Figure 47. VMware vSphere ESXi host failure with vSphere HA and Symantec ApplicationHA

Objectives Verify that vSphere HA will restart the virtual machines of the surviving vSphere

ESXi hosts in the management and resource clusters after a vSphere ESXi host simulated failure.

Verify that the SAP database services (Microsoft SQL server database and Microsoft SQL server agent) running inside the SAPAB1DB virtual machine are restarted automatically by Symantec ApplicationHA agents after the virtual machine is restarted by vSphere HA.

Verify that the vSphere DRS anti-affinity rules are enforced during the restart of the virtual machines for both management and resource clusters.

Testing procedure Execute the following steps three times in the exact same conditions to validate the results:

1. Identify the vSphere ESXi hosts in which each management and SAP virtual machines were running.

2. Identify the vSphere ESXi hosts in which the SAP system AB1 database and ASCS instances virtual machines were running.

3. Select the vSphere ESXi host where the vCloud Director virtual machine of the management cluster is running and power it off from the hardware management console.

Page 70: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

70

4. Select the vSphere ESXi host where the SAPAB1DB virtual machine of the resource cluster is running and power it off from the hardware management console.

5. Verify that vSphere HA quickly and automatically restarts the vCloud Director virtual machine from one vSphere ESXi host 1 over to ESXi host 2.

6. Verify that the SAPAB1DB virtual machine restarts by using vSphere HA and the Microsoft SQL Server database services also automatically and correctly restart by using Symantec ApplicationHA agents.

7. Verify that all vSphere DRS anti-affinity rules were enforced on both the management and resource clusters.

Symantec ApplicationHA agents were only installed and configured on the SAP virtual machines for this solution.

Results Figure 48 summarizes the test environment described above and provides reference duration metrics obtained during the testing. The metrics are shown in minutes and seconds (mm:ss).

Figure 48. ESXi failure of resource and management clusters

As shown in Figure 48, vSphere HA restarted the vCloud Director virtual machine in the same datacenter from the vSphere ESXi host r710b to the vSphere ESXi host r710a after the failure (physical server powers off).

vSphere HA restarted the SAP database virtual machine (SAPAB1DB) that is running on the vSphere ESXi host c460-1 on Datacenter A, across datacenters, on the vSphere ESXi

Page 71: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

71 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

host c460-2 on Datacenter B, after the failure (physical server power off) of the vSphere ESXi host c460-1, as shown in Figure 49. The virtual machine restart time observed was an average of 50 seconds, as shown in Figure 48.

Figure 49. Events pane in vCenter web client

Symantec ApplicationHA agent restarted the SAP database services (Microsoft SQL Server agent and database) inside the guest OS of the SAP AB1DB virtual machine in an average of 3 minutes and 36 seconds, as shown in Figure 48.

vSphere DRS, based on the configured affinity rules described in Table 10 and identified that the SAPAB1DB virtual machine was restarted by vSphere HA on the vSphere ESXi host c460-2 where the SAPAB1ASCS virtual machine was also running. Then VMware vMotion non-disruptively migrated the SAPAB1ASCS virtual machine from the vSphere ESXi host c460-2 to the vSphere ESXi host c460-3 on Datacenter B, as shown in Figure 50.

Figure 50. Events pane in vCenter web client

Analysis This test scenario demonstrates the following:

vSphere HA restarted the management services quickly, automatically, without any manual intervention after a vSphere ESXi host failure.

vSphere HA combined with Symantec ApplicationHA restarted the SAP Database services quickly, automatically, and across datacenters without any manual intervention after a severe vSphere ESXi host failure.

vSphere DRS affinity rules were enforced after the failure with the VMware vShield Edge gateway0 virtual machine being restarted on Datacenter A, as configured in the vSphere DRS rules to keep each vShield Edge gateways on their own datacenter for possible failures.

vSphere DRS identified that affinity rules were violated having the SAP Database virtual machine and the SAPAB1ASCS virtual machine running on the same vSphere ESXi host. Affinity rules were then enforced, triggering a VMware vMotion migration of the SAPAB1ASCS virtual machine to another vSphere ESXi host, which restored the previous level of protection provided before the vSphere ESXi host failure. Even after the failure, keeping the SAP virtual machines in separate vSphere ESXi hosts provided resiliency.

Symantec ApplicationHA automatically restarted the SAP database services (Microsoft SQL Server agent and database) after vSphere HA restarted the operating system, recovering from the incident and minimizing the downtime incurred to the SAP AB1 system.

Page 72: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

72

d. Test scenario—VPLEX cluster isolation failure with SAP workload

This test scenario validates that, when an isolation failure of a VPLEX cluster occurs, vCloud Suite, vCenter, the SAP applications, and the database continue operation without interruption.

The SAP workload described in Workload generation was executed in this test scenario.

Note: EMC VPLEX Witness, EMC Cross-Cluster Connect, and an inter-site link with 1 millisecond round trip time was used in this test scenario.

To test this failure scenario, we simulated the isolation of a specific cluster on Datacenter A, with both the external management IP network and the VPLEX WAN communications network partitioned. The LAG network remained available during the test. VPLEX Witness on Datacenter C and VPLEX cluster-2 on Datacenter B also remained available and in communication, as shown in Figure 51.

Figure 51. Datacenter A VPLEX cluster isolation failure

Objectives Verify that all SAP virtual machines on Datacenter A continue to run non-disruptively

through the EMC VPLEX counterpart in Datacenter B, through its Cross-Cluster Connect when an isolation failure of a VPLEX cluster occurs on Datacenter A.

Verify that the SAP workload continues to run without interruption by the EMC VPLEX storage failure.

Page 73: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

73 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Testing procedure 1. Start the SAP Standard SD Benchmark run and monitor until it reaches the high

load phase.

2. Simulate isolation of the preferred cluster on Datacenter A, with both the external management IP network and the VPLEX WAN communications network partitioned. The LAG network remains available.

3. Check PowerPath/VE status. The paths to Datacenter A VPLEX storage should be dead, but the paths to the VPLEX storage on Datacenter B should be alive and be serving the vSphere ESXi hosts on Datacenter A.

4. Verify the VPLEX functionality; check that the storage access was not lost.

5. Verify the status of SAP SD Benchmark and check that the processes were not stopped or interrupted at any point in time during the storage failover to Datacenter B.

Results Table 13 shows the expected and observed behaviors of the system when the VPLEX at Datacenter A was isolated.

Table 13. Expected and observed behaviors

System name Status prior to VPLEX isolation

Expected behavior

Observed behavior

Virtual machines on Datacenter A management cluster

On r710a:

vChargeback

ssenet (DNS server)

On r710b:

vCloud Director

Available Available Available

Virtual machines on Datacenter A resource cluster

On c460-1:

SAPAB1DB

On c460-3:

SAPAB1ASCS

vShield Edge

Available Available Available

VPLEX cluster VPLEX1 – Datacenter A – cluster-1

VPLEX2 – Datacenter B – cluster-2

VPLEX witness

Available Unavailable

Available

Available

Unavailable

Available

Available

SAP services Database, Enqueue, and Message Servers

Available Available Available

Analysis When VPLEX on Datacenter A became isolated, VPLEX Witness ensured that the consistency group detach rule, which defines cluster-1 as the preferred cluster, was overridden and the storage served by VPLEX cluster-2 on Datacenter B remained available, as shown in Figure 52.

Page 74: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

74

Figure 52. VPLEX status after VPLEX isolation on Datacenter A

Figure 53 shows the behavior of EMC PowerPath during the isolation event. Prior to the isolation, PowerPath sets the Cross-Cluster paths for the hosts on Datacenter A to automatic standby proxy. When the isolation event occurs, PowerPath is able to detect the isolation of the VPLEX LUNs on Datacenter A and service I/O on the standby paths. When the cluster is recovered, PowerPath automatically recovers the dead paths. For details of VPLEX Cross-Cluster Connect, refer to VMware deployments on VPLEX Metro.

Figure 53. PowerPath status from after VPLEX isolation on Datacenter A

Because the vSphere ESXi hosts are connected to both VPLEX clusters in each datacenter, vSphere ESXi with EMC PowerPath/VE simply routes the I/O to the alternate path, which is available since VPLEX is configured with a distributed volume protected by VPLEX Witness.

In both the management and resource clusters, the vSphere ESXi hosts on Datacenter A remain available and all virtual machines remain active because of VPLEX Metro HA Cross-Cluster Connect.

Page 75: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

75 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

The SAP AB1 system keeps running without interruption, as shown in Figure 54.

Figure 54. SAP AB1system running during after VPLEX cluster isolation on Datacenter A

This test scenario demonstrates the following:

Even after VPLEX isolation, the SAP SD Benchmark workload is not interrupted during the transparent failover of the storage services from the VPLEX from Datacenter A to Datacenter B.

EMC PowerPath/VE identifies the failed paths and re-directs the FC traffic through the active paths, which are connected to the VPLEX storage on Datacenter B. This makes the VPLEX failure transparent to the SAP application running inside the SAP virtual machines on Datacenter A, as well as avoiding the virtual machines to be restarted on other ESXi hosts, avoiding a downtime.

Page 76: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

76

e. Test scenario—datacenter failure

This test scenario validates that, in the event of a complete datacenter failure, all virtual machines running on one datacenter in both management cluster and resource cluster are restarted quickly, with the correct sequence on surviving datacenter.

To test this failure scenario, we simulated a complete failure of Datacenter A, including VPLEX cluster, ESXi host, and network. The VPLEX Witness remained available on Datacenter C. VPLEX cluster-2 remained on Datacenter B to communicate with the VPLEX Witness on Datacenter B, as shown in Figure 55.

Figure 55. Complete failure of Datacenter A

Objectives Verify that the standby vShield Edge gateway on Datacenter B can take over and

resume the networking services.

Verify that all virtual machines on both management and resource clusters are restarted on Datacenter B.

Verify that the configured DRS affinity rules are followed and kept in compliance.

Testing procedure 1. Shut down all the ESXi hosts, switches, and VPLEX cluster-1 in Datacenter A to

simulate a complete failure.

2. Verify that the VPLEX cluster-2 on Datacenter B remains available.

3. Execute a continuous ping command from the SAPAB1AAS2 system running on Datacenter B to verify that the standby vShield Edge gateway on Datacenter B takes over the network service.

4. Verify that vSphere HA detects the ESXi hosts failure on Datacenter A.

Page 77: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

77 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

5. On both management and resource clusters, verify that vSphere HA restarts all virtual machines, previously running on Datacenter A, on Datacenter B after failure of Datacenter A.

6. Verify that the vSphere DRS affinity rules are followed when virtual machines start running after the failure on Datacenter A, as configured in Table 10 in Configuring VMware vSphere DRS .

Results Table 14 shows the observed behaviors of the system and reference metrics in minutes and seconds (mm:ss) after the Datacenter A failure. DC represents datacenter in the table.

Table 14. Reference metrics after failure on Datacenter A

Before After

Cluster DC ESXi host

Virtual machines DC ESXi host Restart time

(mm:ss) DRS rules

Management cluster

A r710a vChargeBack B r710c 3:25 Compliant

A r710b vCloud Director B r710d 3:12 Compliant

A r710b DNS1 DNS1 was not restarted on Datacenter B due the affinity rule, but the DNS service resumed over to DNS2 on Datacenter B.

Compliant

Resource cluster

A c460-1 SAPAB1DB B c460-2 4:55 Compliant

A c460-3 vShield Edge 0

The vShield Edge 0 was not restarted on Datacenter B due an Affinity Rule, while gateway service failover to the vShield Edge 1 on Datacenter B

Compliant

A c460-3 SAPAB1ASCS B c460-4 3:25 Compliant

Analysis When Datacenter A fails, VPLEX Witness ensures that the consistency group’s detach rule, which defines cluster-1 as the preferred cluster, is overridden and the storage served by VPLEX cluster-2 on Datacenter B remains available, as shown in Figure 56.

Figure 56. Status of VPLEX cluster after Datacenter A failure

When the vSphere ESXi hosts on Datacenter A failed, vSphere HA detected the datacenter failure and did the following:

Restarted vCloud Director and vChargeback virtual machines in the management cluster on Datacenter B.

Page 78: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

78

Restarted the SAPAB1DB and SAPAB1ASCS virtual machines in the resource cluster on Datacenter B.

SAPAB1DB was restarted on a vSphere ESXi host different from the one with SAPAB1ASCS on Datacenter B, which is based on the configured DRS affinity rules as described in Table 10. The SAP end users of SAPAB1AAS1 lost their sessions due to the vSphere ESXi host failure, but logged on again when SAPAB1DB and SAPAB1ASCS restarted on Datacenter B.

On each cluster, the virtual machines that failed on the Datacenter A were restarted on the surviving Datacenter B by vSphere HA. The DNS service failed over to DNS2 on Datacenter B without interruption. The vShield Edge gateway 1 failed over to the vShield Edge gateway switch 1 in a few seconds, which is considerably faster than the SAP virtual machines restart, as shown in Figure 57.

Figure 57. vShield Edge gateway failover

Page 79: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

79 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

Conclusion

This solution demonstrates the HA and application mobility bundle that is added to the foundation bundle solution across datacenters with the high availability provided by EMC VPLEX Metro and vSphere HA/DRS.

The solution combines EMC, VMware, SAP, and Symantec HA components to:

Eliminate SPOFs at all the infrastructure layers

Eliminate any recovery time by using VPLEX Metro with Cross-Cluster Connect

Minimize downtime providing high availability for vCloud environment across datacenters

Enable mission-critical business continuity for SAP applications running across local and remote datacenters

Simplify operational management and allow application monitoring and mapping for the entire infrastructure up to SAP Application layer using Symantec ApplicationHA

EMC Cloud-enabled Infrastructure HA and application mobility bundle provides these benefits:

Provide continuous availability for mission critical SAP infrastructure across stretched clusters:

VPLEX Metro is key enabler for delivering storage infrastructure 100% uptime for SAP as infrastructure is “always on” systems even with the loss of a datacenter. This also enables zero downtime for planned maintenance of the SAP infrastructure.

Increase utilization and reduce cost by sizing SAP for “steady state” versus “peak performance”

SAP systems require capacity for thousands of users, large databases, and peak processing requirements. Deploying incremental infrastructure for SAP high availability can be costly. EMC VPLEX with VMware enables customers to offer capacity on demand, non-disruptively adding SAP resources within and between arrays or over distance for workload relocation and re-balancing. Thus, there are no expensive resources sitting idle since SAP is active at both datacenters. This allows infrastructure in both datacenters to be fully utilized.

Summary

Page 80: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

80

To validate the solution, the EMC validation team performed the tests described in the Testing and validation section and noted the following behaviors:

Simulated a SAP database service process failure The failed process was quickly, cleanly, and automatically restarted by Symantec ApplicationHA in seconds.

Simulated a vSphere ESXi host failure in the management cluster All virtual machines were quickly and automatically restarted in minutes by vSphere HA and the SAP services restarted by Symantec ApplicationHA.

Simulated a complete datacenter A failure All virtual machines were quickly and automatically restarted in minutes by vSphere HA on the surviving datacenter.

Validated VPLEX Witness functionality during simulated isolation of a VPLEX cluster vCloud environment and SAP application continued without interruption.

It also demonstrates how VPLEX Metro, combined with VMware Metro stretched cluster and Layer 2 networking, extends high availability to break the boundaries of the Cloud-enabled datacenter and allow ESXi hosts at two datacenters to have read/write access to shared block storage devices. VPLEX Witness and Cross-Cluster Connect provide the highest level of availability and resilience.

Findings

Page 81: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

81 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

References

Note: All links were working correctly at the time of publication.

For additional information, see the following EMC documents (available on EMC.com and the EMC Online Support website):

EMC Cloud-enabled Infrastructure for SAP Applications Business White Paper

EMC Cloud-enabled Infrastructure for SAP EMC Next-Generation VNX Reference Architecture

Using VPLEX Metro with VMware High Availability and Fault Tolerance for Ultimate Availability

EMC Cloud-Enabled Infrastructure for SAP Foundation Bundle—VNX White Paper

EMC VPLEX Metro Witness Technology and High Availability Techbook

Using VMware vSphere with EMC VPLEX Best Practices Planning

Conditions for Stretched Hosts Cluster Support on EMC VPLEX Metro

EMC VPLEX with GeoSynchrony 5.0 Configuration Guide

EMC Solutions Support Matrix (SSM)

Implementation and Planning Best Practices for EMC VPLEX Technical Notes

EMC VPLEX with GeoSynchrony 5.0 and Point Releases CLI Guide

Validating Host Multipathing with EMC VPLEX Technical Notes

EMC Powerpath/VE for VMware vSphere Best Practices Planning

For additional information, see the following VMware documents:

VMware vCloud Architecture Toolkit (VCAT)

Virtualizing Business-critical Applications with Confidence Technical Papers

VMware—SAP Solutions on VMware Business Continuity—Protecting Against Unplanned Downtime

SAP Solutions on VMware Best Practices Guide

VMware vSphere Metro Storage Cluster Case Study Technical Papers

VMware vSphere High Availability 5.0 Deployment Best Practices

VMware vSphere Networking ESXi 5.1 and vSphere Availability ESXi 5.1

VMware vSphere Resource Management ESXi 5.1 and vCenter Server 5.1

VMware Knowledge Base articles

1026692: Using VPLEX Metro with vSphere HA 2007545: Implementing vSphere Metro Storage Cluster (vMSC) using EMC

VPLEX

SAP —Solutions on VMware vSphere: High Availability

EMC

VMware

Page 82: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

82

VMware Performance Best Practices for VMware vSphere 5.1

VMware vSphere 5.1 vMotion Architecture, Performance and Best Practices

SQL Server on VMware Best Practices Guide

VMware vSphere vMotion—Architecture, Performance and Best Practices in VMware vSphere 5

The Design and Evolution of Live Storage Migration in VMware ESX

VMware High Availability —Concepts, Implementation and Best Practices

VMware vSphere 5.1 Documentation Center

For additional information, see the following SAP documents:

SAP Solution Management Business Continuity Best Practices

SAP NetWeaver High Availability and Business Continuity in Virtual Environments with VMware and Hyper-V on Microsoft Windows

SAP Installation Guide for SAP Systems Based on the Application Server ABAP of SAP NetWeaver on Windows: Microsoft SQL Server Using Software Provisioning Manager 1.0 -Valid for SAP Systems Based on SAP NetWeaver 7.0 including Enhancement Package 2

High Availability Solutions for SAP on VMware

SAP Support Notes

SAP Note 1380654—SAP Support in Public Cloud Environments

SAP Note 1492000—General Support Statement for Virtual Environments

SAP Note 1612283—Hardware Configuration Standards and Guidance

SAP Note 1580509—Windows Server Editions Supported by SAP

SAP Note 1409608—Virtualization on Windows

SAP Note 1056052—Windows: VMware vSphere Configuration Guidelines

SAP Note 0611361—Hostnames of SAP Servers

SAP Note 0962955—Use of Virtual TCP/IP Host Names

SAP Note 1733195—Microsoft SQL Server Version and Download List

SAP Note 0062988—Service Packs for Microsoft SQL Server

SAP Note 0098051—Database Reconnect: Architecture and Function

SAP Note 0024806—Database Reconnect: Technical Details and Settings

SAP Note 1482272—Key Figures of Virtualization on VMware vSphere

SAP Note 1680045—Release Note for Software Provisioning Manager 1.0

SAP Note 1716219—SAP Release Note for SL Toolset 1.0 SPS06

SAP

Page 83: EMC CLOUD-ENABLED INFRASTRUCTURE FOR · PDF fileWhite Paper EMC Solutions Abstract This white paper focuses on high availability and application mobility add-on bundle of the on-premises

83 EMC Cloud-Enabled Infrastructure for SAP—Business Continuity Series: High Availability and Application Mobility Bundle —VNX

For additional information, see the Symantec website, the main page of Symantec ApplicationHA, and the following Symantec documents:

Symantec ApplicationHA Release Notes

Symantec ApplicationHA Installation and Upgrade Guide

Symantec ApplicationHA User Guide

Symantec ApplicationHA Agent for SAP NetWeaver Configuration Guide

Symantec ApplicationHA Agent for Microsoft SQL Server Configuration Guide

Symantec