nyoug delphix slideshare

Agile Data : Virtual Data Revolution [email protected] kylehailey.com slideshare.com/khailey

Upload: kyle-hailey

Post on 27-Jan-2015

TRANSCRIPT

Page 1: Nyoug delphix slideshare

Agile Data : Virtual Data Revolution

[email protected]

slideshare.com/khailey

Page 2: Nyoug delphix slideshare

In this presentation :

• Problem in IT
• Solution
• Use Cases

Page 4: Nyoug delphix slideshare

The Phoenix Project

• Bottlenecks
• Metrics
• Priorities
• Goals
• Iterations

“The Goal”

by E. Goldratt

Page 5: Nyoug delphix slideshare

The Phoenix Project

“Any improvement not made at the constraint is an illusion.”

Page 6: Nyoug delphix slideshare

The Phoenix Project

“Any improvement not made at the constraint is an illusion.”

What is the constraint?

Page 7: Nyoug delphix slideshare

The Phoenix Project

“Any improvement not made at the constraint is an illusion.”

What is the constraint?

“One of the most powerful things that IT can do is get environments to development and QA when they need it”

Page 8: Nyoug delphix slideshare

Problem in IT

I. Data Constraint strains IT
II. Data Constraint price is huge
III. Data Constraint companies unaware

Page 9: Nyoug delphix slideshare

Problem in IT

CIO Magazine Survey:

• 60% of projects over schedule
• 85% delayed waiting for data

Data is the Constraint

Current situation: only getting worse … Data Doomsday

Page 10: Nyoug delphix slideshare

I. Data Constraint strains IT

If you can’t satisfy the business demands then your process is broken.

Page 11: Nyoug delphix slideshare

II. Data Constraint price is huge

Page 12: Nyoug delphix slideshare

III. Data Constraint : companies unaware

Page 13: Nyoug delphix slideshare

Data is the constraint

I. Data Constraint strains IT
II. Data Constraint price is huge
III. Data Constraint companies unaware

Page 14: Nyoug delphix slideshare

I. Data Constraint strains IT

– Moving data is hard

– Triple tax

– Data Floods infrastructure

Page 15: Nyoug delphix slideshare

I. Data Constraint : moving data is hard

– Storage & Systems
– Personnel
– Time

Page 16: Nyoug delphix slideshare

Typical Architecture

[Diagram: Production: Instance, Database, File system]

Page 17: Nyoug delphix slideshare

Typical Architecture

[Diagram: Production (Instance, Database, File system) and Backup (Database, File system)]

Page 18: Nyoug delphix slideshare

Typical Architecture

[Diagram: Production (Instance, Database, File system), Reporting (Instance, Database, File system), and Backup (Database, File system)]

Page 19: Nyoug delphix slideshare

Typical Architecture

[Diagram: Production (Instance, Database, File system) copied to Dev, QA, UAT, Reporting, and Backup, each copy with its own Database and File system]

Triple Tax

Page 20: Nyoug delphix slideshare

Typical Architecture

[Diagram: Production and its copies, each with its own Instance, Database, and File system]

Page 21: Nyoug delphix slideshare

I. Data constraint: Data floods company infrastructure

In financial services, 92% of the cost of business is “data” (www.wsta.org/resources/industry-articles)

Most companies have 2-9% IT spending (http://uclue.com/?xq=1133)

Data management is the largest part of IT expense

Gartner: Data Doomsday

Page 22: Nyoug delphix slideshare

Data is the constraint

I. Data Constraint strains IT
II. Data Constraint price is huge
III. Data Constraint companies unaware

Page 23: Nyoug delphix slideshare

Part II. Data constraint price is Huge

Page 24: Nyoug delphix slideshare

Part II. Data constraint price is Huge

• Four Areas data tax hits

1. IT Capital resources
2. IT Operations personnel
3. Application Development
4. Business

Page 26: Nyoug delphix slideshare

II. Data constraint price is huge : 1. IT Capital

• Hardware
– Servers
– Storage
– Network
– Data center floor space, power, cooling

Page 27: Nyoug delphix slideshare

Part II. Data constraint price is Huge

• Four Areas data tax hits

1. IT Capital resources
2. IT Operations personnel
3. Application Development
4. Business

Page 28: Nyoug delphix slideshare

II. Data constraint price is huge : 2. IT Operations

• People
– DBAs
– Sys Admin
– Storage Admin
– Backup Admin
– Network Admin
• Hours : 1000s just for DBAs
• $100s of millions for data center modernizations

Page 29: Nyoug delphix slideshare

Part II. Data constraint price is Huge

• Four Areas data tax hits

1. IT Capital resources
2. IT Operations personnel
3. Application Development
4. Business

Page 30: Nyoug delphix slideshare

II. Data constraint price is Huge : 3. App Dev

• Inefficient QA: Higher costs of QA
• QA Delays: Greater re-work of code
• Sharing DB Environments: Bottlenecks
• Using DB Subsets: More bugs in Prod
• Slow Environment Builds: Delays

“If you can't measure it, you can’t manage it”

Page 31: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev

[Chart: long Build time vs. short QA Test time]

96% of QA time was building environment

$.04/$1.00 actual testing vs. setup

Page 32: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev

[Timeline over Sprint 1, Sprint 2, Sprint 3: Build QA Env and QA phases, with the point where a bug (X) is coded]

[Chart: Cost To Correct (0 to 70) vs. Delay in Fixing the bug (1 to 7)]

Software Engineering Economics – Barry Boehm (1981)

Page 33: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev full copies cause bottlenecks

Frustration Waiting

Old Unrepresentative Data

Page 34: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev subsets cause bugs

Page 35: Nyoug delphix slideshare

Production

II. Data Tax is Huge : 3. App Dev subsets cause bugs

The Production ‘Wall’

Page 36: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev

– Developer: asks for DB, gets access
– Manager: approves
– DBA: requests system, sets up DB
– System Admin: requests storage, sets up machine
– Storage Admin: allocates storage (takes snapshot)

3-6 Months to Deliver Data

Page 37: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev

Why are hand offs so expensive?

1 hour

1 day

9 days

Page 38: Nyoug delphix slideshare

II. Data Tax is Huge : 3. App Dev Slow Environment Builds

Never enough environments

Page 39: Nyoug delphix slideshare

Part II. Data constraint price is Huge

• Four Areas data tax hits

1. IT Capital resources
2. IT Operations personnel
3. Application Development
4. Business

Page 40: Nyoug delphix slideshare

II. Data constraint price is Huge : 4. Business

Ability to capture revenue

• Business Intelligence
– Old data = less intelligence
• Business Applications
– Delays cause => Lost Revenue

Page 41: Nyoug delphix slideshare

II. Data constraint price is Huge : 4. Business

Page 42: Nyoug delphix slideshare

II. Data constraint price is Huge : 4. Business

[Bar chart: Storage, IT Ops, Dev, Revenue; axis 0 to 30000, Billion $]

Page 43: Nyoug delphix slideshare

Data is the constraint

I. Data Constraint strains IT
II. Data Constraint price is huge
III. Data Constraint companies unaware

Page 44: Nyoug delphix slideshare

Part III. Data Constraint companies unaware

Page 45: Nyoug delphix slideshare

III. Data Constraint companies unaware

DBA Developer

Page 46: Nyoug delphix slideshare

III. Data Constraint companies unaware

#1 Biggest Enemy :

IT departments believe
– best processes
– greatest technology
– Just the way it is

Page 47: Nyoug delphix slideshare

III. Data Constraint companies unaware

Why do I need an iPhone ?

Don’t we already do that ?

Page 48: Nyoug delphix slideshare

III. Data Constraint companies unaware

• Ask Questions
– me: we provision environments in minutes for almost no extra storage.
– Customer: We already do that
– me: How long does it take a developer to get an environment after they ask?
– Customer: 2-3 weeks
– me: we do it in 2-3 minutes

Page 49: Nyoug delphix slideshare

III. Data Constraint companies unaware

How to enlighten? Ask for metrics

– How old is data in
• BI and DW : ETL windows
• QA and Dev : how often refreshed
– How long does it take a developer to get a DB copy?
– How long does it take QA to set up an environment?

Page 50: Nyoug delphix slideshare

Data is the constraint

I. Data Constraint strains IT
II. Data Constraint price is huge
III. Data Constraint companies unaware

Page 51: Nyoug delphix slideshare

In this presentation :

• Problem in the Industry
• Solution
• Use Cases

Page 52: Nyoug delphix slideshare

Clone 1 Clone 2 Clone 3

99% of blocks are identical

Page 53: Nyoug delphix slideshare

Solution

Page 54: Nyoug delphix slideshare

Clone 1 Clone 2 Clone 3

Thin Clone

Page 55: Nyoug delphix slideshare

Technology Core : file system snapshots

• VMware Linked Clones
– Not supported for Oracle
• EMC
– 16 snapshots
– Write performance impact
• NetApp
– 255 snapshots
• ZFS
– Unlimited snapshots

Page 56: Nyoug delphix slideshare

III. Companies unaware of the Data Tax

Page 57: Nyoug delphix slideshare

Three Core Parts

1. Production (Instance, File System)
– Copy
– Sync
– Snapshots
– Time Flow
– Purge

2. Storage
– Clone (snapshot)
– Compress
– Share Cache
– Storage Agnostic

3. Development (Instance)
– Mount, recover, rename
– Self Service, Roles & Security
– Rollback & Refresh
– Branch & Tag

Page 59: Nyoug delphix slideshare

3. Database Virtualization

Page 60: Nyoug delphix slideshare

Three Physical Copies

Three Virtual Copies

Data Virtualization Appliance

Page 61: Nyoug delphix slideshare

Install Delphix on x86 hardware

Intel hardware

Page 62: Nyoug delphix slideshare

Allocate Any Storage to Delphix

Allocate Storage: Any type

Pure Storage + Delphix: Better Performance for 1/10 the cost

Page 63: Nyoug delphix slideshare

One time backup of source database

[Diagram: Production (Instance, Database, File system) backed up once to the Delphix file system; labels: Supports, Upcoming, Application Stack Data]

Page 64: Nyoug delphix slideshare

DxFS (Delphix) Compress Data

Data is typically compressed to 1/3 of its original size

[Diagram: Production (Instance, Database, File system) copied into the compressed file system]

Page 65: Nyoug delphix slideshare

Incremental forever change collection

Changes

• Collected incrementally forever
• Old data purged

[Diagram: changes flow from the Production database and file system into the file system Time Window]

Page 66: Nyoug delphix slideshare

Source Full Copy Source backup from SCN 1

Page 67: Nyoug delphix slideshare

Snapshot 1, Snapshot 2

Page 68: Nyoug delphix slideshare

Snapshot 1, Snapshot 2

Backup from SCN

Page 69: Nyoug delphix slideshare

Snapshot 1, Snapshot 2, Snapshot 3

Page 70: Nyoug delphix slideshare

Drop Snapshot

Before: Snapshot 1, Snapshot 2, Snapshot 3

Drop Snapshot 1

After: Snapshot 2, Snapshot 3
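Dropping a snapshot can only free blocks that no remaining snapshot still references. The sketch below models that bookkeeping in a simplified way. The data layout (a dict of block name to block version per snapshot) and the function name `drop_snapshot` are assumptions made for illustration, not the way DxFS is implemented.

```python
from collections import Counter


def drop_snapshot(snapshots: dict[str, dict[str, str]], name: str) -> set[str]:
    """Remove snapshot `name` and return the block versions that can be
    freed because no remaining snapshot references them."""
    dropped = snapshots.pop(name)
    # Count how often each block version is still referenced elsewhere.
    still_used = Counter(
        version for snap in snapshots.values() for version in snap.values()
    )
    return {v for v in dropped.values() if still_used[v] == 0}


# Hypothetical three-block database: snapshot 2 changed b, snapshot 3 changed c.
snapshots = {
    "snapshot1": {"a": "a", "b": "b", "c": "c"},
    "snapshot2": {"a": "a", "b": "b'", "c": "c"},
    "snapshot3": {"a": "a", "b": "b'", "c": "c'"},
}
freed = drop_snapshot(snapshots, "snapshot1")
print(sorted(freed))  # only the old version of b is no longer needed
```

In this model, dropping the oldest snapshot reclaims only the blocks that later snapshots have since replaced; everything still shared stays on disk.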

Page 71: Nyoug delphix slideshare

Virtual DB

Jonathan Lewis © 2013

Snapshot 1 – full backup once only at link time

a b c d e f g h i

We start with a full backup, analogous to a level 0 RMAN backup. It includes the archived redo log files needed for recovery. The source database must run in archivelog mode.

Page 72: Nyoug delphix slideshare

Snapshot 2 (from SCN)

b' c'

a b c d e f g h i

The "backup from SCN" is analogous to a level 1 incremental backup (which includes the relevant archived redo logs). It is sensible to enable block change tracking (BCT).

Delphix executes standard RMAN scripts

Page 73: Nyoug delphix slideshare

a b c d e f g h i

Apply Snapshot 2

b' c'

The Delphix appliance unpacks the RMAN backup and "overwrites" the initial backup with the changed blocks, but DxFS makes new copies of the blocks instead of overwriting them in place

b' c'

Page 74: Nyoug delphix slideshare

Derived Full Backup at Snapshot 2

a b' c' d e f g h i

The call to RMAN leaves us with a new level 0 backup, waiting for recovery. But we can pick the root block of any snapshot, so we have EVERY level 0 backup.

Page 75: Nyoug delphix slideshare

Creating a vDB

a b' c' d e f g h i

The first step in creating a vDB is to take a snapshot of the filesystem as of the backup you want (then roll it forward)

My vDB (filesystem)

Your vDB (filesystem)

Page 76: Nyoug delphix slideshare

Creating a vDB

a b' c' d e f g h i

The first step in creating a vDB is to take a snapshot of the filesystem as of the backup you want (then roll it forward)

My vDB (filesystem)

Your vDB (filesystem)

i’
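The snapshot and vDB slides describe a copy-on-write scheme: each snapshot shares unchanged blocks with its predecessor, and each vDB keeps its own changed blocks on top of a shared snapshot. The following is a minimal sketch of that idea. The class names `Snapshot` and `VirtualDB` are hypothetical, and placing the changed block i' in "my" vDB is an assumption, since the slide does not say which vDB wrote it. It illustrates the concept and is not the DxFS implementation.

```python
class Snapshot:
    """A read-only view of the file system: block name -> block contents."""

    def __init__(self, blocks):
        self.blocks = dict(blocks)

    def apply_changes(self, changed):
        """Return a new snapshot. The new dict copies references only, so
        unchanged block contents are shared with the earlier snapshot."""
        return Snapshot({**self.blocks, **changed})


class VirtualDB:
    """A writable clone: reads fall through to the snapshot unless this
    vDB has written its own version of the block."""

    def __init__(self, snapshot):
        self.snapshot = snapshot
        self.private = {}  # blocks changed by this vDB only

    def write(self, name, contents):
        self.private[name] = contents

    def read(self, name):
        return self.private.get(name, self.snapshot.blocks[name])


# Snapshot 1: the full backup, blocks a..i.
snap1 = Snapshot({name: name for name in "abcdefghi"})
# Snapshot 2: the "backup from SCN" changes b and c only.
snap2 = snap1.apply_changes({"b": "b'", "c": "c'"})

my_vdb = VirtualDB(snap2)
your_vdb = VirtualDB(snap2)
my_vdb.write("i", "i'")  # assumed: the change happens in "my" vDB

print(my_vdb.read("i"), your_vdb.read("i"))    # i' i
print(snap1.blocks["b"], snap2.blocks["b"])    # b b'
```

Both vDBs start from the same snapshot at no extra cost, and a write in one is invisible to the other and to every earlier snapshot.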

Page 77: Nyoug delphix slideshare

Cloning

[Diagram: Production (Instance, Database, File system) feeds the file system Time Window, from which several cloned Instances are provisioned]

Page 78: Nyoug delphix slideshare

In this presentation :

• Problem in the Industry
• Solution
• Use Cases

Page 79: Nyoug delphix slideshare

Use Cases

1. Development
2. QA
3. Recovery
4. Business Intelligence
5. Modernization

Page 81: Nyoug delphix slideshare

Development

• Parallelized Environments
• Full size environments
• Self Service

Page 82: Nyoug delphix slideshare

Development: Parallelize Environments

gif by Steve Karam

Page 83: Nyoug delphix slideshare

Development: Full size copies

Production

Page 84: Nyoug delphix slideshare

Development: Self Service

Page 85: Nyoug delphix slideshare

Use Cases

1. Development
2. QA
3. Recovery
4. Business Intelligence
5. Modernization

Page 86: Nyoug delphix slideshare

QA

• Fast
• Parallel
• Rollback
• A/B testing

Page 87: Nyoug delphix slideshare

QA : Fast environments with Branching

[Diagram: Source, Dev, and QA, each with its own Instance]

QA branched from Dev

Page 88: Nyoug delphix slideshare

QA : Fast environments with Branching

[Chart: short Build time vs. long QA Test time]

1% of QA time was building environment

$.99/$1.00 actual testing vs. setup

Page 89: Nyoug delphix slideshare

QA : bugs found fast

[Two timelines over Sprint 1, Sprint 2, Sprint 3 comparing when a coded bug (X) is found: one with a Build QA Env step before each QA phase, one with QA running without the build step]

Page 90: Nyoug delphix slideshare

QA : Parallel environments

Instance

Instance

Instance

Instance

Source

Page 91: Nyoug delphix slideshare

QA : Rewind for patch and QA testing

Instance Instance

Development

Time Window

Prod

Page 92: Nyoug delphix slideshare

QA : A/B testing

Instance

Instance

Instance

Index 1

Index 2

Page 93: Nyoug delphix slideshare

Use Cases

1. Development
2. QA
3. Quality
4. Business Intelligence
5. Modernization

Page 94: Nyoug delphix slideshare

Quality

1. Prod & Dev Backups
2. Surgical recovery
3. Recovery of Production
4. Recovery of Development
5. Bug Forensics

Page 95: Nyoug delphix slideshare

Quality : 50 days of backup in size of production

Page 96: Nyoug delphix slideshare

Quality : Surgical recovery

Instance Instance

Development

Time Window

Before drop

Drop

Source

Page 97: Nyoug delphix slideshare

Quality: recovery of development

Instance Instance

[Diagram: Source with its Time Window, Dev1 VDB with its own Time Window, and Dev2 VDB Branched from Dev1 with its own Time Window]

Page 98: Nyoug delphix slideshare

Quality : recovery of production

Instance Instance

VDB

Source

Time Window

Corruption

Page 99: Nyoug delphix slideshare

1. Forensics: Investigate Production Bugs

Instance

Time Window

Instance

Development

Bug

Yesterday

Page 100: Nyoug delphix slideshare

Use Cases

1. Development
2. QA
3. Quality
4. Business Intelligence
5. Modernization

Page 101: Nyoug delphix slideshare

Business Intelligence

• 24x7 Batches
• Low Bandwidth
• Temporal Data
• Confidence Testing

Page 102: Nyoug delphix slideshare

Business Intelligence: ETL and Refresh Windows

1pm 10pm 8am noon

Page 103: Nyoug delphix slideshare

Business Intelligence: ETL and DW refreshes taking longer

[Chart: ETL and refresh window on a time-of-day axis (1pm, 10pm, 8am, noon) for the years 2011 to 2015]

Page 104: Nyoug delphix slideshare

Business Intelligence ETL and Refresh Windows

[Chart: ETL and refresh windows per year, 2011 to 2015, on a time-of-day axis (labels: 6am, 8am, noon, 1pm, 9pm, 10pm)]

Page 105: Nyoug delphix slideshare

Business Intelligence: ETL and DW Refreshes

Instance

Prod

Instance

DW & BI

Data Guard – requires full refresh if used

Active Data Guard – read only, most reports don’t work

Page 106: Nyoug delphix slideshare

Business Intelligence: Fast Refreshes

• Collect only Changes
• Refresh in minutes

Instance Instance

Prod

Instance

BI and DW

ETL 24x7

Page 107: Nyoug delphix slideshare

Business Intelligence: Temporal Data

Page 108: Nyoug delphix slideshare

Business Intelligence

a) 24x7 Batches & Refreshes

b) Temporal queries

c) Confidence testing

Page 109: Nyoug delphix slideshare

Use Cases

1. Development
2. QA
3. Quality
4. Business Intelligence
5. Modernization

Page 110: Nyoug delphix slideshare

Modernization

1. Federated
2. Consolidation
3. Migration
4. Auditing

Page 111: Nyoug delphix slideshare

Modernization: Federated

Instance

Instance

Instance

Instance

Source1

Source2

Source1

Page 112: Nyoug delphix slideshare

Modernization: Federated

Page 113: Nyoug delphix slideshare

“I looked like a hero” – Tony Young, CIO Informatica

Modernization: Federated

Page 114: Nyoug delphix slideshare

Modernization: Data Center Migration

5x Source Data Copy

< 1x Source Data Copy

[Diagram: two layouts built from blocks labeled S, C, and V]

Page 115: Nyoug delphix slideshare

Modernization: Consolidation

Without Delphix With Delphix

Page 116: Nyoug delphix slideshare

Modernization: Auditing & Version Control

Data Control = Source Control for the Database

[Diagram: Production Time Flow with Dev, QA, and UAT environments at versions 2.6, 2.7, and 2.8]

• CIO, Insurance: 600 Applications
• CIO, Investment Banking: 180 Applications
• CIO, South America: 65 Applications

Page 117: Nyoug delphix slideshare

Use Case Summary

1. Development
2. QA
3. Quality
4. Business Intelligence
5. Performance Acceleration

Page 118: Nyoug delphix slideshare

How expensive is the Data Constraint?

Measured before and after Delphix with Fortune 500 companies:

Median App Dev throughput increased by 2x

Page 119: Nyoug delphix slideshare

How expensive is the Data Constraint?

• 10x faster Financial Close
• 9x faster BI refreshes
• 2x faster Projects
• 20% fewer bugs

Page 120: Nyoug delphix slideshare

Agile Data Quotes

• “Allowed us to shrink our project schedule from 12 months to 6 months.” – BA Scott, NYL VP App Dev

• “It used to take 50-some-odd days to develop an insurance product, … Now we can get a product to the customer in about 23 days.” – Presbyterian Health

• “Can't imagine working without it” – Ramesh Shrinivasan, CA Department of General Services

Page 121: Nyoug delphix slideshare
Page 122: Nyoug delphix slideshare

Summary

• Problem: Data is the constraint
• Solution: Agile data is small & fast
• Results: Deliver projects
– Half the Time
– Higher Quality
– Increase Revenue

[email protected]

slideshare.com/khailey

Page 123: Nyoug delphix slideshare

Future

Now
• Application Stack Cloning
• Cross Platform Cloning : UNIX -> Linux
• Postgres

Coming
• VM cloning
• Workflows
– Chef, Puppet, etc. workflows for virtual data provisioning
• Developer workspaces
– Check out, check in, bookmark, tagging, rollback, refresh
• Secure Data
– Masking
• More Databases
– MySQL, Sybase, DB2, Hadoop, Mongo, Cassandra
• DR and HA

Page 124: Nyoug delphix slideshare

Oracle 12c

Page 125: Nyoug delphix slideshare

80MB buffer cache ?

Page 126: Nyoug delphix slideshare

200GB Cache

Page 127: Nyoug delphix slideshare

[Charts: Tnxs / min (axis to 5000) and Latency (axis to 300 ms) vs. Users (1, 5, 10, 20, 30, 60, 100, 200)]

Page 128: Nyoug delphix slideshare

[Charts: Tnxs / min (axis to 8000) and Latency (axis to 600 ms) vs. Users (1, 5, 10, 20, 30, 60, 100, 200)]

Page 129: Nyoug delphix slideshare

Five 200GB database copies are cached with :

• $1,000,000 : 1TB cache on SAN
• $6,000 : 200GB shared cache on Delphix
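The cache sizes on this slide can be checked with simple arithmetic. The sketch below assumes the five copies share essentially all of their blocks, so a shared cache needs to hold only one copy's worth of data; the dollar figures are taken from the slide, and the variable names are illustrative.

```python
copies = 5
copy_size_gb = 200

# Without sharing, each copy needs its own cache space.
unshared_cache_gb = copies * copy_size_gb
# With sharing (assumed: the copies' blocks are identical), one copy's worth suffices.
shared_cache_gb = copy_size_gb

san_cost = 1_000_000   # slide figure: 1TB cache on SAN
delphix_cost = 6_000   # slide figure: 200GB shared cache on Delphix

print(unshared_cache_gb, shared_cache_gb)   # 1000 200
print(round(san_cost / delphix_cost))       # 167
```

Under these assumptions, the shared cache covers the same five copies with one fifth of the memory, at roughly 1/167 of the quoted cost.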