Optimize Your Vertica Data Management Infrastructure Srinivas Vadlamani, Chief Architect January 2017

Upload: talena-inc

Post on 15-Feb-2017


Page 1: Optimize Your Vertica Data Management Infrastructure

Confidential and Proprietary1

Optimize Your Vertica Data Management Infrastructure
Srinivas Vadlamani, Chief Architect
January 2017

Page 2: Optimize Your Vertica Data Management Infrastructure

My background

Co-founder and Chief Architect at Talena. Prior to Talena, I was an early architect at Couchbase and Aster Data, helping design some of their key data management capabilities.

Page 3: Optimize Your Vertica Data Management Infrastructure

Data Management Drivers

Data Loss
• Business challenges: 70% of businesses lost data over the past two years; 40% of businesses hit by ransomware in 2016
• Financial impact: $900K average cost of a data loss incident

Application Iteration
• Business challenges: Robust testing requires up-to-date data; 90% of enterprises delay application rollouts waiting for production data
• Financial impact: $1.08M cost to implement manual test data efforts

Compliance
• Business challenges: Average Global 2000 company has seven copies of prod data; storage costs for archival growing by 35% yearly
• Financial impact: $300K average yearly cost of managing archives

Source: EMC, CA, Ponemon

Page 4: Optimize Your Vertica Data Management Infrastructure

Key Big Data Protection Principles

• Replication and backup are not the same
• You need an incremental-forever architecture
• It really is about recovery, not backup
• Even commodity storage is expensive

Page 5: Optimize Your Vertica Data Management Infrastructure

Replication vs Backup

Replication: ideal for hardware failures

Backup: ideal to protect against human errors and application corruption

Need both as part of your Vertica data protection strategy
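The difference can be illustrated with a toy model. This is a minimal sketch, not Vertica or Talena behavior: `ToyCluster` and its methods are hypothetical names, and a "table" is just a Python list. It shows that a replica faithfully copies a bad write, while a point-in-time backup still holds the good data.

```python
import copy


class ToyCluster:
    """Hypothetical two-copy cluster with optional point-in-time backups."""

    def __init__(self, rows):
        self.primary = list(rows)
        self.replica = list(rows)
        self.backups = []

    def write(self, rows):
        # Replication applies every write to both copies, good or bad.
        self.primary = list(rows)
        self.replica = list(rows)

    def take_backup(self):
        # A backup is an independent copy frozen at this moment.
        self.backups.append(copy.deepcopy(self.primary))

    def fail_primary(self):
        # Hardware failure: the replica takes over with identical data.
        self.primary = list(self.replica)

    def restore(self, index=-1):
        # Human error or corruption: roll both copies back to a backup.
        self.write(self.backups[index])


cluster = ToyCluster([1, 2, 3])
cluster.take_backup()
cluster.write([])          # accidental truncate, replicated immediately
cluster.fail_primary()     # failing over does not help: the replica is empty too
cluster.restore()          # only the backup brings the rows back
print(cluster.primary)
```

Replication covers the hardware-failure path (`fail_primary`), and backup covers the human-error path (`restore`), which is why the slide asks for both.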

Page 6: Optimize Your Vertica Data Management Infrastructure

Why Incremental-Forever?

Data volumes are getting too large for traditional backup methods. Taking a full backup of hundreds of terabytes every week is not a feasible backup policy.
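A rough back-of-the-envelope comparison makes the point concrete. The sketch below is illustrative only: the 200 TB dataset size and 5% weekly change rate are assumptions chosen for the example, not figures from the talk, and the dataset is assumed not to grow.

```python
def weekly_full_tb(dataset_tb: float, weeks: int) -> float:
    """Total TB moved when a full backup is taken every week."""
    return dataset_tb * weeks


def incremental_forever_tb(dataset_tb: float, weekly_change_rate: float, weeks: int) -> float:
    """Total TB moved with one initial full copy, then only the weekly changes."""
    if weeks <= 0:
        return 0.0
    return dataset_tb + dataset_tb * weekly_change_rate * (weeks - 1)


# Assumed inputs for illustration: 200 TB, 5% of the data changes each week, one year.
full = weekly_full_tb(200, 52)
incremental = incremental_forever_tb(200, 0.05, 52)
print(f"weekly fulls: {full:.0f} TB moved, incremental-forever: {incremental:.0f} TB moved")
```

Under these assumptions the incremental-forever policy moves a small fraction of the data that weekly full backups would, and the gap widens as the retention period grows.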

Page 7: Optimize Your Vertica Data Management Infrastructure

A Recovery-centric Architecture

How quickly you recover impacts your business and brand

Your recovery architecture needs to handle changes to the production topology over time

Your recovery flexibility (whole database or at a schema level) will influence your recovery point and recovery time objectives

Page 8: Optimize Your Vertica Data Management Infrastructure

Speeding Up Application Delivery

• Real versus Synthetic Data
• Supporting Compliance Initiatives
• Minimizing Network Overhead

Page 9: Optimize Your Vertica Data Management Infrastructure

What’s Different In The Cloud

PROBLEM: Metadata management is made more complex

PROBLEM: Storage optimization becomes that much more difficult

PROBLEM: Relying on traditional backup mechanisms does not scale

Page 10: Optimize Your Vertica Data Management Infrastructure

The Talena Architecture

• Deep de-duplication and compression with app-aware architecture
• Incremental-forever backup architecture
• High availability via erasure coding in distributed cluster architecture

Diagram layers: Smart Storage Optimizer
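To show why de-duplication pairs well with an incremental-forever policy, here is a minimal sketch of a content-addressed chunk store. It is not Talena's implementation: `ChunkStore` is a hypothetical name, chunks are fixed-size rather than app-aware, and erasure coding is not modeled. Each unique chunk is compressed and stored once, and every backup is just a list of chunk hashes.

```python
import hashlib
import zlib


class ChunkStore:
    """Toy content-addressed store: each unique chunk is compressed and kept once."""

    def __init__(self, chunk_size: int = 4096) -> None:
        self.chunk_size = chunk_size
        self.chunks = {}  # SHA-256 hex digest -> compressed chunk bytes

    def put(self, data: bytes) -> list:
        """Store data and return its recipe (ordered list of chunk digests)."""
        recipe = []
        for offset in range(0, len(data), self.chunk_size):
            chunk = data[offset:offset + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            if digest not in self.chunks:
                # Only chunks never seen before consume storage.
                self.chunks[digest] = zlib.compress(chunk)
            recipe.append(digest)
        return recipe

    def get(self, recipe: list) -> bytes:
        """Rebuild the original bytes from a recipe."""
        return b"".join(zlib.decompress(self.chunks[d]) for d in recipe)


store = ChunkStore(chunk_size=4)
monday = store.put(b"AAAABBBBCCCC")
tuesday = store.put(b"AAAABBBBDDDD")   # only the last chunk changed
print(len(store.chunks), "unique chunks stored for 6 logical chunks")
```

Both backups stay fully restorable through `get`, yet the second one adds only the chunk that changed.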

Page 11: Optimize Your Vertica Data Management Infrastructure

The Talena Architecture

Native querying and analytics via active compute layer

Unbounded scale with a Hadoop-native architecture

Diagram layers: Smart Storage Optimizer, Active Compute Services, Distributed File System

Page 12: Optimize Your Vertica Data Management Infrastructure

The Talena Architecture

• Google-like catalog shortens data recovery time

• Automatic schema generation for mirroring and backups

• Granular recovery at an object level

• Recovery to multiple topologies

• Native integration with LDAP and Kerberos for authentication

• Role-based access control defines specific privileges

• Transparent data encryption

• Masking for PII data

Diagram layers: Smart Storage Optimizer, Active Compute Services, Distributed File System, Metadata Catalog, Data Orchestration Services, Security Services
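As an illustration of the PII masking bullet, the sketch below shows one common approach: deterministic pseudonymization with a keyed hash, so that masked values stay consistent across tables and joins still work in a test copy. This is an assumption about how masking could be done, not a description of Talena's feature; `mask_value`, `mask_row`, the column names, and the inline key are all hypothetical.

```python
import hashlib
import hmac


def mask_value(value: str, key: bytes, length: int = 12) -> str:
    """Replace a value with a keyed, deterministic token of fixed length."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return digest[:length]


def mask_row(row: dict, pii_columns: set, key: bytes) -> dict:
    """Mask only the columns flagged as PII and pass the rest through unchanged."""
    return {
        column: mask_value(str(value), key) if column in pii_columns else value
        for column, value in row.items()
    }


# Hypothetical row and key, for illustration only. A real key belongs in a secrets store.
key = b"demo-key"
row = {"customer_id": 42, "email": "jane@example.com", "order_total": 19.99}
masked = mask_row(row, {"email"}, key)
print(masked)
```

Because the same input and key always produce the same token, a masked `email` in one table still matches the masked `email` in another.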

Page 13: Optimize Your Vertica Data Management Infrastructure

The Talena Architecture

• ‘Single pane of glass’ for multiple use cases and data platforms
• Agentless architecture minimizes management overhead
• GUI, CLI, REST-based Talena API options

Diagram layers: Smart Storage Optimizer, GUI, CLI, API, Active Compute Services, Distributed File System, Metadata Catalog, Data Orchestration Services, Security Services

Page 14: Optimize Your Vertica Data Management Infrastructure

Talena and vbr.py

Capability                                          vbr.py   Talena
Recovery to different Vertica version               No       Yes
Recovery to different Vertica topology              No       Yes
Google-like metadata catalog for rapid discovery    No       Yes
Built-in storage optimization                       No       Yes
UI for automated policy and workflow creation       No       Yes
Ability to support test data management             No       Yes
Inherent scalable infrastructure                    No       Yes
Data masking support                                No       Yes
Sampling support                                    No       Yes

Page 15: Optimize Your Vertica Data Management Infrastructure

Q&A

We’ll send you a link to our eBook “The Vertica Backup Guide”

Additional resources: talena-inc.com/resources and talena-inc.com/blog

Ping us with any additional questions: [email protected]

Page 16: Optimize Your Vertica Data Management Infrastructure

Q and A