consolidate more: high performance primary deduplication in the age of abundant capacity

25
CONSOLIDATE MORE: HIGH- PERFORMANCE PRIMARY DEDUPLICATION IN THE AGE OF ABUNDANT CAPACITY YONG KIM, TECHNICAL DIRECTOR, AMERICAS FILE AND CONTENT SOLUTIONS

Upload: hitachi-data-systems

Post on 29-Nov-2014

668 views

Category:

Technology


0 download

DESCRIPTION

Increase productivity, efficiency and environmental savings by eliminating silos, preventing sprawl and reducing complexity by 50%. Using powerful consolidation systems, Hitachi Unified Storage or Hitachi NAS Platform, lets you consolidate existing file servers and NAS devices on to fewer nodes. You can perform the same or even more work with fewer devices and lower overhead, while reducing floor space and associated power and cooling costs. View this webcast to learn how to: Shrink your primary file data without disrupting performance. Increase productivity and utilization of available capacity. Defer additional storage purchases. Save on power, cooling and space costs. For more information please visit: http://www.hds.com/products/file-and-content/network-attached-storage/?WT.ac=us_inside_rm_htchunfds

TRANSCRIPT

Page 1: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

CONSOLIDATE MORE: HIGH- PERFORMANCE PRIMARY DEDUPLICATION IN THE AGE OF ABUNDANT CAPACITY

YONG KIM, TECHNICAL DIRECTOR, AMERICAS FILE AND CONTENT SOLUTIONS

Page 2: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

Increase productivity, efficiency and environmental savings by eliminating silos,

preventing sprawl and reducing complexity by 50%. Using powerful consolidation

systems − Hitachi Unified Storage or Hitachi NAS Platform − lets you consolidate existing

file servers and NAS devices onto fewer nodes. You can perform the same or even more

work with fewer devices and lower overhead, while reducing floor space and associated

power and cooling costs.

Attend this webcast to learn how to

Shrink your primary file data without disrupting performance.

Increase productivity and utilization of available capacity.

Defer additional storage purchases.

Save on power, cooling and space costs.

CONSOLIDATE MORE: HIGH-PERFORMANCE PRIMARY DEDUPLICATION

IN THE AGE OF ABUNDANT CAPACITY

WEBTECH EDUCATIONAL SERIES

Page 3: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

UPCOMING WEBTECHS

Cloud and Object Store Series

‒ Environmental Pressures Driving an Evolution in File Storage, April 3, 9 a.m. PT, noon ET

Big Data Webcast Series Continues

‒ Big Data: Shining the Light on Enterprise Dark Data, April 17, 9 a.m. PT, noon ET

‒ HDS Big Data Roadmap, May 1, 9 a.m. PT, noon ET

Check www.hds.com/webtech for

Links to the recording, the presentation, and Q&A (available next week)

Schedule and registration for upcoming WebTech sessions

Page 4: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

CUSTOMER CHALLENGES

Reduce the cost of

storing data?

Reduce the cost of

protecting data?

Manage distributed IT

more effectively?

Mitigate data risk?

Gain IT agility?

How

do I

Do more with less?

Archive first

Back up less

Consolidate more

Page 5: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

TIE IT ALL TOGETHER

EDGE STORAGE

ALTERNATE DATA CENTER

REMOTE OFFICE

APPLICATION APPLICATION APPLICATION

HDDS SEARCH ACROSS

THE POWER OF THE PORTFOLIO

Reduce the cost of

storing data

Reduce the cost of

protecting data

Do more with less

Back up

less

Consolidate

more

Reduce overall storage costs by reducing the load on primary storage by at least 40%

Reduce licensing and management cost, complexity and backup by up to 75%

CLOUD STORAGE

MOBILE WORKFORCE

HUS FILE

MODULE

Page 6: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

TIE IT ALL TOGETHER

EDGE STORAGE

ALTERNATE DATA CENTER

REMOTE OFFICE

APPLICATION APPLICATION APPLICATION

HDDS SEARCH ACROSS

THE POWER OF THE PORTFOLIO

Reduce the cost of

storing data

Reduce the cost of

protecting data

Do more with less

Consolidate

more

Streamline backup and restore operations by 50-60%

Improve reliability with >24x improvement in RPO, >30x improvement in RTO

Simplify management, improve data protection and reduce risk

CLOUD STORAGE

MOBILE WORKFORCE

HUS FILE

MODULE

Page 7: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

TIE IT ALL TOGETHER

EDGE STORAGE

ALTERNATE DATA CENTER

REMOTE OFFICE

APPLICATION APPLICATION APPLICATION

HDDS SEARCH ACROSS

THE POWER OF THE PORTFOLIO

Reduce the cost of

storing data

Reduce the cost of

protecting data

Do more with less

Simplify management, improve data protection and reduce risk

Reduce or eliminate CAPEX, simplify management, offload data from primary storage

CLOUD STORAGE

MOBILE WORKFORCE

HUS FILE

MODULE

Page 8: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

DATA GROWTH CONTINUES UNABATED …

Data growth: Doubling every 18 months

‒ Unstructured data (files) growing even faster

Worldwide data creation exceeded 1

zettabyte in 2010 for the first time

40% projected growth in global data

generated per year vs. 5% growth in global

IT spend *

Page 9: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

… FAR LESS OF THE DATA IS UNIQUE

75% duplicate data (IDC)

80% (McKinsey Global Institute)

Page 10: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

DATA DEDUPLICATION

A storage-optimization technology

‒ Compression, SIS

Reduces data by eliminating multiple copies of redundant

data and only keeping unique data

First made popular for backup (secondary) devices, the

technology has been extended to support primary

storage

WHAT IS IT?

Page 11: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

DATA DEDUPLICATION

Challenge for companies today continues to be the cost and

associated cost of storage

Data storage

Optimizing data in place, and reducing the on-disk footprint of data as

it is stored provides immediate savings in capacity and new disk

expenditures

Drive down the cost per gigabyte − store more information in the

same gigabyte

Data management

Reduces the volume of data that needs to be managed

Reduces the frequency of backups

BUSINESS AND CUSTOMER BENEFITS

Page 12: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

WHAT MARKET RESEARCH FIRMS ARE SAYING

Page 13: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

WHAT MARKET RESEARCH FIRMS ARE SAYING

“… the percentage of deployments with must-

have primary dedupe ‘requirements’ will reach

anywhere from 5% (conservative) to 22%

(aggressive) by 2015”

Page 14: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

WHAT HAPPENS DURING DEDUPLICATION

After Dedupe

Page 15: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

DEDUPE IN ACTION

Owner: Steve

File Name: \home\steve\work\Report.doc

Size: 15 4K blocks

Owner: Paul

File Name: \home\paul\proforma.xls

Size: 5 4K blocks

Owner: John

File Name: \home\john\tmp\MyReport.doc

Size: 17 4K blocks

Owner: Mary

File Name: \home\mary\SteveReport.doc

Size: 17 4K blocks

Without Dedupe

54 4K Blocks Consumed

With Dedupe

Only 24 4K Blocks Consumed

30 4K Blocks (> 50%) Reclaimed

Page 16: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

HITACHI NAS PLATFORM (HNAS) DEDUPLICATION OVERVIEW

Leverages HNAS unique hybrid core technology

‒ Indexing engine via CPU

‒ VLSI/FPGA technology

SHA256 hash of a data block detect duplicate

Up to 4 SHA calculators running in parallel

Post-processing

‒ File system data is analyzed and processed for dedupe after it is

written to disk

‒ Not in data path – no risk of losing or being unable to access data

Dedupe index is used to store and identify duplicate

blocks in a file system

Page 17: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

IT’S ALL ABOUT POINTERS

Maximum (in theory) dedupe ratio is 239:1

‒ A HNAS block can be shared up to 239 times

‒ For example, 478 duplicate blocks dedupe down to 2 blocks

Page 18: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

DEDUPE EFFICIENCY

Typical enterprises 2:1 to 5:1 data reduction

Higher in virtual environments

2:1 50% Savings

5:1 80% Savings

4:1 75% Savings

10:1 90% Savings

0%

10%

20%

30%

40%

50%

60%

70%

80%

90%

100%

Page 19: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

COMPETITIVE DIFFERENTIATION

Extreme performance

‒ Up to 450MB/sec post-ingest throughput rate

Dynamic quality of service (QoS)

‒ Avoid impacting application I/O

‒ Less degradation on I/O Performance

Simple to use

‒ Not a hardware appliance

‒ Little to no administration required

Page 20: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

QOS / THROTTLING MECHANISMS

Automatic background

process

‒ No complex scheduling

process

‒ 24/7 operation

‒ No disruption to workflow

QoS/auto throttling

‒ When the file serving load

passes beyond 50% (of

available IOPS or throughput

capacity), the engine throttles

back

Page 21: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

RESULTS FROM A LEADING MANUFACTURER

REAL-WORLD RESULTS

“The current HNAS algorithm appears to be far better than

others (competitors) tested”

Page 22: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

TIE IT ALL TOGETHER

EDGE STORAGE

ALTERNATE DATA CENTER

REMOTE OFFICE

APPLICATION APPLICATION APPLICATION

CLOUD STORAGE

HDDS SEARCH ACROSS

THE POWER OF THE PORTFOLIO

Reduce the cost of

storing data

Reduce the cost of

protecting data

Do more with less

Archive

first

Back up

less

Consolidate

more

The intelligent archive is the strong foundation of the

21st Century data center

Page 23: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

QUESTIONS AND DISCUSSION

Page 24: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

UPCOMING WEBTECHS

Cloud and Object Store Series

‒ Environmental Pressures Driving an Evolution in File Storage, April 3, 9 a.m. PT, noon ET

Big Data Webcast Series Continues

‒ Big Data: Shining the Light on Enterprise Dark Data, April 17, 9 a.m. PT, noon ET

‒ HDS Big Data Roadmap, May 1, 9 a.m. PT, noon ET

Check www.hds.com/webtech for

Links to the recording, the presentation, and Q&A (available next week)

Schedule and registration for upcoming WebTech sessions

Page 25: Consolidate More: High Performance Primary Deduplication in the Age of Abundant Capacity

THANK YOU