protectier t7600
Data Efficiency - Why everyone is interested.
Information Infrastructure
© 2009 IBM Corporation1 Apr 9, 2023
When Do I Use ProtecTIER or TSM 6 Built-in Deduplication?
Both Solutions Offer the Benefits of Data Deduplication:
• Greatly reduced storage capacity requirements
• Lower operational costs, energy usage and TCO
• Faster recoveries with more data on disk
Use ProtecTIER When:
• High performance is needed to reduce backup & recovery times
  – Up to 500 MB/sec (1000 MB/sec with a 2-node cluster) inline deduplication
• Large capacity scaling (>5 TB) is required
  – Deduplicated capacities up to 25 PB are required
• Inline deduplication is required to minimize impact on SLAs and production
• Deduplicating across multiple backup servers is needed to maximize deduplication
• Established backup application is not TSM
Use TSM 6 Built-in Deduplication When:
• Sufficient TSM server resources can be made available and you desire deduplication operations to be completely integrated within TSM
• The benefits of deduplication are desired without separate hardware or software dependencies or licenses (ships with TSM Extended Edition)
• You desire end-to-end data lifecycle management with minimized data store
TSM
IBM ProtecTIER
Advanced Data Deduplication Solutions for Medium, Large and Enterprise Environments
Introduction to the IBM System Storage TS7600 ProtecTIER™ Deduplication Technology
NOTE: Some slides in this presentation have animation that can not be run in LinkedIn. Contact Dave Thiede to
see the full presentation if you are interested.
This presentation is divided into the following sections:
Market Drivers for Data Deduplication
ProtecTIER Overview
Deploying a ProtecTIER Solution (System i status)
Customer Case Studies
ProtecTIER’s Position in the Deduplication Market
Summary
Agenda
Market Drivers for Data Deduplication
Why is Deduplication such a HOT Technology?
Up to 80% of data is unstructured content
Storage capacity shipments are growing at 54% a year
[Chart: PB shipped per year, 2005 to 2010]
Backup Administrators are Struggling to Keep Up with Rapid Data Growth
What are the biggest problems with your current backup and recovery solutions?
[Bar chart. Categories as listed: Media management; Difficulty measuring backup success; Manual effort required; Recoveries take too long; Backups take too long. Values as listed: 33.0%, 37.0%, 40.0%, 49.0%, 66.0%]
• Short Term Retention
  – Use disk for daily backup & restore operations
• Performance
  – Fast backups
  – Even faster restores
  – Meet “backup windows”
• Long Term Retention
  – Cost effective capacity
  – Removable & transportable
• Compliance
  – Meet financial & regulatory requirements
  – Data encryption, WORM
Using the right balance of high density tape and high performance disk is the solution . . .
And data deduplication is the key to using more disk more cost effectively!
ProtecTIER Overview
Enterprise-Class Data Deduplication
ProtecTIER reduces the required backup disk capacity by up to 25 times or more!
Protect More. Store Less.™
1. Data-agnostic factoring of up to 25 times or more
2. Unmatched performance up to 1000 MB/s or more
3. Unequaled scalability: up to 1 PB physical data
4. Enterprise-class data-integrity: Not hash-based
5. Simple, non-disruptive deployment
6. Supported in most hardware and software environments
No other dedupe technology meets all these criteria!
Most elements of new data backed up today already exist in previous backups
ProtecTIER Vision and Design Criteria
Linux server-based application
Emulates a tape library unit, including drives, cartridges, and robotics
Uses Fibre Channel (FC) attached disk storage system as the backup medium
Backup Server
FC
Disk Storage System
Virtual Tape Library
ProtecTIER Server
“It’s a Tape Library and Drives”
ProtecTIER Architecture Overview
ProtecTIER Application
Repository
Backup Servers
ProtecTIER™ Server
HyperFactor™
New Data Stream
“Filtered” data
Memory Resident Index
Only 4 GB needed to map 1 PB of physical disk!
Inline deduplication
Up to 500 MB/sec per server or 1000 MB/sec with a 2-node cluster!
How ProtecTIER works
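The claim that a 4 GB memory-resident index can map 1 PB of physical disk implies a very small index footprint per unit of stored data. The following is a minimal sketch of that arithmetic, assuming binary units (1 GB = 2^30 bytes, 1 PB = 2^50 bytes), which the slide does not specify. The function name is illustrative only and is not part of any ProtecTIER interface.

```python
GIB = 2**30  # assumption: the slide's "GB" treated as a binary gigabyte
PIB = 2**50  # assumption: the slide's "PB" treated as a binary petabyte
MIB = 2**20


def index_bytes_per_mib(index_bytes: int, repository_bytes: int) -> float:
    """Return how many bytes of index are needed per MiB of repository."""
    return index_bytes / (repository_bytes / MIB)


# Figures quoted on the slide: 4 GB of index for 1 PB of physical disk.
ratio = index_bytes_per_mib(4 * GIB, 1 * PIB)
print(f"{ratio:.1f} index bytes per MiB of repository")  # 4.0
```

Under these assumptions the index costs about 4 bytes for every MiB stored, which is why it can stay resident in memory.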
Backup application writes data to ProtecTIER as it would to tape
Data goes through HyperFactor deduplication engine
Only unique data is stored
Existing duplicate data is referenced
When a data object expires or is overwritten, references are removed
Free space is reclaimed and reused
Overview of ProtecTIER Operations
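The sequence above (write, deduplicate, store unique data, reference duplicates, drop references on expiry, reclaim space) can be sketched with a toy reference-counted chunk store. This is not how HyperFactor works: the deck states that ProtecTIER is not hash-based, and this sketch uses fixed-size chunks and SHA-256 digests purely as a simple stand-in for similarity detection. The class and method names are hypothetical.

```python
import hashlib


class ToyDedupStore:
    """Toy reference-counted chunk store (hash-based stand-in, NOT HyperFactor)."""

    def __init__(self, chunk_size: int = 4):
        self.chunk_size = chunk_size
        self.chunks = {}    # digest -> chunk bytes (unique data only)
        self.refcount = {}  # digest -> number of references to the chunk
        self.objects = {}   # object name -> ordered list of digests

    def write(self, name: str, data: bytes) -> None:
        # Overwriting an object first removes its old references.
        if name in self.objects:
            self.expire(name)
        digests = []
        for i in range(0, len(data), self.chunk_size):
            chunk = data[i:i + self.chunk_size]
            digest = hashlib.sha256(chunk).hexdigest()
            if digest not in self.chunks:
                self.chunks[digest] = chunk  # only unique data is stored
            # Existing duplicate data is referenced, not stored again.
            self.refcount[digest] = self.refcount.get(digest, 0) + 1
            digests.append(digest)
        self.objects[name] = digests

    def read(self, name: str) -> bytes:
        return b"".join(self.chunks[d] for d in self.objects[name])

    def expire(self, name: str) -> None:
        for digest in self.objects.pop(name):
            self.refcount[digest] -= 1
            if self.refcount[digest] == 0:
                # No references remain: free space is reclaimed.
                del self.refcount[digest]
                del self.chunks[digest]

    def physical_bytes(self) -> int:
        return sum(len(c) for c in self.chunks.values())


store = ToyDedupStore()
store.write("monday", b"AAAABBBBCCCC")
store.write("tuesday", b"AAAABBBBDDDD")
print(store.physical_bytes())  # 16 bytes stored for 24 bytes written
store.expire("monday")
print(store.physical_bytes())  # 12: the chunk only "monday" used is reclaimed
```

After "monday" expires, "tuesday" still reads back intact because the shared chunks keep a non-zero reference count.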
Master Server
Backup Server
ProtecTIER Server
Physical capacity
Store up to 25 times or more backup data on given physical storage capacity
Represented capacity
Storage Impact from ProtecTIER Deduplication
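The relationship between represented (nominal) capacity and physical capacity is a simple ratio. The sketch below assumes a single constant factoring ratio, which is a simplification: the deck notes elsewhere that actual deduplication results vary with retention and data change rate. Function names are illustrative.

```python
def physical_capacity_tb(represented_tb: float, factoring_ratio: float) -> float:
    """Physical disk needed to hold `represented_tb` of backup data."""
    if factoring_ratio <= 0:
        raise ValueError("factoring ratio must be positive")
    return represented_tb / factoring_ratio


def represented_capacity_tb(physical_tb: float, factoring_ratio: float) -> float:
    """Backup data that fits on `physical_tb` of disk at the given ratio."""
    if factoring_ratio <= 0:
        raise ValueError("factoring ratio must be positive")
    return physical_tb * factoring_ratio


# At the deck's headline ratio of 25:1, 250 TB of backups need 10 TB of disk.
print(physical_capacity_tb(250, 25))    # 10.0
# A 36 TB repository would represent 900 TB at the same (assumed) ratio.
print(represented_capacity_tb(36, 25))  # 900
```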
Production Customers Deployment Results
Physical capacity
ProtecTIER Gateway
Backup Server
Backup Server
Represented capacity
Primary Site
Physical capacity
ProtecTIER Gateway
Backup Server
Secondary Site
IP-based WAN link
Tape library
Virtual cartridges can be cloned to tape at DR site
Deduplication enables large amounts of data to be replicated with significantly less bandwidth
Significantly Reduces Replication Bandwidth
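The bandwidth saving can be illustrated with a rough transfer-time estimate. This is a sketch under stated assumptions: the link speed, the daily backup volume and the deduplication ratio below are hypothetical inputs, not figures from the deck, and protocol overhead and compression are ignored.

```python
def replication_hours(backup_tb: float, link_mbit_per_s: float,
                      dedup_ratio: float = 1.0) -> float:
    """Hours needed to replicate a backup over a WAN link.

    Only the unique data (backup size divided by the dedup ratio) is sent.
    Uses decimal units: 1 TB = 1e12 bytes, 1 Mbit = 1e6 bits.
    """
    bytes_to_send = backup_tb * 1e12 / dedup_ratio
    bytes_per_second = link_mbit_per_s * 1e6 / 8
    return bytes_to_send / bytes_per_second / 3600


# Hypothetical example: 1 TB nightly backup over a 100 Mbit/s link.
print(f"{replication_hours(1, 100):.1f} h without deduplication")
print(f"{replication_hours(1, 100, dedup_ratio=25):.1f} h at an assumed 25:1 ratio")
```

With these inputs the full copy takes roughly a day while the deduplicated stream takes under an hour, which is the effect the slide describes.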
• One-to-one replication from primary to a DR site
• Flexible policy-driven replication
  – Choose which cartridges to replicate: one, some or all
  – Assign priorities and schedule when replication should occur
• Define cartridge “visibility”
  – Determine “where” virtual cartridges “exist” from ProtecTIER and/or backup application standpoint
  – ProtecTIER native replication will emulate moving tapes in & out of VTL’s import/export slots
  – Allows users to use the VTL export/import slots via backup app to “move” cartridges from one library/site to another
• Manage and monitor operations during disaster
  – Fail-over (when disaster occurs) & fail-back (to normal operation)
IBM ProtecTIER Native Replication Features
• Dramatic improvements in Disaster Recovery operations
  – Automates electronic transfer of backup data to a remote site
  – Leverages deduplication to only send unique data across the network
• Radical cost reduction to DR operations through deduplication-enabled replication
  – Bandwidth, one of the most expensive costs for replication, is greatly reduced
  – ProtecTIER requires less infrastructure at both the primary and secondary sites
• “Democratization” of replication: replication for the masses
  – Replication no longer reserved for Tier one applications only
  – Deduplication enables ALL applications to be replicated cost effectively
  – Reduces risk of data loss and speeds recovery for most other applications
• It's all about Recovery Time Objectives (RTO)
  – Replication gets data to the remote site faster and safer
  – Applications can get back online quicker with fast disk-based recovery
  – Deduplication enables more data to be protected with low RTO solution
IBM ProtecTIER Native Replication Benefits
System i Status
System i qualified PT 2.2 software package released on June 22 (AS400 & BRMS)
Tested/qualified in
Delivered as a standard ProtecTIER software upgrade
Release will be compatible with all current 3958 MTMs, no new HW requirements or feature codes
Support Specifications:
1. Will support all native AS400 SAVE commands as well as BRMS
2. Supported AS400 versions: V5R4M0, V5R4M5, V6R1M0
3. Supported AS400 IOP FC adapters: 2765, 5704, 5761
4. Supported PT library emulation: TS3500/LTO3
ProtecTIER repository can be shared between Open Systems and AS400 w/ separate virtual libraries
System i deduplication testing showed deduplication to be as effective as on open systems (actual deduplication results vary based on retention and data change rate)
Support of Native Replication targeted for GA in September
Deploying a ProtecTIER Solution
Flexible, Scalable and Easy to Deploy
ProtecTIER Deployment Options Overview
IBM System Storage TS7650 ProtecTIER Appliance
• Preconfigured solution optimized for performance and capacity
• Cost effective solution
• Easy to deploy, easy to maintain, easy to scale
IBM System Storage TS7650G ProtecTIER Gateway
• Needs higher performance or larger capacity
• Has unique environment or needs a flexible storage configuration
IBM System Storage TS7650G ProtecTIER Cluster
• Needs the highest possible performance
• Seeking higher availability
• The market’s most powerful and scalable configuration
Scalable Capacity and Performance
IBM TS7650 ProtecTIER® Deduplication Family
TS7650 Appliance
• 7 TB usable, up to 100 MB/sec: Good Performance, Highly Scalable, Low Cost
• 18 TB usable, up to 250 MB/sec: Better Performance, Larger Capacity, Scalable
• 36 TB usable, up to 500 MB/sec: Highest Performance, Largest Capacity
• 36 TB usable, Active-Active Cluster, up to 500 MB/sec: Highest Performance, Largest Capacity, High Availability
TS7650G Gateways
• Single Node, 1 PB usable, up to 500 MB/sec: High Performance, High Capacity, Flexible Storage
• Active-Active Cluster, 1 PB usable, up to 1000 MB/sec: Highest Performance, Largest Capacity, High Availability
Pre-configured for rapid deployment into existing backup environments
Extremely powerful solutions, featuring:
IBM ProtecTIER software with patented HyperFactor™ deduplication technology
IBM System x Server – multi-core server for enterprise-level performance
IBM Storage Controller with Fibre Channel drives - Proven reliability and performance
Complete solution that includes rack, cables, switches, and everything that is needed
IBM TS7650 ProtecTIER® Deduplication Appliance
Customer Profile for each Appliance Configuration
Ideal Customer for 7TB ProtecTIER Appliance
• 1 TB or less incremental backups per day
• 1-3 TBs full backups each week
• Experiencing average data growth
• Needs a cost effective solution
Ideal Customer for 18TB ProtecTIER Appliance
• 3 TBs or less incremental backups per day
• 3-6 TBs full backups each week
• Experiencing rapid data growth
• Needs good performance to meet backup window
Ideal Customer for 36TB ProtecTIER Appliance
• 5 TBs or less incremental backups per day
• 5-12 TBs full backups each week
• Additional growth expected
• Meeting the backup window is an issue; higher performance needed
* Note: These general guidelines are based on the backup workload that best fits each appliance configuration. Please use the Capacity Planning Tool to accurately size a solution to meet the customer’s specific requirements.
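The guidelines above can be read as a simple lookup. The sketch below is a rough first-pass filter only: it treats the upper bound of each range as a hard limit, which is an assumption, and it is no substitute for the Capacity Planning Tool the slide refers to. The table and function names are hypothetical.

```python
# (usable TB, max daily incremental TB, max weekly full TB), from the slide.
APPLIANCE_GUIDELINES = [
    (7, 1, 3),
    (18, 3, 6),
    (36, 5, 12),
]


def suggest_appliance(daily_incremental_tb: float, weekly_full_tb: float):
    """Return the smallest appliance (usable TB) whose guideline fits, else None."""
    for usable_tb, max_incremental, max_full in APPLIANCE_GUIDELINES:
        if daily_incremental_tb <= max_incremental and weekly_full_tb <= max_full:
            return usable_tb
    # Workload exceeds every appliance guideline; the deck positions the
    # TS7650G Gateway for larger capacity needs.
    return None


print(suggest_appliance(0.5, 2))  # 7
print(suggest_appliance(2, 5))    # 18
print(suggest_appliance(4, 10))   # 36
print(suggest_appliance(6, 20))   # None
```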
Powerful and Flexible solution, featuring:
• IBM ProtecTIER software with patented HyperFactor™ deduplication technology
• IBM System x Server – multi-core server for enterprise-level performance
Supports both IBM & Non-IBM disk
• IBM DS4000, DS5000, DS8000 and XIV
• HDS, EMC and others
And delivers:
• Up to 500 MB/sec or more performance
• Up to 25 times or more data reduction
• Scalable to 1 PB physical capacity
• Enterprise-class data integrity
TS7650G ProtecTIER Deduplication Gateway
Our most Powerful and Flexible solution:
• IBM ProtecTIER software with patented HyperFactor™ deduplication technology
• 2 IBM System x Servers – multi-core servers for maximum performance & availability
Supports both IBM & Non-IBM disk
• IBM DS4000, DS5000, DS8000 and XIV
• HDS, EMC and others
And delivers:
• Up to 1000 MB/sec or more performance
• Active-active cluster technology
• Two nodes working together as one repository
• Easily manageable yet highly scalable
TS7650G ProtecTIER Deduplication Cluster
• Active-Active 2-node cluster (architecture will allow for increasing node count over time)
• Full repository sharing among nodes
  – Writing data to the repository
  – Reading data from the repository (restore and read reference)
  – Access to all virtual devices
• No degradation on HyperFactor efficiency (regardless of the node through which the data is received)
• Minimum cluster down-time
TS7650G ProtecTIER Deduplication Cluster Features
IBM’s Fastest and Most Scalable Deduplication Solution!
Store up to 25 times or more data on disk
– 250TB reduced to only 10TB with enterprise class data integrity
Reduce backup and restore times
– High speed inline data deduplication at up to 1000MB/sec or more
Improve the reliability of backup operations
– Eliminates mechanical & handling failures
Drive the cost of disk based backup down
– Reduces energy, cooling, and space required
Increase data retention
– Store more backup data on disk for a longer time with very little additional cost
With an IBM ProtecTIER Solution you can . . .
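The effect of throughput on the backup window is straightforward arithmetic. The sketch below assumes decimal units (1 TB = 1,000,000 MB) and sustained peak throughput for the whole job, which real backups rarely achieve. The 10 TB workload is a hypothetical input, and the function name is illustrative.

```python
def backup_hours(data_tb: float, throughput_mb_per_s: float) -> float:
    """Hours to move `data_tb` at a sustained throughput in MB/sec."""
    if throughput_mb_per_s <= 0:
        raise ValueError("throughput must be positive")
    seconds = data_tb * 1_000_000 / throughput_mb_per_s
    return seconds / 3600


# Hypothetical 10 TB backup at the deck's single-node and cluster peak rates.
for rate in (500, 1000):
    print(f"{rate} MB/sec: {backup_hours(10, rate):.1f} h")
```

Doubling the throughput halves the window, so the 2-node cluster figure matters most where the single-node estimate already exceeds the available window.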
Customer Case Studies
Time Tested and Proven Technology
Business challenge
Faced with the complexity and manageability of backup to tape, this media industry giant was also starting to lack the datacenter floor space and power needed to accommodate future growth. With their data sets growing at least 50% year over year, the client needed to find a deduplication solution that would enable them to accommodate the growth, provide them with reliable backups and lower hardware acquisition and labor costs. This client chose the TS7650G ProtecTIER deduplication solution over incumbents SUN and EMC.
Solution
IBM System Storage and IBM ProtecTIER Deduplication Technology
• IBM XIV
• IBM TS7650G ProtecTIER Deduplication Gateway (2 node cluster)
• IBM TSM
Benefits
• Increased performance of backup & restore operations
• Reduced equipment, energy and space consumed
• Increased capacity enabled longer data retention
• Simple scalability for future data growth
ProtecTIER’s unique capabilities enable customers to protect data more efficiently and reliably and save money by reducing energy, floor space and maintenance requirements.
Media and Entertainment
Protect More. Store Less.™
Business challenge
This large banking institution serves over 30 million customers as an international banking business with a global footprint in over 36 countries. It operates one of the largest SANs in Europe, with nightly backup numbers exceeding 1,000 TB, and has more than 5 PB of centrally managed storage. Shrinking backup windows, backup failures and data growth at 55% CAGR led them to evaluate a new disk-based backup and recovery infrastructure. This client chose IBM’s TS7650G ProtecTIER deduplication solution over final contender Data Domain. This deal is now IBM’s largest ProtecTIER installation across Europe.
Solution
• 10 TS7650G ProtecTIER™ Deduplication Gateways
Benefits
• Executes backups to disk with a retention of 180 days, providing faster backups and even quicker restores
• Saved over 100 square meters of floor space by eliminating tape libraries through this implementation
• Off-site backups are no longer needed. Data is electronically copied and replicated safely and efficiently
• Enables customer to re-use existing disk infrastructure
Banking/Financial Services
Protect More. Store Less.™
IBM’s TS7650G ProtecTIER seamlessly integrated into an existing backup environment using TSM, removed the complexity of failed backups and restores, and will help them contain the growth rate of their data sets
Business challenge
Faced with the growing demand for online applications for cellular customers, this client is constantly adding new and creative applications to its repertoire to separate itself from the competition. They needed to create an infrastructure that is highly available and highly responsive to their customers’ needs. They launched an online backup application for their customers and needed high performance backups and, more importantly, high performance restores for when a customer lost their phone or was moving to a new phone. With close to 70 million customers, they needed to manage their data growth and selected ProtecTIER over Data Domain and EMC.
Solution
• ProtecTIER Deduplication Solution (third party servers/disk)
• NetBackup
Benefits
• Increased performance of backup & restore operations
• Reduced equipment, energy and space consumed
• Increased capacity enabled longer data retention
• Simple scalability for future data growth
Telecommunications
Protect More. Store Less.™
ProtecTIER is at the heart of this provider’s backup infrastructure, allowing them to increase their disk capacity and retain data for longer periods of time more cost effectively
Business challenge
This government retail exchange store was already familiar with VTLs, but did not have enough capacity to handle disk-to-disk backup for anticipated new workloads and data growth. Anticipating a 50% growth of data year over year, the client evaluated several solutions and chose IBM’s TS7650G over Data Domain for its performance, scalability and capacity.
Solution
• IBM TS7650G ProtecTIER™ Deduplication Gateway
• IBM DS4700 disk arrays (2)
• SVC
• IBM TSM
Benefits
• Increased performance of backup & restore operations – 20% faster restores
• Increased capacity enabled longer data retention – currently 18:1 factoring
• Eliminated multiple management and cost points vs. the competition
• Simple scalability for future data growth
Government
Protect More. Store Less.™
IBM’s TS7650G ProtecTIER enables customers to scale without impacting performance and seamlessly works with existing backup operations
ProtecTIER’s Position in the Deduplication Market
ProtecTIER’s Market Leadership
The only Enterprise-class deduplication solution on the market today
Launched in Q4 2005
– First VTL with Deduplication
Installed in all major industries
Vital Stats
– Global reach via IBM offices and Business Partners
– Over 300 customers worldwide; over 650 systems in production
– Over 35 PB of disk capacity under management
ProtecTIER’s Growing Market Presence
“Six Storage Companies to Watch” (July 2006)
Top 10 Hot Storage Startup! (March 2005)
Industry Leading Recognition
Deduplication Market at a Glance
Products compared: ProtecTIER with HyperFactor, DD880, DXi7500, S2100-ES2, VTL 700
[Comparison matrix. The slide's column alignment is not preserved; entries are grouped by row only. “!” and “Ø” are the slide's warning markers.]
DEDUPE TECHNOLOGY
• ProtecTIER with HyperFactor; RockSoft Hash-based (2 entries); SIR Hash-based; DeltaStor
• Inline; Inline Deduplication; ! Post process (3 entries)
• Block Level Deduplication; Block Level (3 entries); Ø File Level
• Not hash based; Byte-level diff comparison (2 entries); ! Potential Hash collision (3 entries)
RESOURCE UTILIZATION
• No disk staging area required (2 entries); Ø Staging area > twice the size of largest full backup; ! Staging area > than the size of largest full backup (2 entries)
• Only 4GB RAM needed for a 100TB repository; 24GB of RAM; ! Over 300GBs of RAM! (3 entries)
PERFORMANCE
• Single node performance 500 MB/s; ! 300 MB/s; ! 130 MB/s; ! 188 MB/s; ! 160 MB/s
• Dual node Cluster performance 1000MB/s; ! Clustering not available (2 entries); ! Clustering with Global Dedupe not available (2 entries)
See Notes (1) through (11)
Deduplication Market at a Glance
Products compared: ProtecTIER with HyperFactor, DD880, DXi7500, S2100-ES2, VTL 700
[Comparison matrix. The slide's column alignment is not preserved; entries are grouped by row only. “!” and “Ø” are the slide's warning markers.]
CAPACITY-SCALABILITY
• Single system can scale to 1PB capacity; ! 58TB Maximum useable capacity; ! Limited by rapid hash table growth (2 entries); ! Limited by huge storage requirements
• Up to 16 virtual tape libraries; Up to 64 virtual tape libraries; Up to 128 virtual tape libraries; Up to 192 virtual tape libraries; ! Limits not published
• Up to 512 virtual tape drives; Up to 160 virtual tape drives; Up to 1024 virtual drives; Up to 192 virtual tape drives; ! Limits not published
• Up to 512,000 virtual tape cartridges; Up to 130,000 virtual cartridges; Up to 64,000 virtual cartridges; Up to 5.3 million virtual cartridges; ! Limits not published
PRODUCT STABILITY
• ProtecTIER in production since 2006; In production since 2006; ! GA October 2008; ! GA May 2008; ! Post process
• Over 25 PB in production; Many small systems in production; ! Very few small customers (2 entries); Ø Almost no deduplication in production
• IBM in business for nearly 100 years; Acquired by EMC; ! Over $400 million in debt; ! Small struggling company; Ø Acquisition or failure imminent
MEETS ENTERPRISE REQUIREMENTS?
• YES; ! NO (4 entries)
See Notes (12-13) and (14-15)
Contact Dave Thiede
800-661-7761 x8022
www.proactivesolutions.com
For More Information on IBM’s ProtecTIER
© IBM Corporation 1994-2009. All rights reserved. References in this document to IBM products or services do not imply that IBM intends to make them available in every country.
Trademarks of International Business Machines Corporation in the United States, other countries, or both can be found on the World Wide Web at http://www.ibm.com/legal/copytrade.shtml.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries. Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both. Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both. UNIX is a registered trademark of The Open Group in the United States and other countries. Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both. Other company, product, or service names may be trademarks or service marks of others.
Information is provided "AS IS" without warranty of any kind.
The customer examples described are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics may vary by customer.
Information concerning non-IBM products was obtained from a supplier of these products, published announcement material, or other publicly available sources and does not constitute an endorsement of such products by IBM. Sources for non-IBM list prices and performance numbers are taken from publicly available information, including vendor announcements and vendor worldwide homepages. IBM has not tested these products and cannot confirm the accuracy of performance, capability, or any other claims related to non-IBM products. Questions on the capability of non-IBM products should be addressed to the supplier of those products.
All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Some information addresses anticipated future capabilities. Such information is not intended as a definitive statement of a commitment to specific levels of performance, function or delivery schedules with respect to any future products. Such commitments are only made in IBM product announcements. The information is presented here to communicate IBM's current investment and development activities as a good faith effort to help with our customers' future planning.
Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput or performance improvements equivalent to the ratios stated here.
Photographs shown may be engineering prototypes. Changes may be incorporated in production models.
Trademarks and Disclaimers