© 2009 ibm corporation ibm information infrastructure addressing information retention ibm...
TRANSCRIPT
© 2009 IBM Corporation
IBM Information Infrastructure
Addressing Information Retention
IBM Information Infrastructure Archiving Leadership
Name: Tony PearsonDate: February 26, 2009
© 2009 IBM Corporation 2
IBM Information Infrastructure
Agenda Information Infrastructure Challenges
IBM’s approach
Information Retention
Why IBM
© 2009 IBM Corporation 3
IBM Information Infrastructure
Information Growth and Drivers
Application driven growth
–Email–E-commerce –ERP/CRM
New applications
–Web 2.0 –Digital video, voice, audio
Data multiplier effect
–Backup, Disaster Recovery–Test, Development
Mergers and acquisitions
Source: IDC "Storage Infrastructure: Innovations for the Future Datacenter," Doc # DR2008_1RV, March 2008
Structured
Unstructured
growing at 32 %
growing at 64 %
Structured Data: database data
Unstructured Data also called ‘content’: files, medical images, docs, web content, rich media files etc.
© 2009 IBM Corporation 4
IBM Information Infrastructure
IBM Information InfrastructureComplete, Integrated, Available Today
Manage information more effectively and mitigate information risks …
with a dynamic infrastructure
… that efficiently and securely stores, protects, and provides optimized access to information.
© 2009 IBM Corporation 5
IBM Information Infrastructure
What are your risks?
Have you evaluated the cost of storing and managing all this information?
Is your infrastructure optimized based on your retention?
Can you backup and recover your servers fast and reliably enough?
How are you addressing these requirements today?
© 2009 IBM Corporation 6
IBM Information Infrastructure
IBM Information Infrastructurefor efficient Information Retention
Support daily activities
Business Value
Future business development
Preservation
Infrastructure Concerns
Cost and Performance
Cultural or Heritage Needs
In a marriage of Big Red and Big Blue, the extensive marketing records of the Coca-Cola Company -- from calendars created by Norman Rockwell to corporate memoranda to the famous commercial in which a chorus proclaims its desire to ''teach the world to sing'' -- have been digitized in an online archive with the help of I.B.M.
"All these wonderful books are only of use if they're read," said the Rev. Leonard Boyle, prefect of the Vatican Library. He said the I.B.M. project would put the library's manuscripts and texts in digital form as a way of broadening the library's reach.
© 2009 IBM Corporation 7
IBM Information Infrastructure
Information RetentionAreas of focus
Storage infrastructure optimizationA comprehensive line of tiered storage, across disk and tape technologies.
Policy-based automation software for space management and data retention to move data to the appropriate tier of storage.
Data deduplication and compression technologies for storage capacity gains.
Content Collection and Archiving Intelligently managing inactive or infrequently-accessed data that still has value,
while providing the ability to search and retrieve the information during a specified retention period.
Integrate with information management solutions from IBM and Technology Partners to handle file systems, databases, email, SharePoint and content management.
Long term retentionFuture-proofing information with data migration capabilities and forward compatibility.
© 2009 IBM Corporation
IBM Information Infrastructure
Storage Infrastructure Optimization
© 2009 IBM Corporation 9
IBM Information Infrastructure
Tape
$946,405
$7
$3.5
$0
Mill
ion
s
SATA Disk
$6,365,950
Blended Disk and Tape
$2,255,346
Hardware
Prod + DR Carts
Maintenance
Power & Cooling
Floor space
Consider the long-term costs of ownership
SATA disk lower cost access to online data than FC disk
Tape less than disk and consumes less energy, but often not ideal for online access
Blended solution:
– Online access to most recent content
– Lower cost, energy-efficiency for long-term
Storage infrastructure optimization Benefit from a blended solution
Cut TCO 50% with Blended Tape and Disk*10 year TCO example. Assumes 250TB storage, 25% growth/yr
* TCO estimates based on IBM internal studies.
© 2009 IBM Corporation 10
IBM Information Infrastructure
Fully Managed Costs: Storage Options
Rel
ativ
e C
ost /
GB
/ Ye
ar
High PerfDisk
0
10
20
30
40
50
60
70
Hi Capacity Disk
Online Archive
Online Tape
Offline Tape
Data Access Needs Throughout Lifecycle
Storage infrastructure optimization Reduce Cost with an Automated Migration Process
Align the business value of information with the most cost effective Information Infrastructure.
Policy-driven, Automated Data Movement
Today: Move data to the most cost effective storage for its current use based on policies
Future: Move data to the most power efficient storage dynamically to satisfy usage requirements
1 Hour
5 Years
20 Years
50 Years
100+
Years
2 Months
3 Years
Slow/Low
Fast/High
1 Day
© 2009 IBM Corporation 11
IBM Information Infrastructure
Storage infrastructure optimization Drive exceptional efficiency with data deduplication and compression
Revolutionary in-line data deduplication
– Dramatically reduced disk storage requirements – up to 25X reduction
• Store up to 25 TB of backups onto 1 TB of disk, in 8 hours
• Up to 9x faster than competitors
– Reduced power, space and cooling requirements
– Improved management - simplifies and speeds information protection
Two to one, Three to one tape compression
– Increase tape capacity utilization
Disk Space
© 2009 IBM Corporation
IBM Information Infrastructure
Content Collection and Archiving
© 2009 IBM Corporation 13
IBM Information Infrastructure
ArchivingThe foundation for Information Retention
Not the same as keeping backups foreverLong term in nature
Selected, preserved, disposed according to policyFor retrieval, not recovery
Adds operational efficiencies
Archiving is an intelligent process for managing inactive or infrequently accessed data, that still has value, while providing the
ability to preserve, search and retrieve the data during a specified retention period.
© 2009 IBM Corporation 14
IBM Information Infrastructure
Archiving Benefits
Improve Productivity– Improved Application Availability and Performance– Faster Backup and Recovery
Manage Risks– Adhere to internal corporate policies
– Reduce manual operations
Drive Operational Efficiency– Policy base automation
– Inactive or infrequently accessed data retainedin lower cost storage (disk and tape) for efficiency
– Data migrated between tiers automatically
Reduce Cost– Reduce infrastructure costs
• Power and Cooling requirements• Hardware and Software
Mitigate business risk, reduce costs, improve competitiveness
© 2009 IBM Corporation 15
IBM Information Infrastructure
IBM ECM
RecordsManagement
ElectronicDiscovery
AdvancedClassification
ContentCollection
1
3
2 4
IBM Enterprise Content Management (ECM)Key building block for Archive and Retention
© 2009 IBM Corporation 16
IBM Information Infrastructure
IBM Strategic Archiving & Retention Offerings
Archive / HSMInfrastructure
Information
Retention
Systems
Applications
Data content layer IBM CommonStore, IBM Content
Manager, IBM Content Collector IBM FileNet P8 Content Manager, Image
Services, SAP Connector IBM Records Crawler IBM Optim Grid Medical Archive Solution (GMAS)
Policy Management layer Tivoli Storage Manager System Storage Archive Manager Data Facility Storage Management
Subsystem
Storage Media layer IBM DR550 (includes SSAM) IBM Tape Systems IBM Disk Systems including N series with
SnapLock™ feature
Services layer IBM Enterprise Archive Services
ArchivingApplication
FileNetCommon
Store
E-mailE-mailFilesFiles Records Records ImagesImages
DBMS DBMS Siebel Siebel
PeoplesoftPeoplesoft
Optim
SSAM
PACSPACS
GMAS
SAPSAP
TSM
TSM / SSAM Client
DFSMS
Ent
erpr
ise
Arc
hive
Ser
vice
s
Content Collector
N series w/SnapLock
Disk Tape DR550
© 2009 IBM Corporation 17
IBM Information Infrastructure
IBM Enterprise Archive Services
Enterprise deployment of architecture Enterprise deployment of services Enterprise deployment of organization Enterprise deployment of technology
ArchivingAssessment
Develop Archiving
Strategy
ArchivingSolution
Architecture
Solution Integration
Manage and Run
Assess business lines' information retention requirements and IT abilities to support them
Identify gaps Provide recommendations Cost Benefit analysis
Create archiving strategy Define transition planning Design solution approach Cost Benefit analysis
Create/update enterprise archive architecture based on reference architecture
Define the classes of service Organizational impact / impact on existing architecture High level Solution design Cost Benefit Analysis
Identify all areas in scope Logical and physical design of pilots within
reference architecture Define certification criteria Complete solution architecture
Tell me what to do
Help me to do it
Do it for me
© 2009 IBM Corporation
IBM Information Infrastructure
Long Term Retention
© 2009 IBM Corporation 19
IBM Information Infrastructure
Yedioth Ahronoth, Sunday June 1, 2008
The “Digital Dark Ages”
© 2009 IBM Corporation 20
IBM Information Infrastructure
Microsoft Office 2003 SP3 Blocks older File Formats
“The move was done for security, says Microsoft, but still bewilders users “
“Office users will no longer be able to open files in 24 older file formats from Lotus, Corel, and most versions of MS Office products before 2000”
"The decision to block the formats is strictly to protect your machine from being compromised." according to Microsoft
To restore file formats from the dead users must edit the Windows registry – which Microsoft warns against.
Logical Preservation is not a 100 Year Problem
Source: Computerworld.com artilce by Gregg Keizer http://www.computerworld.com/action/article.do?command=viewArticleBasic&articleId=9055138
ComputerworldJanuary 2008
© 2009 IBM Corporation 21
IBM Information Infrastructure
Approaches to Long Term Digital Preservation
What are Companies doing today?– Keeping old servers, storage and
applications (Museum approach)
– Emulating old systems with new systems (Emulation approach)
– Migrating data to new technology (Migration approach)
Areas IBM is focusing its research– Methods for descriptive metadata to
enable future rendering (Descriptive approach)
– Encapsulation of data, metadata and related application logic for future processing (Encapsulation approach)
© 2009 IBM Corporation 22
IBM Information Infrastructure
Will you be able to access your data in 20 years? 50 years?
You must future proof your information
Bit Preservation Logical Preservation
How do you retrieve a bit perfect copy of digital data after years or decades?
Once you’ve retrieved the bit perfect copy, how do you
productively use the data?
This information was created a few
years ago
The media and content are still accessible
Mayan Glyph, Palenque ~630AD.Coronation of King Pacal on 26 March, 603
Long-term retention Preserve information
© 2009 IBM Corporation 23
IBM Information Infrastructure
Global Technology Services
Planning, design and
implementation services
specifically for security,
compliance, archive &
retention of information
for the IT staff
Software Integrated middleware and
software for managing a wide variety of content. Content collection, records management, classification and ediscovery/search
Integrated Solutions Soft bundles for customers
needing maximum flexibility with ease of ordering and deployment.
Appliances for customers desiring turnkey solutions and simplicity
Global Business Services Line of business oriented consulting and
services around archive and information retention for the business executive
Storage & Servers Storage systems
providing multiple tiers of storage, performance, scale, and cost
Storage software running on highly reliable and scalable servers.
IBM Provides the Complete Information Infrastructure
Information
Retention
© 2009 IBM Corporation 24
IBM Information Infrastructure
Global Archive Solutions Center
Current Topics include
– IBM Archive Solutions Overview
– Email Archive Solutions Overview with solution demo
– Database Archive Solutions Overview with solution demo
– Space Management for File Systems with Demo
– File System Archive with DR550 and N series with Snaplock demos
SMEs developing additional integrated archive solutions such as:
– GMAS, SAP Archiving, FileNet P8 and IBM Content Manager
Contact Us To book a briefing, contact your IBM Representative, IBM Business Partner, or Briefing
Center Coordinator, Lee Olguin at [email protected]. For more information, visit: http://www-03.ibm.com/systems/services/briefingcenter/tbc/
© 2009 IBM Corporation 25
IBM Information Infrastructure
Quick assessment
Storage Infrastructure Optimization– What is driving your information growth: databases, email, or other files?
– Are you able to control your IT budget to handle current and anticipated information growth?
– Are you aware of inactive, orphaned or stale data on your most expensive storage systems?
– Do you manually migrate data to less expensive storage to help reduce costs?
Content Collection and Archiving– Has information growth impacted the performance of your key applications?
– Are you aware of recent changes in legislation that require information to be retained longer?
– Would you pass an audit if a regulator asked to see how electronic data is stored, discovered, retrieved and deleted?
Long-term Retention– Under what circumstances does your company request information to be
kept for 7 years or longer?
© 2009 IBM Corporation 26
IBM Information Infrastructure
Getting Started
Visit ibm.com/information_infrastructure for more information
Download whitepaper “Addressing Archiving and Retention Challenges”
Listen to podcast “Addressing Archiving and Retention Challenges”
Sign up for an in-depth briefing at one of our Briefing Centers worldwide
Visit http://www-935.ibm.com/services/us/index.wss/offerfamily/gts/a1027722 to schedule an Archive Workshop with GTS
© 2009 IBM Corporation 27
IBM Information Infrastructure
© 2009 IBM Corporation 28
IBM Information Infrastructure
Additional information
Retention Systems
IBM Solutions portfolio
Customer References
© 2009 IBM Corporation 29
IBM Information InfrastructureT
ivo
li S
tora
ge
Man
ager
Sys
tem
Sto
rag
e A
rch
ive
Man
ager
Gri
d A
cces
s M
anag
er
Non-erasableNon-rewriteable
Read/Write
DR550 N Series SnapLock
Tape Systems
SATA disk
Online archive
Removable WORM
Nearline and Offline archive
IBM Information Retention Systems
© 2009 IBM Corporation 30
IBM Information Infrastructure
50 Years of Disk
DS3000
Entry Level
DS4000 DS6000 DS8000 Turbo
Midrange Disk FamilyEnterprise Disk (z & open)
Enterprise Storage Continuum
DS5000 XIV
High-end Open Systems Disk
Modular
IBM System Storage DS family
© 2009 IBM Corporation 31
IBM Information Infrastructure
IBM System Storage™ N series Hardware Portfolio
Entry LevelN3000
Midrange N5000/N6000
Enterprise N7000
Gat
eway
sE
xpan
sion
dr
awer
sF
ilers
EXN4000 FC Storage Expansion Unit (4Gb)
EXN1000 SATA Storage Expansion Unit
N7700 N7900N3300
N3600
N7700 N7900
Gateways use external storage
N6040 N6070
N6070N6040 N6060
N6060
© 2009 IBM Corporation 32
IBM Information Infrastructure
IBM System Storage TS family
Entry Level Midrange Enterprise Class
TS1030 (LTO3)TS1040 (LTO4) TS1130 (3592)
3494
TS3200
TS3100TS3310
TS7530(distributed)
TS7740(mainframe)
Tap
e dr
ives
Tap
e au
tom
atio
nT
ape
virt
ualiz
atio
n
TS3400
TS3500
TS2230
TS2900
TS7650G(distributed)
© 2009 IBM Corporation 33
IBM Information Infrastructure
IBM Retention Solutions enabled with Leading ISVs & PACS vendors
ISVs (ECM and Archiving)
– Autonomy (ex Zantaz)
– AXS-One Inc.
– BrainTribe (Formerly Comprendium)
– Bycast Inc.
– IBM CommonStore
– IBM Content Manager
– IBM FileNet
– IBM Optim (Princeton Softech)
– CaminoSoft – Ceyoniq
– d.velop AG– Front Porch Digital– Easy Software– Enigma Data Systems– Gamma Systems– HABEL GmbH & Co. KG– Heilig & Schubert (H&S)– Hummingbird– Hyland Software (OnBase)– Hyperwave– Interwoven– Lighthouse Global
Technologies
– Mimosa Systems
– MBS Technologies
– OpenText (formerly IXOS)
– PBS Software GmbH
– RJS Software Systems
– Saperion
– SER Solutions
– Solix Technologies
– Stellar Technologies
– Symantec / Veritas / KVS EV
– Windream
PACS (Medical Imaging) vendors
Siemens, AGFA, GE Healthcare, Philips, McKesson, Cerner, Fujifilm, eRadPACS, ACUO Technologies, DeJarnette
© 2009 IBM Corporation 34
IBM Information Infrastructure
Solution IBM Grid Medical Archive Solution storing
30TB of PACS images and clinic files annually
Total Solution Components:
IBM System Storage DS4000 disk systems, Enterprise Storage Server 800
Plans to implement tape tier in future for less frequently accessed data.
IBM GTS Implementation Services
Result Non-disruption of production environment
Eliminated traditional back-up and restoration window
Improved performance - reduced time to image access by 70%
All hospitals within network can share information
Reduced data management costs by 90%
Need high capacity, scalable and automated archival solution for PACS digital images
Solution to span across multiple centers
Near continuous access to imaging files with security
Reduce archive management TCO dramatically
Iowa Health Improves Productivity
© 2009 IBM Corporation 35
IBM Information Infrastructure
Solution IBM GMAS Spinning disk solution
using IBM and Grid Access Mgr SW
Grid spans across two data centers
Utilizes repurposed legacy storage
Supports AGFA enterprise application
Result Storage cost optimization and TCO reduction
long-term data archive Virtualization across existing storage Digital signatures for no single point of failure
data integrity and "corruption correction" Auto-replication and encryption of all patient
images Reduction in application down-time regardless
of growth & migration needs
Support for two datacenters, 200+ TBs of patient images Multiple points of failure placing data at risk Explosive growth of clinical patient data, requiring continuous availability to physicians Desire for multi-vendor strategy demanding open, industry standard interfaces Flexibility to absorb Storage media growth & manage data migration without down-time FTEs overly engaged on current HW problem determination/issues vs proactive activities
© 2009 IBM Corporation 36
IBM Information Infrastructure
Solution Includes IBM Information Infrastructure for
SAP data retention and compliance
IBM CommonStore for SAP
IBM System Storage DR550
IBM Tivoli Storage Manager
Result Optimized storage infrastructure for the
entire information lifecycle, from creation to disposal – leveraging disk and tape storage
Data is protected against deletion or modification
Retain copies of critical SAP data for extended periods of time to comply with government regulations
Reduce the cost of managing and storing data
is4 Optimizes storage
“With IBM storage solutions, we’ve been able to cost-effectively
manage and retain critical SAP data and leverage its business value
throughout its lifecycle.”
© 2009 IBM Corporation 37
IBM Information Infrastructure
Solution IBM Content Manager, Tivoli Storage
Manager, WebSphere Application Server, DB2, AIX
IBM p570 system, IBM DS6800 storage system, IBM 358494 Tape Libraries, Plasmon G638 Optical Library
IBM Global Services and Almaden Research Laboratory
Result Estimated savings of $5m (USD) per year
by avoiding manual cataloging and storing Preservation of and improved access to the
national cultural heritage of The Netherlands
Better, easier and faster access to information
Collect, maintain and preserve an archive of all publications (books, papers, periodicals, scientific publications) issued in The Netherlands
Store & retrieve electronic publications on a large scale
Required Long Term Preservation
Koninklijke Bibliotheek -(National Library of the Netherlands) Preserves information
© 2009 IBM Corporation 38
IBM Information Infrastructure
Disclaimer
IBM's customer is responsible for ensuring its own compliance with legal requirements. It is the customer's sole responsibility to obtain advice of competent legal counsel as to the identification and interpretation of any relevant laws and regulatory requirements that may affect the customer's business and any actions the customer may need to take to comply with such laws.
IBM does not provide legal advice or represent or warrant that its services or products will ensure that the customer is in compliance with any law.
The information contained in this documentation is provided for informational purposes only. While efforts were made to verify the completeness and accuracy of the information provided, it is provided “as is” without warranty of any kind, express or implied. IBM shall not be responsible for any damages arising out of the use of, or otherwise related to, this documentation or any other documentation. Nothing contained in this documentation is intended to, nor shall have the effect of, creating any warranties or representations from IBM (or its suppliers or licensors), or altering the terms and conditions of the applicable license agreement governing the use of IBM software.
© 2009 IBM Corporation 39
IBM Information Infrastructure
8 IBM Corporation 1994-2008. All rights reserved.References in this document to IBM products or services do not imply that IBM intends to make them available in every country.
Trademarks of International Business Machines Corporation in the United States, other countries, or both can be found on the World Wide Web at http://www.ibm.com/legal/copytrade.shtml.
Intel, Intel logo, Intel Inside, Intel Inside logo, Intel Centrino, Intel Centrino logo, Celeron, Intel Xeon, Intel SpeedStep, Itanium, and Pentium are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.Linux is a registered trademark of Linus Torvalds in the United States, other countries, or both.Microsoft, Windows, Windows NT, and the Windows logo are trademarks of Microsoft Corporation in the United States, other countries, or both.UNIX is a registered trademark of The Open Group in the United States and other countries.Java and all Java-based trademarks are trademarks of Sun Microsystems, Inc. in the United States, other countries, or both.Other company, product, or service names may be trademarks or service marks of others.
Information is provided "AS IS" without warranty of any kind.
The customer examples described are presented as illustrations of how those customers have used IBM products and the results they may have achieved. Actual environmental costs and performance characteristics may vary by customer.
Information concerning non-IBM products was obtained from a supplier of these products, published announcement material, or other publicly available sources and does not constitute an endorsement of such products by IBM. Sources for non-IBM list prices and performance numbers are taken from publicly available information, including vendor announcements and vendor worldwide homepages. IBM has not tested these products and cannot confirm the accuracy of performance, capability, or any other claims related to non-IBM products. Questions on the capability of non-IBM products should be addressed to the supplier of those products.
All statements regarding IBM future direction and intent are subject to change or withdrawal without notice, and represent goals and objectives only.
Some information addresses anticipated future capabilities. Such information is not intended as a definitive statement of a commitment to specific levels of performance, function or delivery schedules with respect to any future products. Such commitments are only made in IBM product announcements. The information is presented here to communicate IBM's current investment and development activities as a good faith effort to help with our customers' future planning.
Performance is based on measurements and projections using standard IBM benchmarks in a controlled environment. The actual throughput or performance that any user will experience will vary depending upon considerations such as the amount of multiprogramming in the user's job stream, the I/O configuration, the storage configuration, and the workload processed. Therefore, no assurance can be given that an individual user will achieve throughput or performance improvements equivalent to the ratios stated here.
Photographs shown may be engineering prototypes. Changes may be incorporated in production models.
Trademarks and Disclaimers