1 “cross-industry preservation architectures” – pasig may, 2011 cross-industry preservation...

23
1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

Upload: genevieve-mier

Post on 14-Dec-2015

215 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

1“Cross-Industry Preservation Architectures” – PASIG May, 2011

Cross-IndustryPreservation Architectures

Michael PetersonMay 2011

Page 2: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

2“Cross-Industry Preservation Architectures” – PASIG May, 2011

Introductions

●Michael Peterson Founder, past President, and Chief Strategy

Advocate for the SNIA Currently working on Cloud Archive and Long-term

Retention standards, best practices, market education Information Services Architect – consulting in long-

term retention and digital preservation system design and implementation

Currently driving the LTDPRM.org & ILM2.0.org Communities

Author: “100 Year Archive Requirements Study,” 2008, “Building a Terminology Bridge: Guidelines for Digital Information Retention and Preservation Practices in the Datacenter,” Sept. 2009

Page 3: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

3“Cross-Industry Preservation Architectures” – PASIG May, 2011

www.ltdprm.org

Page 4: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

4“Cross-Industry Preservation Architectures” – PASIG May, 2011

Agenda

●LTDP Paradoxes (Laws designed to be broken)

●New Digital Preservation Models That the cloud empowers

●Using the Cloud for Digital Archives (Digital Preservation)

Page 5: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

5“Cross-Industry Preservation Architectures” – PASIG May, 2011

4 Paradoxes of Digital Preservation

●Data will be lost●Migration does not

scale●Access & use models

keep changing●Cost overwhelms

everything complexity does not

Page 6: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

6“Cross-Industry Preservation Architectures” – PASIG May, 2011

How do we break them?“Old” Laws of Digital Preservation

Page 7: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

7“Cross-Industry Preservation Architectures” – PASIG May, 2011

100 Year Complexity Barrier

●Overwhelming growth, cost, change Constant Physical and Logical migration Power, cooling, space, people, resources,

maintenance,… Always adding & migrating systems, networking,

storage Managing thousands of formats Constant Auditing and recovery of damaged or lost

data Thousands of moving parts Complex systems and architectures Changing software platforms

Page 8: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

8“Cross-Industry Preservation Architectures” – PASIG May, 2011

Aha!

Move from “physical” preservation architectures and design as in physical media or a physical

repository (a 2002 ‘OAIS’)to

“virtualized” Preservation Services based on Service Management principles

(Sounds like the Cloud…)

Page 9: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

9“Cross-Industry Preservation Architectures” – PASIG May, 2011

New Digital Preservation Models

Using the Cloud

Page 10: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

10“Cross-Industry Preservation Architectures” – PASIG May, 2011

“Physical” Doesn’t Scale

●Old Architecture “Storing digital images

effectively requires standards related to the storage media, such as CD- ROMs, and the file formats, such as TIFF.” Source: “A Resource List for Standards Related to Digital Imaging” Dec. 2010

Physical Application & Storage infrastructure

●Physical Standards Architecture: OAIS 2002 Metadata: Dublin Core:

ISO 15836:2009 Storage Media: ISO

18921:2008, ISO 18925:2008, Digitization: ISO/IEC 10918-1/Cor1:2005, ISO/IEC 10918-3/Amd1:1999

File Formats: ISO 19005-1:2005, Adobe TIFF Specification, V6, 1992

Transfer Protocols: ISO 15740:2008

Page 11: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

11“Cross-Industry Preservation Architectures” – PASIG May, 2011

“Infrastructure Virtualization”

●New Architecture Media independent System Architecture

virtualized, self-protecting, cloud based, and self healing

Integrated migration & transformation services

Virtualized historical applications hosted in the cloud in specialized containers running in virtual machines

●New Standards Architecture: OAIS 2010 Metadata: FCIS, PREMIS,

IETF Cloud: SNIA-CDMI Interoperability: NIST

Smart Grid Framework, Cloud and Interoperability workgroups

Object Containerization: SNIA-SIRF

Page 12: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

12“Cross-Industry Preservation Architectures” – PASIG May, 2011

Add “Information Virtualization”

●Portable Information Objects Extensible Preservation Objects Location, media independent Secure, auditable, authentic,

portable Self-healing

●On-demand, virtual emulation “Jumpbox” hosted emulators Populations of legacy ‘readers’ Web-based delivery and access

●New Standards Architecture: OAIS 2010 Metadata: FCIS, PREMIS,

IETF Cloud: SNIA-CDMI Interoperability: NIST

Smart Grid Framework, Cloud and Interoperability workgroups

Object Containerization: SNIA-SIRF and CDMI

Page 13: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

13“Cross-Industry Preservation Architectures” – PASIG May, 2011

Move to “Managed”

Content Management

Service Management ITIL, ITSM, ILM2.0,

Information Governance

Litigation ‘Ready’Preservation begins

at “Creation”Preservation is a new Datacenter Practice

Operating Practices:ITSM - IT Service Mgmt.

ITIL-IT Infrastructure Library

ILM2.0 - Service mgmt. based approach to information mgmt. and automation

Regulatory Compliance

Page 14: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

14“Cross-Industry Preservation Architectures” – PASIG May, 2011

And to “Virtual Services” in the Cloud

●Platform as a Service

● Infrastructure as a Service

●Storage as a Service●Evolving Web access

and use models●Private, Hybrid,

Public Clouds Multiple clouds, multiple

providers

Page 15: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

21“Cross-Industry Preservation Architectures” – PASIG May, 2011

Using the “Cloud” in Preservation

●Most likely use-cases: Private and Hybrid

clouds Virtualize infrastructure Virtualize delivery and

access Virtualize emulation Virtualize information

Providing portability

●Examples Web-access models Web-drop boxes Agile, Scalable, cost

effective compute and storage resources (on demand)

Virtual emulation Demand spikes

Disaster recovery Distributed data sets Infrastructure extensions

Page 16: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

22“Cross-Industry Preservation Architectures” – PASIG May, 2011

Emerging Cloud Standards

●Cloud Data Management Interface, CDMI SNIA to ISO: storage-to-cloud, cloud-to-cloud

interchange format

●Self-contained Information Retention Format, SIRF SNIA to ISO: extensible preservation object format

●Interoperability ISO project: Data Preservation Interchange

Framework, DPIF

Page 17: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

23“Cross-Industry Preservation Architectures” – PASIG May, 2011

Cloud Data Management Interface

●Data Portability Standard with an Object Storage Interface Move data and metadata in standard portable

containers in and out of the cloud and between clouds

Simple XML container of objects plus metadata

●A data and information services management interface and control path Operate services through CDMI

Rules and Policies in metadata Cloud Peering – cloud to cloud communications

Page 18: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

24“Cross-Industry Preservation Architectures” – PASIG May, 2011

Design for the Cloud

●Considerations Establish Service

Objectives Include verification of

recovery, authenticity, availability, digital audit, etc.

Consider using multiple cloud destinations or local and remote copies for increased reliability and availability

Beware of excessive moving of data across the WAN due to high I/O and bandwidth costs

●Evaluate Cloud providers Establish strong contracts

●Test and Audit All required services

●Use CDMI !

Page 19: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

25“Cross-Industry Preservation Architectures” – PASIG May, 2011

Cloud Contract Considerations

CostsRetention

Management Preservation/Integrity/

Authentication Return and Secure

Disposal – Subpoenas, Control

Legal Hold Digital Audits &

Verification Physical and logical

migration practices and authenticity verifications

•Access Availability, Protection,

Security & Confidentiality

Search/Discovery Multi‐Cloud Provider

Relationships

• Right to Conduct Forensic Exams

•Cross‐Border Data Transfers

Page 20: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

26“Cross-Industry Preservation Architectures” – PASIG May, 2011

Summary ThoughtsPreservation Architectures: Virtualization and Cloud

Page 21: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

27“Cross-Industry Preservation Architectures” – PASIG May, 2011

Move to Virtual Preservation

●Shift thinking from “Physical” Preservation to “Virtual”

●Virtualization Applies in many ways System, storage, application, infrastructure Information Migration – both physical and logical Cost reduction

●Conclusion: ‘Cloud’ has a positive role

Page 22: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

28“Cross-Industry Preservation Architectures” – PASIG May, 2011

Using the Cloud

●Start out Private, Move to Hybrid●Apply Service Management Principles

Classify, Requirements, SLAs, Design, Audit, Improve

●Design for the Cloud Create strong and measureable SLA style contracts Test, Audit, Verify

●Use and Promote CDMI Need cloud interface, management, and

information portability standards

Page 23: 1 “Cross-Industry Preservation Architectures” – PASIG May, 2011 Cross-Industry Preservation Architectures Michael Peterson May 2011

29“Cross-Industry Preservation Architectures” – PASIG May, 2011

Contact Information

●Michael Peterson IMERGE consulting and LTDPRM.org [email protected] (805)201-3178