the premis working group: preservation metadata for digital repositories

25
The PREMIS Working Group: Preservation Metadata for Digital Repositories DLF Fall Forum October 26, 2004 Rebecca Guenther LC/NDMSO [email protected]

Upload: alvin-riley

Post on 30-Dec-2015

38 views

Category:

Documents


1 download

DESCRIPTION

The PREMIS Working Group: Preservation Metadata for Digital Repositories. DLF Fall Forum October 26, 2004 Rebecca Guenther LC/NDMSO [email protected]. Preservation Metadata Functions. Information that supports and documents the digital preservation process: - PowerPoint PPT Presentation

TRANSCRIPT

The PREMIS Working Group:Preservation Metadatafor Digital Repositories

DLF Fall ForumOctober 26, 2004 Rebecca GuentherLC/NDMSO [email protected]

Oct. 26, 2004 DLF-PREMIS 2

Preservation Metadata Functions

• Information that supports and documents the digital preservation process:• Establish provenance: track chain of custody and

alterations over time• Details authenticity• Documents technical processes object has

undergone• Describes technical details of object• Describes the environment from which it

originated• Specify rights management information

Oct. 26, 2004 DLF-PREMIS 3

Preservation Metadata Functions (cont.)

• Provide information to maintain resources over the long term:• viability: object’s bitstream is intact• renderability: object can be translated

to a form that can be viewed or used• understandability: rendered content can

be interpreted and understood

Oct. 26, 2004 DLF-PREMIS 4

Background• March 2000: OCLC and RLG jointly sponsor international

working group on preservation metadata• Identify key issues/challenges• Seek consensus on recommendations and best practice

• White paper (January 2001)• Defined preservation metadata; role in preservation process• Reviewed/synthesized existing preservation metadata schemes

• Preservation metadata framework (June 2002)• Comprehensive description of types of information constituting

preservation metadata• Based on OAIS information model• Set of “prototype” preservation metadata elements

Oct. 26, 2004 DLF-PREMIS 5

Aftermath …

• Framework …• Consolidated expertise• Provided foundation for developing formal

preservation metadata specifications• Common departure point for different

schema implementations

• But ... further scope for collaboration in preservation metadata• Needed best practices/recommendations for

implementing preservation metadata in real world digital archiving systems

Oct. 26, 2004 DLF-PREMIS 6

Issues unresolved in WG

• How minimal is a core preservation metadata element set?

• How much metadata can be generated automatically?

• Is it useful to apply metadata elements by object type or object behavior?

• Levels of granularity not addressed• Need to provide less abstract view of

preservation metadata for implementation

Oct. 26, 2004 DLF-PREMIS 7

PREMIS

• June 2003: OCLC and RLG sponsored new working group: PREMIS• Preservation Metadata: Implementation Strategies

• Objectives• Define “core” set of preservation metadata

elements, with supporting data dictionary, applicable to broad range of digital preservation activities

• Identify and evaluate alternative strategies for encoding, storing, managing, and exchanging preservation metadata

Oct. 26, 2004 DLF-PREMIS 8

Membership

• Priscilla Caplan, FCLA (Chair)• Rebecca Guenther, LC (Chair)• Michael Alexander, British Library• George Barnum, GPO• Charles Blair, U. of Chicago• Olaf Brandt, U. of Gottingen• Adam Farquhar, British Library• David Gewirtz, Yale• Kevin Glavash, MIT/Dspace• Cathy Hartman, U. of N. Texas• Helen Hodgart, British Library• Nancy Hoebelheinrich, Stanford• Roger Howard/Sally Hubbard, Getty

Museum• Pam Kircher, OCLC• John Kunze, Calif. Digital Library

• Brian Lavoie, OCLC liaison• Robin Dale, RLG liaison• Vicky McCarger, LA Times• Jerry McDonough, NYU/METS• Evan Owens, JSTOR• Erin Rhodes, NARA• Madi Solomon, Walt Disney Co.• Angela Spinazze, ATSPIN• Stefan Strathmann, U. of

Gottingen• Gunter Waibel, RLG• Lisa Weber, NARA• Robin Wendler, Harvard• Hilde van Wijngaarden, KB• Andrew Wilson, NAA

Oct. 26, 2004 DLF-PREMIS 9

Advisory Committee

• Howard Besser, UCLA• Liz Bishoff, OCLC (via Colorado

Digitization Program)• Gerard Clifton, National Library of

Australia• Gail Hodge, CENDI• Steve Knight, National Library of

New Zealand

• Maggie Jones, Digital Preservation Coalition

• Nancy McGovern, Cornell• Cliff Morgan, Wiley UK• Richard Rinehart, U. of California,

Berkeley

Oct. 26, 2004 DLF-PREMIS 10

PREMIS Subgroups

• Core elements• Establish core metadata elements and data dictionary• Developed a data model• Has had 2 face-to-face meetings• Weekly conference calls

• Implementation• Examine alternative strategies for encoding, storage

and management of preservation metadata• Conducted a survey of practices• Monthly conference call

• Expect to complete activities by end of 2004

Oct. 26, 2004 DLF-PREMIS 11

Core elements subgroup

• Development of data model• Objects• Events• Agents• Intellectual entities• Rights

• Data dictionary structured according to entities

Oct. 26, 2004 DLF-PREMIS 12

Core Elements

• Conducting element-by-element review of prototype elements from metadata framework• Is the element “core”?• How is it being used at WG members’

institutions?• How should it be implemented/populated?• Elements not covered by the framework?

Oct. 26, 2004 DLF-PREMIS 13

Objects

• Identifiers• Location• Descriptive metadata out of scope• Technical metadata not specific to

particular file format• Levels of objects: representation, file,

filestream, bitstream

Oct. 26, 2004 DLF-PREMIS 14

Objects:Technical metadata

• Object characteristics • Fixity• Size• Format (including link to format registry)• Inhibitors• Significant properties• Creating application information

• Environment (software, hardware)• Externally defined technical metadata (e.g.

Z39.87/MIX)

Oct. 26, 2004 DLF-PREMIS 15

Events

• Digital provenance/process information• Actions that involve one or more objects• May be related to one or more agents• Semantic units

• Event identifier• Event type• Event outcome• Event detail• Event date/time

Oct. 26, 2004 DLF-PREMIS 16

Agents

• Agent descriptions out of scope• Attributes of agents associated with

preservation events and rights management• May carry-out, authorize, or compel one or more

events • may create or act upon one or more objects• may hold or grant one or more rights

• Semantic units• Agent identifier• Agent name

Oct. 26, 2004 DLF-PREMIS 17

Rights and relationships

• Rights• Only in context of right to preserve• Collecting rights use cases

• Relationships• Data model expresses relationships

between entities• Relationships between objects

• Derivative, dependency, structural

Oct. 26, 2004 DLF-PREMIS 20

Implementation Strategies subgroup

• Conducted survey of preservation repositories to explore the state of the art

• Questions about policies, governance, funding, system architecture, preservation strategies, metadata implementation

• 70 surveys sent• Responses from 28 libraries, 7 archives, 14

other in 13 different countries• 10 national libraries, 6 national archives• Survey published Oct. 2004

Oct. 26, 2004 DLF-PREMIS 21

Survey findings

• Little experience with digital preservation• Most didn’t have active preservation strategy• Many not yet in production• Cannot assess adequacy of metadata

• Lack of common vocabulary and conceptual framework• Informed by OAIS reference model• Difference of opinion as to meaning of OAIS

compliance

Oct. 26, 2004 DLF-PREMIS 22

Survey findings (cont.)

• Metadata• Many recording rights, provenance, technical,

administrative, descriptive and structural

• Consistent roles in preservation scope and policies (academic libraries, archives, national libraries)

• Substantial use of METS, Z39.87/MIX, OCLC sets

• Most repositories serve goals of both preservation and access

Oct. 26, 2004 DLF-PREMIS 23

Trends• Store metadata redundantly in XML or

relational database and with content data objects

• Use METS for structural metadata and as container for descriptive and administrative; MIX for images

• Use OAIS as framework and starting point• Maintain multiple versions (originals, some

normalized or migrated) in repository with complete metadata for all versions

• Choose multiple strategies for digital preservation

Oct. 26, 2004 DLF-PREMIS 24

Looking ahead

• Finalize core preservation metadata elements set

• Complete data dictionary

• XML schemas to support exchange of core elements for digital provenance/process and technical metadata

• Final PREMIS report by end of 2004

• Community outreach: opportunities for public comment

• Follow-on activities?

Oct. 26, 2004 DLF-PREMIS 25

More information…

• PREMIS Web site:http://www.oclc.org/research/projects/pmwg/

• “Implementing Metadata in Digital Preservation Systems: The PREMIS Activity” D-Lib (April ‘04)http://www.dlib.org/dlib/april04/lavoie/04lavoie.html

• Rebecca Guenther: [email protected]

• Priscilla Caplan: [email protected]