introduction to mets (metadata encoding and transmission standard) jerome mcdonough new york...

27
Introduction to METS (Metadata Encoding and Transmission Standard) Jerome McDonough New York University [email protected]

Upload: chase-patterson

Post on 10-Dec-2015

223 views

Category:

Documents


1 download

TRANSCRIPT

Introduction to METS (Metadata Encoding and Transmission Standard)

Jerome McDonoughNew York University

[email protected]

What was MOA2?

Concept phaseWhite paper published by CLIR

Testbed phaseUse of ideas generated in the concept phase by real life participants (http://sunsite.berkeley.edu/moa2/)Included metadata capture DB, Java object browser, and MOA2 DTD

Who was MOA2?

MOA2 whitepaper

Hurley, Price-Wilkin, Proffitt, BesserMOA2 testbed participants

Cornell University LibraryNew York Public LibraryPenn State University LibraryStanford University LibraryUniversity of California, Berkeley Library

Why MOA2?

A common object format allows us to share the effort of developing tools/servicesA common object format ensures interoperability of digital library materials as they are exchanged between institutions (including vendors)

Transition to METS

Continuing need to share, archive & display digital objects but:Need more flexibility for varying descriptive and administrative metadataNeed to support audio/video/other data formats

Who is METS?Community-based development process

UC Berkeley, Harvard, Library of Congress, Michigan State University, METAe, Australian National Library, RLG, California Digital Library, Cornell, University of Virginia (not a complete list)….METS Editorial Board (UC, Harvard, LC, MSU, RLG, DCMI, MIT, NYU, OCLC, PFA, Stanford, Oxford, British Library, U. Alberta, Göttingen)

Maintenance Agency

Web hosting for developing standard and documentationListservs for METS community and editorial boardVocabulary/Profile Registries

The Library of Congress provides:

The METS FormatCreate a single document format for encoding digital library objects which can fulfill roles of SIP, AIP and DIP within the OAIS reference modelInitial scope limited to objects comprised of text, image, audio & video filesPromote interoperability of descriptive, administrative and technical metadata while supporting flexibility in local practice

Technical Components

Primary XML SchemaExtension SchemaControlled Vocabularies

METS XML Schema

METS Document

Header

Descript. MD

Admin. MD

File List

Link Struct.

Struct. Map

Behaviors

Structural Map

Object modeled as tree structure (e.g., book with chapters with subchapters….)Every node in tree can be associated with descriptive/administrative metadata and…Individual/multiple files (or portions thereof) orOther METS documents

Structural Map

<div type=“book” label=“Hunting of the Snark”><div type=“chapter” label=“Fit the First”>

<fptr>…</fptr></div><div type=“chapter” label=“Fit the Second”>

<fptr>…</fptr></div>…

</div>

Link Structure

Records all links between nodes in structural mapUses XLink/Xptr syntaxCaveat Encoder: make sure your structural map supports your link structure

Content Files Listing

Records file specific technical metadata (checksum, file size, creation date/time) as well as providing access to file contentFiles are arranged into groups, which can be arranged hierarchicallyFiles may be referenced (using Xlink) or contained within the METS document (in XML or as Base64 Binary)

Descriptive Metadata

Non-prescriptive/Multiple instancesDesc. metadata associated with entirety of METS object or subcomponentsDesc. metadata may be internal (XML or binary) or external (referenced by XLink) to METS document

Administrative Metadata

4 Types: Technical, Rights, Source Document, Digital ProvenanceNon-prescriptive/Multiple instancesassociated with entirety of METS object or subcomponentsmay be internal (XML/binary) or external (XLink) to METS document

METS Header

Metadata regarding METS documentCreation/Last Modification Date/Record StatusDocument Agents (Creator, Editor, Archivist, Preservation, Disseminator, Rights Owner, Custodian, etc.)Alternative Record ID values

Behaviors Section

Multiple Behaviors allowed for any METS documentBehaviors may operate on any part of METS documentMay provide information on API, service location, etc.

METS Structure

METS Structure

Oral History

Introduction

Q1 & Answer

Q2 & Answer

AIFF Master

TEI Tran-scription

AES/EBUTech. Metadata

Text Tech. Metadata

MODS Record

Time Code Link

IDREF Link

METS Extension Schema

Descriptive Metadata (DC, MARC, MODS)Administrative Metadata

Technical (image, text, audio, video)IP Rights (XrML, ODRL, MPEG 21, DRM Core)Digital Provenance (capture/migration)

Controlled Vocabularies

Known metadata typesKnown file address types (xptr, time code, etc.)METS profiles

Development Status

Version 1.3 Complete; Version 1.4 out soonFormally endorsed by Digital Library FederationRegistered with NISOEditorial Board working on further development of schema, extension schema, controlled vocabularies, registries, documentation and education

Development StatusHarvard Java Toolkit

http://hul.harvard.edu/mets/

CCS GmbH docWorkshttp://www.ccs-gmbh.de/index_e.html

DSpace, FEDORA, SRB, Greenstone (RSN), Cheshire 3 (also RSN)XSLT:

NYU Page turner & METS2SMIL

http://dlib.nyu.edu/metstools/ CDL MOA2METS converter

http://sunsite.berkeley.edu/mets/moa2mets/

MSU METS2SMIL

Next StepsBetter documentationMore Opening Days (all over the place)Tool development (particularly open source)Encourage development of METS ProfilesHelp spark extension schema development (video tech. metadata, IP rights, digital provenance)Work on controlled vocabulariesPromote interoperability with courseware systems (IMS & SCORM)

Why?

Further Info

METS Web Site: http://www.loc.gov/standards/mets

METS Community Mailing List: [email protected]

…or contact me at [email protected]