Page 1: An Introduction to Metadata and (some) Metadata Standards

Making Sense of Metadata, Society of Archivists EAD/Data Exchange SIG

London, Thursday 17 November 2005

Pete Johnston
Research Officer, UKOLN, University of Bath

www.bath.ac.uk

A centre of expertise in digital information management www.ukoln.ac.uk

UKOLN is supported by:

Page 2: An Introduction to Metadata and (some) Metadata Standards

• Metadata in action: an example
• What is metadata?
• Some metadata standards
• Current issues, challenges

Page 3: Metadata in action: an example

Page 4: A metadata-driven/metadata-dependent device!

Page 5: [Figure: mp3 player browse menus]

Playlists: Ambient Selection, Electro Selection
I Think About You; I Think About You (Geiger mix)

Albums: 76:14, Accelerator
I'm Ready; Yellow Kid

Artists: (Smog), A Tribe Called Quest
The Low End Theory

Genres: Acid, Ambient

Page 6: [Figure: "Now Playing" screen]

Now Playing

Herbst
Barbara Morgenstern & Robert Lippok
Seasons

Page 7: Simple metadata describing each mp3 file

• Track title
• Artist name
• Album title
• Sequence on album
• Genre
• Length
• Sequence in playlist

Used to find, select, organise, access files
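
The list above can be read as a small record structure. The following is a minimal sketch, not taken from the talk: the `Track` class, the helper functions and the sample records are all hypothetical, and "sequence in playlist" is assumed to be simply a track's position in a Python list.

```python
from dataclasses import dataclass
from itertools import groupby


@dataclass(frozen=True)
class Track:
    """Descriptive metadata for one mp3 file (fields follow the slide's list)."""
    title: str
    artist: str
    album: str
    album_seq: int   # sequence on album
    genre: str
    length_s: int    # length in seconds


def find(tracks, **criteria):
    """Find/select: keep tracks whose fields equal every given criterion."""
    return [t for t in tracks
            if all(getattr(t, k) == v for k, v in criteria.items())]


def albums_by_artist(tracks):
    """Organise: map each artist to the sorted list of their album titles."""
    ordered = sorted(tracks, key=lambda t: t.artist)
    return {
        artist: sorted({t.album for t in group})
        for artist, group in groupby(ordered, key=lambda t: t.artist)
    }


def album_in_order(tracks, album):
    """Access: the tracks of one album in playing order."""
    return sorted(find(tracks, album=album), key=lambda t: t.album_seq)


# Illustrative records only; every value here is made up.
LIBRARY = [
    Track("Track B", "Artist One", "First Album", 2, "Ambient", 300),
    Track("Track A", "Artist One", "First Album", 1, "Ambient", 245),
    Track("Track C", "Artist Two", "Second Album", 1, "Acid", 198),
]

# A playlist is assumed to be an ordered list of tracks.
PLAYLIST = [LIBRARY[2], LIBRARY[1]]
```

Every browse menu on the player (by artist, album, genre, playlist) is then just a different query over the same few fields.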

Page 8:

Page 9:

Page 10: http://www.last.fm/

Page 11:

Page 12: "Neighbours"

Page 13: "Similar", "Fans", "Tags"

Page 14: "Radio"

Page 15: http://www.bloglines.com/

Page 16: [Diagram]

Labels: MP3 Shop 1; MP3 Shop 2; Player; Transfer; Gracenote CDDB; MBTagger; MusicBrainz; Audioscrobbler; Last.fm; (Other)

Symbols used: "♫", "M"

Page 17: The mp3 example

• Track metadata obtained from network services
  – supplied by users
• Metadata embedded in mp3 file (ID3)
• Extracted/indexed by desktop mp3 player, portable mp3 player
  – discovery, management
• Used in "play" metadata posted to network services
  – basis for statistics, recommendation services, "collaborative filtering"

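As a rough illustration of how "play" metadata could feed a recommendation service, the sketch below counts how many listeners two artists share. It is a plain co-occurrence count under assumed inputs, not the method used by any service named in the talk; the `(user, artist)` record shape and both function names are hypothetical.

```python
from collections import Counter, defaultdict
from itertools import combinations


def co_play_counts(plays):
    """Count, for each pair of artists, how many users played both.

    `plays` is an iterable of (user, artist) records, a stand-in for the
    "play" metadata a player might post to a network service.
    """
    artists_by_user = defaultdict(set)
    for user, artist in plays:
        artists_by_user[user].add(artist)

    pair_counts = Counter()
    for artists in artists_by_user.values():
        # Sort so each pair is always stored in the same order.
        for a, b in combinations(sorted(artists), 2):
            pair_counts[(a, b)] += 1
    return pair_counts


def similar_artists(plays, artist):
    """Rank other artists by the number of listeners shared with `artist`."""
    scores = Counter()
    for (a, b), n in co_play_counts(plays).items():
        if a == artist:
            scores[b] += n
        elif b == artist:
            scores[a] += n
    return [name for name, _ in scores.most_common()]


# Made-up play records: (user, artist).
PLAYS = [
    ("u1", "A"), ("u1", "B"),
    ("u2", "A"), ("u2", "B"), ("u2", "C"),
    ("u3", "C"),
]
```

With these records, `similar_artists(PLAYS, "A")` ranks "B" ahead of "C", because two listeners of "A" also played "B" and only one also played "C". No user had to describe anything: the ranking falls out of metadata the player already emits.
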
Page 18: The mp3 example

• Metadata about different types of resources
  – Tracks, albums, artists, "plays", people…
• Metadata obtained from various sources
  – Created by different agents
• Metadata moving between different applications/services
• Metadata supporting multiple functions
• Effective (re)use of metadata
  – minimal user effort
  – "making (meta)data work harder" (Lorcan Dempsey)

Page 19: What is metadata?

Page 20: What is metadata? Some simple definitions

• ‘Structured data about data’.
  • Dublin Core Metadata Initiative FAQ, 2005
    – http://dublincore.org/resources/faq/
• Machine-understandable information about Web resources or other things.
  • Tim Berners-Lee, W3C, 1997
    – http://www.w3.org/DesignIssues/Metadata

Page 21: "Web resources or other things"

– HTML documents
– digital images
– databases
– books
– museum objects
– archival records
– metadata records

– Web sites
– collections
– services
– physical places
– people
– organisations
– “works”
– formats
– concepts
– events

• Metadata might be "about"… anything!

Page 22: What is metadata? Towards a "functional" view

• Data associated with objects which relieves their potential users of having to have full advance knowledge of their existence or characteristics.
  • Lorcan Dempsey & Rachel Heery, "Metadata: a current view of practice and issues", 1998
    – http://www.ukoln.ac.uk/metadata/publications/jdmetadata/

Page 23: What is metadata? Towards a "functional" view

• Structured data about resources that can be used to help support a wide range of operations.
  • Michael Day, "Metadata in a Nutshell", 2001
    – http://www.ukoln.ac.uk/metadata/publications/nutshell/

Page 24: What might metadata "say"?

What is this called?

What is this about?

Who made this?

When was this made?

Where do I get (a copy of) this?

When does this expire?

What format does this use?

Who is this intended for?

What does this cost?

Can I copy this? Can I modify this?

What are the component parts of this?

What else refers to this?

What did "users" think of this?

(etc!)

Page 25: What operations/functions?

• resource disclosure & discovery
• resource retrieval, use
• resource management, including preservation
• verification of authenticity
• intellectual property rights management
• commerce
• content-rating
• authentication and authorisation
• personalisation and localisation of services
• (etc!)

Page 26: What operations/functions?

• Different functions: different metadata
• Metadata (and metadata standards) sometimes classified according to function
  – Descriptive: primarily for discovery, retrieval
  – Administrative: primarily for management
  – Structural: relationships between component parts of resources
  – Contextual: relationships between resources
• No “one size fits all” solution!

Page 27: Where is metadata?

Metadata embedded in resource

[Diagram: Resource 1, with the metadata inside it]
Creator = J Smith
Date = 2001-11-05
Title = Report

e.g. ID3 metadata in MP3; meta elements in HTML docs; TEI header; summary properties in word processor docs; IPTC, EXIF data in image formats

Can resource support embedding of metadata?

Does metadata creator have write access to resource?

Can metadata consumer extract embedded metadata?

What happens when resource deleted?

Metadata about aggregates of resources?

Metadata about people, places, concepts?

Page 28: Where is metadata?

Metadata record as separate object; record identifier embedded in resource

[Diagram: Resource 1 pointing to Metadata rec 1]
Resource 1: Metadata rec = 1
Metadata rec 1: Creator = J Smith; Date = 2001-11-05; Title = Report

e.g. link rel="meta" elements in HTML docs

Metadata record may be remote from resource

Can resource support embedding of link?

Does metadata creator have write access to resource?

Can metadata consumer extract link to metadata record?

What happens when resource deleted?

Metadata about aggregates of resources?

Metadata about people, places, concepts?
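
A consumer that wants to follow such a link first has to find it. The sketch below uses Python's standard `html.parser` to collect the `href` of every `link` element whose `rel` includes `meta`; the class name, function name and the example URL are hypothetical.

```python
from html.parser import HTMLParser


class MetaLinkFinder(HTMLParser):
    """Collect href values of <link rel="meta" ...> elements."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        if tag != "link":
            return
        attributes = dict(attrs)
        # rel may hold several space-separated values.
        rel_values = (attributes.get("rel") or "").lower().split()
        if "meta" in rel_values and attributes.get("href"):
            self.hrefs.append(attributes["href"])


def metadata_links(html_text):
    """Return the metadata record locations referenced by an HTML document."""
    finder = MetaLinkFinder()
    finder.feed(html_text)
    return finder.hrefs


# Hypothetical page; the record URL is a placeholder.
PAGE = (
    '<html><head><title>Report</title>'
    '<link rel="meta" href="http://example.org/records/1">'
    '<link rel="stylesheet" href="style.css">'
    '</head><body></body></html>'
)
```

Retrieving and parsing the record at that location is a separate step, which is exactly why the record can live remotely from the resource.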

Page 29: Where is metadata?

Metadata record as separate object; resource identifier in metadata record

[Diagram: Metadata rec 1 pointing to Resource 1]
Metadata rec 1: Creator = J Smith; Date = 2001-11-05; Title = Report; Doc = 1

e.g. (lots!)

Metadata record may be remote from resource

Does not require embedding of metadata or link

Does not require metadata creator to have write access to resource

Metadata record created independently of resource – possibly multiple records

Metadata consumer uses metadata records independently of resource

Metadata record may persist after resource deleted

Metadata record can describe anything (with identifier…)
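
The properties listed on this slide can be sketched as a store of records keyed by resource identifier. This is an illustration only: `MetadataStore`, its methods and the `doc:1` identifier are hypothetical, and a real system would use a database or repository rather than an in-memory dictionary.

```python
from collections import defaultdict


class MetadataStore:
    """Records held apart from resources, keyed by resource identifier."""

    def __init__(self):
        self._records = defaultdict(list)

    def add(self, resource_id, source, **properties):
        """Add one record about `resource_id`; several sources may add their own."""
        record = {"doc": resource_id, "source": source, **properties}
        self._records[resource_id].append(record)
        return record

    def about(self, resource_id):
        """All records describing `resource_id` (empty list if none)."""
        return list(self._records.get(resource_id, []))


# The resources themselves live elsewhere; here, a plain dict of bytes.
resources = {"doc:1": b"...report contents..."}

store = MetadataStore()
store.add("doc:1", "archive", creator="J Smith", date="2001-11-05", title="Report")
store.add("doc:1", "reader", rating=4)   # a second, independent record

# Deleting the resource does not touch the records about it.
del resources["doc:1"]
```

Neither `add` call needed write access to the resource, and both records remain available after the resource is gone.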

Page 30: Metadata as managed resource

• Metadata
  – may be used independently of resource
  – may grow/change independently of resource
  – may be used in different subsets, multiple formats
  – may be the subject of metadata!
  – requires management
• Metadata typically stored in some form of database, repository
• Exposed/exported as required

Page 31: Metadata as managed resource

Doc   Creator   Date         Title
1     J Smith   2001-11-05   Report
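
The table above maps directly onto a database table. The following sketch uses Python's built-in `sqlite3` with an in-memory database; the table name, the `export_subset` helper and the idea of whitelisting column names are assumptions made for illustration.

```python
import sqlite3

COLUMNS = ("doc", "creator", "date", "title")


def open_store():
    """Create an in-memory database holding one metadata table."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE metadata ("
        " doc INTEGER PRIMARY KEY,"
        " creator TEXT, date TEXT, title TEXT)"
    )
    return conn


def export_subset(conn, doc, fields):
    """Expose only the requested columns of one record as a dict."""
    # Column names are interpolated into SQL, so accept known names only.
    chosen = [f for f in fields if f in COLUMNS]
    if not chosen:
        raise ValueError("no known fields requested")
    row = conn.execute(
        f"SELECT {', '.join(chosen)} FROM metadata WHERE doc = ?", (doc,)
    ).fetchone()
    return dict(zip(chosen, row)) if row else None


conn = open_store()
conn.execute(
    "INSERT INTO metadata VALUES (?, ?, ?, ?)",
    (1, "J Smith", "2001-11-05", "Report"),
)
```

`export_subset(conn, 1, ["title", "creator"])` returns only those two fields, which is one way to read "exposed/exported as required": the managed record is the master copy, and each consumer receives the subset it needs.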

Page 32: Who/what creates metadata?

• Information professionals ("cataloguers")
• Resource creators
• Resource managers
• Resource distributors/publishers
• Indexing/abstracting services (and similar)
• Resource users
• Software applications
• Probably others I've forgotten…

Page 33: User-created metadata

• Growing interest in user-created metadata
  – user annotation, ratings, comments, "reviews"
    • e.g. Amazon, OCLC OpenWorldCat
  – "tagging", folksonomy
    • e.g. Flickr, del.icio.us
• Capture user perceptions of resources
• Capture user knowledge of resources
• Questions of authority, accuracy, trust, etc

Page 34: Application-captured/generated metadata

• Human metadata creation costs time/effort/money
  – "experts" cost even more!
• Software applications can obtain metadata from
  – operating system, Web server etc
    • size, MIME types etc
  – resource itself
    • email headers etc
    • metadata created by authoring applications (e.g. MS Word)
    • automated analysis of resource content (e.g. citation analysis, keyword extraction, automated classification)
  – usage records, transaction logs
    • e.g. people who bought/used/played this also bought these
  – "joining up" metadata from different sources
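
Two of the sources listed above can be tapped with the Python standard library alone. This is a sketch under stated assumptions: the function names and returned field names are hypothetical, and the MIME type is only guessed from the file name, so it may be `None` or wrong.

```python
import mimetypes
import os
from datetime import datetime, timezone
from email import message_from_string


def file_metadata(path):
    """Metadata the operating system already holds about a file."""
    info = os.stat(path)
    mime_type, _encoding = mimetypes.guess_type(path)
    modified = datetime.fromtimestamp(info.st_mtime, tz=timezone.utc)
    return {
        "size_bytes": info.st_size,
        "modified": modified.isoformat(),
        "mime_type": mime_type,   # guessed from the file name, may be None
    }


def email_metadata(raw_message):
    """Metadata carried by the resource itself: selected email headers."""
    msg = message_from_string(raw_message)
    # Missing headers come back as None.
    return {name: msg[name] for name in ("From", "Date", "Subject")}


def join_up(*sources):
    """'Joining up': merge dicts from several sources, earlier ones win."""
    merged = {}
    for source in sources:
        for key, value in source.items():
            merged.setdefault(key, value)
    return merged
```

None of this required a person to type anything, which is the point of the slide: capture what software can supply, and spend human effort only where it adds something.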

Page 35: Some metadata standards

Page 36: Metadata standards

• Typically defined by "resource management communities"
  – Different traditions, perspectives, functional requirements
• Typically comprise
  – A "conceptual model" (sometimes not explicit)
  – A set of named components ("terms", "elements" etc) and documentation on their meaning and use
  – A specification of how to represent a metadata instance in a digital format (binding)
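
The three parts of a standard can be told apart in a toy example. Everything below is invented for illustration and belongs to no real standard: the model is assumed to be "one resource described by a flat set of term/value pairs", `TERMS` is the vocabulary with its documentation, and `to_json` / `to_xml` are two alternative bindings of the same description.

```python
import json
import xml.etree.ElementTree as ET

# Named components ("terms") with documentation of their meaning.
TERMS = {
    "creator": "Agent primarily responsible for making the resource",
    "date": "Date the resource was made, as YYYY-MM-DD",
    "title": "Name given to the resource",
}


def validate(description):
    """Conceptual model (assumed): a flat mapping of known terms to values."""
    unknown = set(description) - set(TERMS)
    if unknown:
        raise ValueError(f"terms not in the standard: {sorted(unknown)}")
    return description


def to_json(description):
    """Binding 1: serialise the description as a JSON object."""
    return json.dumps(validate(description), sort_keys=True)


def to_xml(description):
    """Binding 2: serialise the same description as XML elements."""
    root = ET.Element("record")
    for term in sorted(validate(description)):
        ET.SubElement(root, term).text = description[term]
    return ET.tostring(root, encoding="unicode")


DESCRIPTION = {"creator": "J Smith", "date": "2001-11-05", "title": "Report"}
```

The description stays the same while the binding changes, which is why a standard's terms and its serialisation format are worth keeping separate when comparing standards.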

Page 37: Bibliographic Metadata standards

• Machine-Readable Catalogue (MARC)
  – primary library cataloguing standard
  – supports discovery and management of library resources
  – maintained by Library of Congress
• Metadata Object Description Schema (MODS)
  – represents subset of MARC
  – XML Schema
  – maintained by Library of Congress
• ONIX
  – information provided by publishers to retailers
  – some use of ONIX to enhance library catalogue records
  – maintained by EDItEUR/Book Industry Communication

Page 38: Archival/Records Management Metadata standards

• ISAD(G)
  – not in itself machine-processable?
  – but used as basis of database schemas in e.g. CALM
• Encoded Archival Description (EAD)
  – metadata about archival records (and aggregations of records)
  – may include some metadata about organisations, individuals
• Encoded Archival Context (EAC)
  – metadata about organisations, individuals
• Records Management Metadata e.g.
  – National Archives ERMS Metadata Standard

Page 39: Museum Metadata standards

• SPECTRUM
  – Museum documentation standard
  – Describes
    • Procedures
    • Information requirements ("units of information")
      – Metadata about objects, events, agents etc
  – CIMI XML Schema for SPECTRUM
  – Maintained by mda

Page 40: Image Metadata standards

• VRA Core
  – "works of visual culture as well as the images that document them"
  – Image as visual representation of Work
  – maintained by Visual Resources Association
• NISO Data Dictionary of Technical Metadata for Digital Still Images
  – To facilitate technical interoperability, also management, curation/preservation
  – Encoded/serialised using MIX XML Schema

Page 41: Government Metadata standards

• UK e-Government Metadata Standard
  – based on Dublin Core
  – also incorporates components from NA ERMS
  – specifies constraints on values e.g. Integrated Public Sector Vocabulary
  – primarily to support resource discovery, retrieval/access, some records management
  – eGMS v3.0 provides large set of terms
    • in practice, deployed in subsets
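
Two ideas from this slide, deploying a subset of terms and constraining values to a vocabulary, can be sketched with plain Dublin Core embedded in HTML using the `schema.DC` link and `DC.`-prefixed `meta` names. The `PROFILE` below is invented for illustration: it is not the eGMS element set, and the three subject values are a stand-in, not the Integrated Public Sector Vocabulary.

```python
from html import escape

DC_NAMESPACE = "http://purl.org/dc/elements/1.1/"

# Hypothetical profile: the Dublin Core elements one deployment uses,
# each with an optional controlled vocabulary (None = free text).
PROFILE = {
    "title": None,
    "creator": None,
    "date": None,
    "subject": {"Archives", "Libraries", "Museums"},   # stand-in vocabulary
}


def to_meta_elements(description, profile=PROFILE):
    """Render the profiled subset of `description` as HTML head elements."""
    lines = [f'<link rel="schema.DC" href="{DC_NAMESPACE}">']
    for term, value in description.items():
        if term not in profile:
            continue   # term exists in the description but is not deployed
        allowed = profile[term]
        if allowed is not None and value not in allowed:
            raise ValueError(f"{value!r} is not an allowed value for {term}")
        lines.append(
            f'<meta name="DC.{escape(term)}" content="{escape(value)}">'
        )
    return "\n".join(lines)


DESCRIPTION = {
    "title": "Report",
    "creator": "J Smith",
    "date": "2001-11-05",
    "format": "text/html",   # not in the profile, so it is left out
}
```

The profile does two jobs at once: it decides which terms are emitted, and it rejects values that fall outside the agreed vocabulary.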

Page 42: Learning Metadata standards

• IEEE Learning Object Metadata (LOM)
  – To support the disclosure/discovery and use/reuse of "learning objects"
  – UK LOM Core as "application profile" of LOM
• IMS Specifications
  – Learner Information Profile (people)
  – Learning Design (learning activities etc)
  – Enterprise (groups/classes etc)
  – Resource List Interoperability (reading lists etc)
  – etc!

Page 43: Multimedia Metadata standards

• MPEG-7
  – to describe the content of audio-video streams
  – "making audio-visual material as searchable as text"
  – designed to be incorporated into the production process
    • create metadata at various stages
  – extensible through the use of a Description Definition Language (DDL)
  – metadata may be embedded in resource or located separately

Page 44: Some current challenges

Page 45: Metadata standards & interoperability

• Standardisation (mainly) within communities/domains…
• … but on the Web
  – resources/metadata moving between/across "communities"
  – services operating on metadata from multiple "communities"

Page 46: Metadata standards & interoperability

• How to minimise costly, complex, lossy mappings/translations?
  – The "railroad gauge dilemma"
    • (Stuart Weibel, "Border Crossings", D-Lib, Jul 2005)
• How to maximise effective reuse of existing metadata?
  – How to realise aspirations to extensibility, modularity?
• Does the W3C's Resource Description Framework (RDF) offer a solution?
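
A small crosswalk shows why mappings between standards tend to be lossy. The source field names below are invented, loosely in the spirit of an archival description, and the targets are simple Dublin Core element names; the mapping itself is a hypothetical example, not a published crosswalk.

```python
# Hypothetical crosswalk from a richer local schema to simple Dublin Core.
CROSSWALK = {
    "ref_code": "identifier",
    "unit_title": "title",
    "creator_name": "creator",
    "creation_date": "date",
    "scope_and_content": "description",
    "custodial_history": "description",   # two source fields collapse into one
}


def crosswalk(record, mapping=CROSSWALK):
    """Map `record` to target terms; return (mapped, dropped_fields)."""
    mapped = {}
    dropped = []
    for field, value in record.items():
        target = mapping.get(field)
        if target is None:
            dropped.append(field)   # no equivalent in the target: lost
        else:
            mapped.setdefault(target, []).append(value)
    return mapped, dropped


# Made-up source record.
RECORD = {
    "ref_code": "X/1",
    "unit_title": "Report",
    "scope_and_content": "Annual report",
    "custodial_history": "Transferred 2001",
    "level": "item",
}
```

Running `crosswalk(RECORD)` drops `level` entirely and merges two distinct fields into `description`, so the original record cannot be rebuilt from the output. That irreversibility is the cost the slide is pointing at.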

Page 47: Summary

• Metadata is used almost everywhere
• Metadata enables people and software applications to do things
  – Not only about "discovery"
  – Different functions require different metadata
• Metadata creation is potentially costly
  – Clarify functional requirements
  – Exploit existing sources
• Many metadata standards established/emerging
• But challenges remain in working across standards, using standards in combination

Page 48: P.S.

Page 49: http://base.google.com/
