Europeana Metadata - ESE to EDM. Robina Clayphan, Interoperability Manager, Europeana Foundation. Collections Trust, London, 28 June 2010


Page 1: ESE to EDM 2010

Europeana Metadata - ESE to EDM

Robina Clayphan

Interoperability Manager, Europeana Foundation

Collections Trust, London, 28 June 2010

Page 2: ESE to EDM 2010

Introduction

• The current metadata schema

• Europeana Semantic Elements - ESE

• Content ingestion

• The future data model

• Europeana Data Model - EDM

Page 3: ESE to EDM 2010

Europeana

Europeana brings together and makes available digital content from:

• Four cultural heritage sectors

• Museums, Archives, Libraries, Audio-visual archives

• Twenty-nine countries

• EU plus Norway and Switzerland

• Twenty-six languages

• Four types of material

• Image, sound, video, text

… need for a metadata lingua franca …

Page 4: ESE to EDM 2010

ESE V3.2

Europeana Semantic Elements (ESE) V3.2 was developed for the prototype

• A Dublin Core-based application profile

• Cross-domain schema for heterogeneous data

• Not intended to capture the full semantics of providers’ data

• 37 Dublin Core terms – used principally to describe the objects

• 12 Europeana coined terms – used to support portal functionality

• Consistent data is needed for the portal to work

Page 5: ESE to EDM 2010

The Dublin Core elements

Element: Refinements

Title: Alternative

Creator

Subject

Description: TableOfContents

Publisher

Contributor

Date: Created; Issued

Type

Format: Extent; Medium

Identifier

Source

Language

Relation: isVersionOf; hasVersion; isReplacedBy; replaces; isRequiredBy; requires; isPartOf; hasPart; isReferencedBy; references; isFormatOf; hasFormat; conformsTo

Coverage: Spatial; Temporal

Rights

Provenance

Page 6: ESE to EDM 2010

Europeana elements

Element | Who is responsible | Function
europeana:isShownAt or europeana:isShownBy | Provider must provide at least one of these elements - both if applicable. | URL. Links to object
europeana:object | Provider - if appropriate to the data | URL. Source of thumbnail
europeana:provider | Provider must provide this element. | Controlled list. Facet
europeana:type | Provider must provide this element. | Controlled list. Facet
europeana:unstored | Provider – only if appropriate to your data. | Text string. Container element
europeana:country | Europeana | Facet
europeana:hasObject | Europeana | System use
europeana:language | Europeana | Facet
europeana:uri | Europeana | System Identifier
europeana:usertag | Europeana | User provided tags (future)
europeana:year | Europeana | Facet, timeline

Europeana is responsible for providing all of the elements from europeana:country to europeana:year.
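
The provider-side rules listed above (at least one of the two link elements, plus a mandatory provider and type) can be sketched as a small check. This is only an illustration: the function name, the dictionary layout and the example values are assumptions made here, not part of the ESE specification.

```python
# Hypothetical helper: a record is modelled as a dict mapping
# element names to lists of values (an assumption for this sketch).
def missing_provider_elements(record):
    """Return the names of provider-supplied elements that are missing."""
    problems = []
    # At least one of the two link elements must be present.
    if not (record.get("europeana:isShownAt") or record.get("europeana:isShownBy")):
        problems.append("europeana:isShownAt or europeana:isShownBy")
    # These two elements must always be supplied by the provider.
    for name in ("europeana:provider", "europeana:type"):
        if not record.get(name):
            problems.append(name)
    return problems


# Invented example record: it has a link and a type but no provider.
record = {
    "dc:title": ["Example title"],
    "europeana:isShownAt": ["http://example.org/object/1"],
    "europeana:type": ["Image"],
}
print(missing_provider_elements(record))
```

Elements that Europeana itself supplies (country, language, year and so on) are deliberately left out of the check, since providers are not expected to deliver them.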

Page 7: ESE to EDM 2010
Page 8: ESE to EDM 2010
Page 9: ESE to EDM 2010
Page 10: ESE to EDM 2010

Normalised elements

• Language

• ISO 639-1 standard two-character code.

• Country

• ISO 3166 standard

• Year

• Four digit year from Gregorian calendar (YYYY)

• Generated where possible from date supplied in <dc:date>

• Provider

• Controlled list of names, in the language of provider

• Type

• Controlled list (in English) of four types: Text, Image, Sound, Video

• mapped from the diverse types used in source data (by provider)
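
Two of the normalisations above, deriving a four-digit year from a date string and mapping source types onto the four Europeana types, might look roughly like this. The mapping table and the year rule are assumptions for illustration; the actual rules are defined in the Normalisation Guidelines and are not reproduced here.

```python
import re

# Hypothetical mapping from provider type strings to the four Europeana types.
TYPE_MAP = {
    "photograph": "Image",
    "painting": "Image",
    "manuscript": "Text",
    "book": "Text",
    "recording": "Sound",
    "film": "Video",
}

# A run of exactly four digits that is not part of a longer number.
YEAR_RE = re.compile(r"(?<!\d)(\d{4})(?!\d)")


def normalise_type(source_type):
    """Map a provider type string to a Europeana type, or None if unknown."""
    return TYPE_MAP.get(source_type.strip().lower())


def extract_year(dc_date):
    """Return the first standalone four-digit year in a date string, or None."""
    match = YEAR_RE.search(dc_date)
    return match.group(1) if match else None


print(normalise_type(" Painting "), extract_year("1889-05-12"))
```

Returning None rather than guessing reflects the "where possible" wording on the slide: a date such as "19th century" simply yields no year.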

Page 11: ESE to EDM 2010

Mapping and Normalisation

Three key reference documents for providers:

• ESE Specification V3.2

• Normalisation Guidelines V1.2

• ESE V3.2 XML schema + explanatory text

All available from the “Provide Content” section of the

Europeana Group pages:

http://group.europeana.eu/web/guest/provide_content

Page 12: ESE to EDM 2010

Additional elements for Rhine

Currently under development:

• europeana:dataProvider

• The name of the content provider

• europeana:rights

• Indication of a licence type that covers the digital content and the thumbnail

• Waiting for all the interdependencies to be worked through

Page 13: ESE to EDM 2010
Page 14: ESE to EDM 2010

Introduction

• The current metadata schema

• Europeana Semantic Elements - ESE

• Content ingestion

• The future data model

• Europeana Data Model - EDM

Page 15: ESE to EDM 2010

Content Ingestion

• Europeana has provided a Content Checker tool which has two parts:

• The Content Ingestor

• Allows uploading of a data set

• Validation against the ESE V3.2 XML schema

• Importing the data into the database

• Indexing of data

• Caching of thumbnails

• The Test Portal

• Separate from the operational portal

• Allows provider to search for uploaded data
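
A provider could run a rough local pre-check before uploading a data set. The sketch below is not the Content Checker and does not validate against the ESE XML schema; it only tests well-formedness and the controlled list of types, using the Python standard library. The wrapper element names (metadata, record), the ESE namespace URI and the sample data are assumptions made for this example.

```python
import xml.etree.ElementTree as ET

DC_NS = "http://purl.org/dc/elements/1.1/"
ESE_NS = "http://www.europeana.eu/schemas/ese/"  # assumed; check the ESE schema
ALLOWED_TYPES = {"Text", "Image", "Sound", "Video"}

# Invented two-record data set; the second record uses an unmapped type.
SAMPLE = f"""<metadata xmlns:dc="{DC_NS}" xmlns:europeana="{ESE_NS}">
  <record>
    <dc:title>Example title</dc:title>
    <europeana:type>Image</europeana:type>
  </record>
  <record>
    <dc:title>Another title</dc:title>
    <europeana:type>Photograph</europeana:type>
  </record>
</metadata>"""


def precheck(xml_text):
    """Return (record count, list of (index, type) pairs outside the controlled list)."""
    root = ET.fromstring(xml_text)  # raises ET.ParseError if not well-formed
    records = root.findall("record")
    bad_types = []
    for index, record in enumerate(records):
        type_el = record.find(f"{{{ESE_NS}}}type")
        value = type_el.text.strip() if type_el is not None and type_el.text else None
        if value not in ALLOWED_TYPES:
            bad_types.append((index, value))
    return len(records), bad_types


print(precheck(SAMPLE))
```

Full schema validation needs a schema-aware validator, which the standard library does not provide, so this check can only catch a subset of what the ingestor would report.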

Page 16: ESE to EDM 2010

Content Ingestor

Select “new data set” - the ingestor automatically creates a new ID – “null05” in this example

Page 17: ESE to EDM 2010

Content Ingestor - upload

Page 18: ESE to EDM 2010

Content Ingestor - validate

Page 19: ESE to EDM 2010

Index

Page 20: ESE to EDM 2010

Test Portal - search

Page 21: ESE to EDM 2010

Introduction

• The current metadata schema

• Europeana Semantic Elements - ESE

• Content ingestion

• The future data model

• Europeana Data Model - EDM

Page 22: ESE to EDM 2010

Looking forward

• Rhine release – July to September 2010

• Some ESE-related changes

• Addition of europeana:dataProvider

• Addition of europeana:rights

• Danube release – April 2011

• Incremental move to Europeana Data Model

• Features will depend on outcome of current prototyping work

Page 23: ESE to EDM 2010

Planning: from April 2010 till December 2010

Page 24: ESE to EDM 2010

Danube requirements specification

• Explore

• New ways of searching and browsing content: e.g. map searches, virtual exhibitions, improved timeline, extended facets and multi-lingual support.

• Re-Use

• Search API, Linked Data

• Interact

• Tagging, more social media features, user generated content coming in from partners

• Under the hood

• New richer data model (EDM), Metadata Service Registry, (External) Service Registry, Resolution Discovery Service (PIDs)

• Experiment

• The ThoughtLab will showcase new services developed by our partners.

Page 25: ESE to EDM 2010

Rationale of EDM

• Precursor: ESE (Europeana Semantic Elements)

• used in 2008 version of Europeana

• represents lowest common denominator for object metadata

• converts datasets to a Dublin Core-like standard

• forces interoperability

• major drawback: original metadata is lost

• EDM goals

• preserve original data while still allowing for interoperability

• Semantic Web representation

• Semantic linking between objects

Page 26: ESE to EDM 2010

EDM requirements

1. Distinction between the real world object (painting, book, program) and its digital representation

2. Distinction between the object and the metadata record describing the object.

3. Allow multiple records for same object, containing potentially contradictory statements about an object

4. Support for objects that are composed of other objects

5. Standard metadata format that can be specialized

6. Standard vocabulary format that can be specialized

7. EDM should be based on existing standards

• “not yet another standard” !

Page 27: ESE to EDM 2010

EDM basics

• OAI ORE for organization of metadata about an object

• Requirements 1-4

• Dublin Core for metadata representation

• Requirement 5

• SKOS for vocabulary representation

• Requirement 6

• OAI ORE, Dublin Core and SKOS together fulfil Requirement 7

Page 28: ESE to EDM 2010

The General Picture

Semantic Network

Networked object representations

Page 29: ESE to EDM 2010

The Class Taxonomy (from V5.0+)

Proxy

Page 30: ESE to EDM 2010

The Property Taxonomy (without ESE)

Page 31: ESE to EDM 2010

The Example – 1: from Direction des Musees de France

Page 32: ESE to EDM 2010

The Example – 2 from the Louvre


Page 33: ESE to EDM 2010

Aggregation organizes data of a single provider: example 1

[Diagram labels: aggregation, digital representation, object, provenance, metadata]

Page 34: ESE to EDM 2010

Proxy: metadata record for an object

[Diagram labels: proxy, object, metadata]

Page 35: ESE to EDM 2010

Multiple providers = multiple aggregations (the same object)

[Diagram labels: aggregation of DMF, aggregation of Louvre]

Page 36: ESE to EDM 2010

Europeana is “just” a special provider with processed/enriched metadata

[Diagram labels: Europeana aggregation, enriched metadata, landing page]

Page 37: ESE to EDM 2010

Read about it

• EDM Primer

http://www.few.vu.nl/~aisaac/edm/EDM_Primer_100401.pdf

Thank you!


Page 38: ESE to EDM 2010

Advanced modeling in EDM

• See the documentation

• Relations between “provided” objects

• Part-whole links for complex objects

• Derivation and versioning relations

• Predefined classes for person, place, time and event

Page 39: ESE to EDM 2010

Planning: from April 2010 till December 2010

Page 40: ESE to EDM 2010

Priorities for Danube

• Improved Access

• Contextualization

• Content reuse

• (User) participation

• Data Enrichment

• Ingestion Infrastructure

• Repository Infrastructure

• PR & Projects Activity

• Experimentation

Page 41: ESE to EDM 2010

EDM representation: RDF standard

• Ovals are web resources with a URL

• Arcs are properties linking resources to other resources or to literals

• Resources belong to classes

• RDF model can be specialized using subclass and subproperty definitions
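
The points above can be illustrated with a toy triple store: resources and literals linked by properties, with one subproperty rule applied at query time. The object URI, the ex:Painting class and the values are invented for the example; the only fact relied on from DCMI Metadata Terms is that dcterms:spatial and dcterms:temporal specialize dcterms:coverage. The lookup handles a single level of specialization only.

```python
# Each triple is (subject, property, value); all data here is invented.
OBJECT = "http://example.org/object/1"
TRIPLES = {
    (OBJECT, "rdf:type", "ex:Painting"),
    (OBJECT, "dcterms:title", "Example title"),
    (OBJECT, "dcterms:spatial", "Paris"),
}

# Subproperty definitions: specialized property -> more general property.
SUBPROPERTY_OF = {
    "dcterms:spatial": "dcterms:coverage",
    "dcterms:temporal": "dcterms:coverage",
}


def values(subject, prop):
    """Values of prop for subject, including those stated via a direct subproperty."""
    wanted = {prop} | {p for p, parent in SUBPROPERTY_OF.items() if parent == prop}
    return sorted(o for s, p, o in TRIPLES if s == subject and p in wanted)


# A query for the general property also finds the specialized statement.
print(values(OBJECT, "dcterms:coverage"))
```

This is the mechanism that lets a specialized provider property still answer a query phrased in terms of the general one, which is how original data can be kept without giving up interoperability.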

Page 42: ESE to EDM 2010

Dublin Core

• EDM uses the latest version of DCMI Metadata Terms

http://dublincore.org/documents/dcmi-terms/

• Specified with an RDF model

• Specialization of 15 original DC elements

• e.g. dcterms:spatial and dcterms:temporal specialize dcterms:coverage

• Can be specialized itself

• see requirement

Page 43: ESE to EDM 2010

SKOS: vocabulary publication on the Web

• W3C standard

http://www.w3.org/TR/skos-primer/

• Adopted by large institutions such as Library of Congress

• Specified with an RDF model

• Can be specialized itself

Page 44: ESE to EDM 2010

OAI ORE: Open Archives Initiative Object Reuse & Exchange

• Specification:

http://www.openarchives.org/ore/1.0/toc.html

• Specified with an RDF model

• Four key notions (RDF classes)

• Object: the book/painting/program being described

• Aggregation: organizes object information from a particular provider (museum, archive, library)

• Digital representation: some digital form of the object with a Web address

• Proxy: the metadata record for the object
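
How the four notions fit together can be sketched with plain Python data classes, loosely following the two-provider example from the talk. Class and field names, URIs and metadata values are assumptions for illustration, not EDM or ORE identifiers; the object itself is represented only by its URI.

```python
from dataclasses import dataclass, field


@dataclass
class Proxy:
    """The metadata record for the object, as stated by one provider."""
    metadata: dict


@dataclass
class Aggregation:
    """Organizes one provider's information about one object."""
    provider: str
    object_uri: str
    proxy: Proxy
    digital_representations: list = field(default_factory=list)


def statements_about(object_uri, aggregations, prop):
    """Collect every provider's value for prop, keeping contradictions side by side."""
    return {
        a.provider: a.proxy.metadata[prop]
        for a in aggregations
        if a.object_uri == object_uri and prop in a.proxy.metadata
    }


# Two providers describe the same object; the values are placeholders.
PAINTING = "http://example.org/object/painting-1"
aggregations = [
    Aggregation("DMF", PAINTING, Proxy({"dc:title": "Title as recorded by DMF"}),
                ["http://example.org/dmf/painting-1.jpg"]),
    Aggregation("Louvre", PAINTING, Proxy({"dc:title": "Title as recorded by the Louvre"}),
                ["http://example.org/louvre/painting-1.jpg"]),
]
print(statements_about(PAINTING, aggregations, "dc:title"))
```

Because each statement stays attached to its own proxy, two providers can disagree about the same object without either record overwriting the other.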