Chiara Latronico, Europeana Cloud - Ingestion Clinic, The European Library


Upload: the-european-library

Post on 29-Aug-2014


Page 1: Chiara Latronico,Europeana Cloud - Ingestion Clinic, The European Library
Page 2:

europeana cloud

Ingestion Clinic

Chiara Latronico, Operations Officer, The European Library

Marian Lefferts, Executive Manager, CERL – WP4 Leader

Europeana Cloud Ingestion Clinic: 19-21 June, 3 July, 2013

Page 3:

Agenda

Ingestion process step by step
Ingestion plan broken down per provider
Rights documentation (Europeana Pro)
Other topics:
  Thumbnails
  De-duplication
  Sets and subsets
  Catalogue records vs digital objects
  Collection descriptions
Providers' experience and questions

Page 4:

The European Library Portal

Page 5:

The European Library: Ingestion Workflow

Preparation Work
  Content ingestion questionnaire
  Ingestion plan
  Sample records to ingest
  Datasets ready for harvesting
  Create case in CRM: case # to provider

Step by Step
  Harvest metadata
  Enhance metadata
  Index in acceptance portal
  Communicate with data provider
  Live index = live portal
  Deliver to Europeana
  Enhance and publish in Europeana
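The step-by-step part of this slide can be read as an ordered sequence of stages. The following is a minimal sketch only: the stage names are taken from the slide wording, while the `Stage` enum, the `next_stage` helper and the rule that a dataset waits for provider approval are illustrative assumptions, not a description of how UIM or the CRM track a dataset.

```python
from enum import IntEnum
from typing import Optional


class Stage(IntEnum):
    """Stages named after the slide; the enum itself is hypothetical."""
    HARVEST_METADATA = 1
    ENHANCE_METADATA = 2
    INDEX_IN_ACCEPTANCE = 3
    COMMUNICATE_WITH_PROVIDER = 4
    LIVE_INDEX = 5
    DELIVER_TO_EUROPEANA = 6
    PUBLISH_IN_EUROPEANA = 7


def next_stage(current: Stage, provider_approved: bool = False) -> Optional[Stage]:
    """Return the stage that follows `current`, or None after the last one.

    Assumption: a dataset stays at COMMUNICATE_WITH_PROVIDER until the
    data provider has approved it.
    """
    if current is Stage.COMMUNICATE_WITH_PROVIDER and not provider_approved:
        return current
    if current is Stage.PUBLISH_IN_EUROPEANA:
        return None
    return Stage(current + 1)
```

Modelling the approval as an explicit flag keeps the waiting step visible: nothing moves to the live index until the provider has signed off.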

Page 6:

The European Library: System Architecture

Page 7:

Harvesting: Repox

Page 8:

Harvesting: Repox

Page 9:

Ingestion: UIM Loading

Page 10:

Ingestion: UIM Validation

Page 11:

Ingestion: UIM Validation

Page 12:

Ingestion: UIM Validation

Page 13:

UIM Validation: Record in Portal

Page 14:

UIM Validation: Record in Portal

Page 15:

UIM Validation: Records in Portal

Page 16:

Validation: Acceptance Portal

Page 17:

XSLT to Internal Object Model

Page 18:

Ingestion: UIM OAI-Enrichment-Acceptance

Page 19:

Ingestion: UIM OAI-Enrichment-Acceptance

Page 20:

Validation: Acceptance Portal

Page 21:

Dataset in Acceptance

Create an account on http://www.theeuropeanlibrary.org/

Use the credentials to sign in to acceptance: http://www.tel.ulcc.ac.uk/acceptance/

Validate data using the tabs
  Default
  Dublin Core
  (Soon) EDM

Page 22:

Validation: Acceptance Portal

Page 23:

Acceptance Portal: Communication

When a dataset is in acceptance
  Communication with data provider
  Fixing dataset if needed
  More communication until provider gives approval to publish

Data provider accepts dataset
  Dataset ready for The European Library live index

Page 24:

Ingestion: UIM Index to Publish

Page 25:

Live Index: Live Portal

When a provider accepts a dataset
  Dataset ready for live index

Dataset indexed into the live portal
  It takes from 1 day to 1 week for a dataset to be searchable in The European Library live portal (this varies with circumstances)

Page 26:

Dataset Live in Europeana

When a provider accepts a dataset
  Dataset delivered to Europeana
  Dataset searchable in Europeana by the following quarter

Dataset published live in Europeana
  E-mail to provider with a link to the dataset in the Europeana portal
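The "following quarter" rule is simple calendar arithmetic. A small sketch, assuming calendar quarters (Q1 = January to March) and hypothetical helper names `quarter_of` and `following_quarter`; it illustrates the rule stated on the slide and is not part of any Europeana tooling.

```python
from typing import Tuple


def quarter_of(year: int, month: int) -> Tuple[int, int]:
    """Return (year, quarter) for a calendar month, with Q1 = Jan-Mar."""
    if not 1 <= month <= 12:
        raise ValueError("month must be between 1 and 12")
    return year, (month - 1) // 3 + 1


def following_quarter(year: int, month: int) -> Tuple[int, int]:
    """Return the (year, quarter) that follows the quarter of delivery."""
    y, q = quarter_of(year, month)
    return (y, q + 1) if q < 4 else (y + 1, 1)


# Delivery in April 2014 falls in Q2 2014, so the target is Q3 2014.
print(following_quarter(2014, 4))   # (2014, 3)
# Delivery in October 2013 falls in Q4 2013, so the target rolls over to Q1 2014.
print(following_quarter(2013, 10))  # (2014, 1)
```

A check like this can be run over each row of an ingestion plan to confirm that the "In Europeana by" quarter is the one after the delivery month.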

Page 27:

SugarCRM: eCloud Ingestion Plan

eCloud Ingestion Plan Report

Page 28:

eCloud Ingestion Plan: Hangout # 1 (19th June)

1. National Library of Technology (NTK), Prague
Three datasets scheduled for Q2 2014
Delivery to The European Library: April 2014
In Europeana by Q3 2014

2. ULB
Five datasets scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q1 2014

3. DIALNET
Two datasets scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014

Page 29:

eCloud Ingestion Plan: Hangout # 1 (19th June)

4. Tilburg University
One dataset scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014

5. OAPEN
Two datasets scheduled for Q2 2013
Delivery to The European Library: May 2013
In Europeana by Q3 2013

Page 30:

eCloud Ingestion Plan: Hangout # 2 (21st June)

1. University of Edinburgh
Ten datasets scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q1 2014

2. DANS
Three datasets scheduled for Q3 2013
Delivery to The European Library: July 2013
In Europeana by Q4 2014

3. UNIBI
One dataset scheduled for Q3 2013
Delivery to The European Library: July 2013
In Europeana by Q4 2014

Page 31:

eCloud Ingestion Plan: Hangout # 2 (21st June)

4. VU University
Nine datasets scheduled for Q3 2013
Delivery to The European Library: July 2013
In Europeana by Q4 2013

One dataset scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014

One dataset scheduled for Q3 2014
Delivery to The European Library: July 2014
In Europeana by Q4 2014

5. Wales
One dataset scheduled for Q1 2014
Delivery to The European Library: January 2014
In Europeana by Q2 2014

Page 32:

eCloud Ingestion Plan: Hangout # 3 (3rd July)

1. Bavarian State Library
One dataset scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q4 2013

2. Debrecen University Library
Three datasets scheduled for Q1 2015
Delivery to The European Library: January 2015
In Europeana by Q2 2015

Page 33:

eCloud Ingestion Plan: Hangout # 3 (3rd July)

3. HAZU
Twenty-eight sub-sets scheduled for Q4 2013
Delivery to The European Library: October 2013
In Europeana by Q1 2014

One sub-set scheduled for Q2 2014
Delivery to The European Library: April 2014
In Europeana by Q3 2014

Page 34:

eCloud Ingestion Plan: Number of Records

Records promised = Records delivered

The number of records promised needs to be the same as the number of records delivered to The European Library

If a data provider cannot deliver the records promised
  The Collections Team needs to be informed soon

If a data provider has more records to deliver
  It's good news and we will be happy to ingest more

Deliverable D4.1 (containing the ingestion schedule) is available on Basecamp and can be accessed by everyone
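The "promised = delivered" rule amounts to a three-way comparison. A minimal sketch, assuming a hypothetical `compare_counts` helper and made-up message strings; the actions in the messages paraphrase the slide.

```python
def compare_counts(promised: int, delivered: int) -> str:
    """Compare promised and delivered record counts for one dataset.

    The returned strings are illustrative only.
    """
    if promised < 0 or delivered < 0:
        raise ValueError("record counts cannot be negative")
    if delivered < promised:
        return f"shortfall of {promised - delivered}: inform the Collections Team"
    if delivered > promised:
        return f"surplus of {delivered - promised}: extra records can be ingested"
    return "match"
```

For example, `compare_counts(1000, 950)` reports a shortfall of 50 records.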

Page 35:

Europeana Pro Website

Europeana Pro is the Europeana Professional website: http://pro.europeana.eu/

Here it is possible to find
  Information about projects
  News
  Discussions
  Technical documentation

For data providers preparing metadata
  Europeana rights information

Page 36:

Europeana Rights on Europeana Pro

Europeana Rights...
  Define the rights to the digital object
  A definition is mandatory for each record
  Can be inserted into the metadata
  Can be sent via email (if the same statement is applicable to each record)

There are 12 rights statements to choose from
  2 Public Domain
  6 Creative Commons Licenses
  4 Europeana Rights Reserved Statements

Europeana Rights on the Europeana Pro website: http://pro.europeana.eu/web/guest/available-rights-statements
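Since a rights statement is mandatory for each record and may either sit in the metadata or be supplied once for the whole dataset, a pre-delivery check could look like the sketch below. The record layout (plain dictionaries with hypothetical `id` and `rights` keys) and the `fill_rights` helper are assumptions for illustration, not the actual metadata schema.

```python
from typing import Dict, List, Optional, Tuple

Record = Dict[str, str]


def fill_rights(
    records: List[Record], default_rights: Optional[str] = None
) -> Tuple[List[Record], List[str]]:
    """Apply a dataset-wide rights statement where a record has none.

    Returns the completed records and the ids of records that still
    lack a rights statement (hypothetical "id" and "rights" keys).
    """
    completed: List[Record] = []
    missing: List[str] = []
    for record in records:
        rights = record.get("rights") or default_rights
        if rights:
            completed.append({**record, "rights": rights})
        else:
            missing.append(record["id"])
    return completed, missing
```

Records listed in the second return value would need a statement before delivery; passing `default_rights` models the case where one statement was sent by email for all records.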

Page 37:

Other Topics

Thumbnails
  Are not mandatory but they enrich the collection
  Can be inserted into the metadata
  A pattern for thumbnails can be sent via email
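One way to read "a pattern for thumbnails" is a URL template into which each record identifier is substituted. The sketch below assumes exactly that; the `{id}` placeholder, the `thumbnail_url` helper and the example.org address are all hypothetical.

```python
from urllib.parse import quote


def thumbnail_url(pattern: str, identifier: str) -> str:
    """Build a thumbnail URL from a provider-supplied pattern.

    Assumption: the pattern contains the literal placeholder "{id}".
    The identifier is percent-encoded so it is safe inside a URL path.
    """
    if "{id}" not in pattern:
        raise ValueError("pattern must contain the {id} placeholder")
    return pattern.replace("{id}", quote(identifier, safe=""))


print(thumbnail_url("http://example.org/thumbs/{id}.jpg", "rec42"))
# http://example.org/thumbs/rec42.jpg
```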

Page 38:

Other Topics

De-duplication

If two or more of your datasets share the same records, the data provider needs to
  Inform the Collections Team
  Help us to identify a pattern to de-duplicate records
  Or give us a list of identifiers to work with

The European Library portal clusters similar records
But Europeana does not accept duplicates
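When a provider supplies lists of identifiers, records shared between datasets can be found by grouping identifiers by the datasets they occur in. A minimal sketch, assuming identifiers are plain strings that match exactly across datasets; the `shared_identifiers` helper and the dataset names are hypothetical.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Set


def shared_identifiers(datasets: Dict[str, Iterable[str]]) -> Dict[str, List[str]]:
    """Map each identifier found in more than one dataset to those datasets."""
    seen: Dict[str, Set[str]] = defaultdict(set)
    for name, identifiers in datasets.items():
        for identifier in identifiers:
            seen[identifier].add(name)
    return {
        identifier: sorted(names)
        for identifier, names in seen.items()
        if len(names) > 1
    }


duplicates = shared_identifiers({
    "dataset_a": ["id-1", "id-2"],
    "dataset_b": ["id-2", "id-3"],
})
print(duplicates)  # {'id-2': ['dataset_a', 'dataset_b']}
```

Exact matching is the simplest case; if the same record carries different identifiers in different datasets, a matching pattern agreed with the provider would be needed instead.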

Page 39:

Other Topics

Sub-sets

If a dataset is made up of several sub-sets, the data provider needs to

Inform the Collections Team

Because tables and Ingestion Plan might need to be updated

Page 40:

Other Topics

Catalogue records and digital objects
  A catalogue record (bibliographic info) is recommended for each record
  A link to a digital object is mandatory for each record
  Links to digital objects need to be inserted into the metadata
  Europeana does not accept records without links to digital objects
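Because a link to a digital object is mandatory, records without one can be separated out before delivery. A minimal sketch, assuming records are plain dictionaries with a hypothetical `object_url` key; the real field depends on the metadata format in use.

```python
from typing import Dict, List, Tuple

Record = Dict[str, str]


def split_by_digital_object(records: List[Record]) -> Tuple[List[Record], List[Record]]:
    """Split records into those with and without a digital object link.

    Assumption: the link lives under the hypothetical key "object_url";
    empty or whitespace-only values count as missing.
    """
    with_link: List[Record] = []
    without_link: List[Record] = []
    for record in records:
        link = (record.get("object_url") or "").strip()
        (with_link if link else without_link).append(record)
    return with_link, without_link
```

The second list is what would need fixing by the provider, since such records are not accepted by Europeana.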

Page 41:

Other Topics

Example of record with no catalogue records or digital objects

Page 42:

Other Topics

Collection descriptions

A data provider could enrich a dataset by sending us a collection description
  It would appear on the collection level page in The European Library portal
  It would improve retrieval of a dataset on Google search
  It supports data analysis for Content Ingestion Strategy

A few examples
  Picture Archives and Graphics Collection, Austrian National Library
  Alba amicorum from the Koninklijke Bibliotheek, National Library of the Netherlands
  Digital Periodicals and Newspapers, National Library of Spain

Page 43:

Other Topics

Providers' experience

Comments about the timetable?
Special issues regarding your own datasets?
Assistance in preparing the data?
Issues with the number of records?
Questions?

Page 44:

Thank you!

For any questions or feedback: [email protected]

Chiara [email protected]

Page 45:

www.theeuropeanlibrary.org