Session 3: Vocabulary enrichment, Gerda Koch

local content in a Europeana cloud Session 3: Vocabulary enrichment Gerda Koch, [email protected] AIT Angewandte Informationstechnik Forschungsgesellschaft mbH LoCloud is funded by the European Commission's ICT Policy Support Programme

Upload: locloud

Post on 19-Jan-2015


DESCRIPTION

LoCloud Content Provider workshops Aug-Sept 2013

TRANSCRIPT

Page 1: Session 3: Vocabulary enrichment, Gerda Koch

local content in a Europeana cloud

Session 3: Vocabulary enrichment

Gerda Koch, [email protected] AIT Angewandte Informationstechnik

Forschungsgesellschaft mbH

LoCloud is funded by the European Commission's ICT Policy Support Programme

Page 2: Session 3: Vocabulary enrichment, Gerda Koch

WP3: Micro services for small and medium institutions

• establishing a cloud-based collaborative testing environment for tools and services

• developing cloud-based SaaS services (Software as a Service) and applications suitable for use by small and medium institutions

• providing the basis for a continuing process of participative testing and validation of each of the services and applications

Introduction

Page 3: Session 3: Vocabulary enrichment, Gerda Koch

WP3: Service framework

• Geolocation enrichment

• Metadata enrichment

• Vocabularies and languages

• Historic place names

• Wikimedia applications

Introduction

Page 4: Session 3: Vocabulary enrichment, Gerda Koch

This presentation will provide you:

A brief introduction to …

1. Web Services

2. Vocabulary Standards used by Task 3.4

3. Vocabulary Management Tool (Sample)

Contents

Page 5: Session 3: Vocabulary enrichment, Gerda Koch

What is a Web Service?

A web service is a software function provided at a network address over the web or the cloud and available around the clock (24/7).

Web services:

• are application components

• communicate using open protocols

• are self-contained and self-describing

• can be used by other applications

• XML is the basis for web services
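As a rough illustration of these points, a client typically builds a request URL, sends it over HTTP and parses the XML that comes back. The sketch below uses only the Python standard library and skips the network call. The endpoint `https://vocab.example.org/search`, its `q` and `lang` parameters and the `<results>/<term>` response layout are all invented for illustration and do not describe any real service.

```python
from urllib.parse import urlencode
import xml.etree.ElementTree as ET

# Hypothetical endpoint: not a real LoCloud or Getty address.
BASE_URL = "https://vocab.example.org/search"


def build_query_url(text, lang="en"):
    """Build the request URL for the (hypothetical) vocabulary search service."""
    return BASE_URL + "?" + urlencode({"q": text, "lang": lang})


def parse_terms(xml_text):
    """Extract (id, label) pairs from an assumed
    <results><term id="...">label</term></results> reply."""
    root = ET.fromstring(xml_text)
    return [(term.get("id"), term.text) for term in root.findall("term")]


# Canned response standing in for what such a service might return.
SAMPLE_RESPONSE = """<results>
  <term id="1">Rhine</term>
  <term id="2">Elbe</term>
</results>"""

url = build_query_url("rivers in Germany")
terms = parse_terms(SAMPLE_RESPONSE)
print(url)
print(terms)
```

In a real client the canned string would be replaced by the body of the HTTP response. The parsing step stays the same as long as the service keeps its XML layout stable.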

1 Web Services

Page 6: Session 3: Vocabulary enrichment, Gerda Koch

Query a Web Service offered online

Results: Rivers in Germany

TGN – Getty Thesaurus of Geographic Names

Query

Result

1 Web Services

Page 7: Session 3: Vocabulary enrichment, Gerda Koch

Xataface: editing a record

Integrate a Web Service in a local application

Which descriptors should I use?

1 Web Services

Page 8: Session 3: Vocabulary enrichment, Gerda Koch

Integrate a Web Service in a local application

The vocabulary web service is directly addressed within the entry field of the application.

The user chooses the vocabulary terms, which are then taken over into the application (auto-suggest).

Results: Music Genres

DISMARC Genres Vocabulary
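The auto-suggest behaviour described on this slide can be sketched as a case-insensitive prefix match over the vocabulary terms. This is only a minimal illustration: the `suggest` function and the `GENRES` list are hypothetical, and in a real integration the terms would come from the vocabulary web service rather than a hard-coded list.

```python
def suggest(prefix, terms, limit=5):
    """Return up to `limit` terms that start with `prefix`, ignoring case."""
    needle = prefix.casefold()
    if not needle:
        # Nothing typed yet, so nothing to suggest.
        return []
    matches = [t for t in terms if t.casefold().startswith(needle)]
    return sorted(matches, key=str.casefold)[:limit]


# Hypothetical genre terms used only for this example.
GENRES = ["Jazz", "Blues", "Baroque", "Ballad", "Folk", "Bluegrass"]

print(suggest("bl", GENRES))
print(suggest("ba", GENRES))
```

The entry field would call such a function on every keystroke and let the user pick one of the returned terms, so that only terms from the controlled vocabulary end up in the record.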

1 Web Services

Page 9: Session 3: Vocabulary enrichment, Gerda Koch

Task 3.4: Vocabularies and languages

• Experimental application to enable local cultural institutions to collaborate in the development of multilingual vocabularies for local history and archaeology

2 Vocabularies

Page 10: Session 3: Vocabulary enrichment, Gerda Koch

The application will be based on:

ISO 25964: standard for building thesauri

• Part 1: Thesauri for information retrieval

- published in 2011

- developing a thesaurus (mono- and multilingual)

- replaced previous standards ISO 2788/5964

- includes data model and XML schema

• Part 2: Interoperability with other vocabularies

- published in 2013

- recommendations for the establishment and maintenance of mappings between multiple thesauri, or between thesauri and other types of vocabularies

2 Vocabularies

Page 11: Session 3: Vocabulary enrichment, Gerda Koch

The application will use SKOS as exchange format:

SKOS (Simple Knowledge Organization System)

• is a W3C recommendation designed for representation of controlled vocabularies

• main objective is to enable easy publication and use of such vocabularies as linked data

2 Vocabularies

Page 12: Session 3: Vocabulary enrichment, Gerda Koch

How these two relate…

• The SKOS metamodel is broadly compatible with the data model of ISO 25964-1 - Thesauri for Information Retrieval.

• ISO 25964-1 advises on the selection and fitting together of concepts, terms and relationships to make a good thesaurus.

• SKOS addresses the next step - porting the thesaurus to the Web.

2 Vocabularies

Page 13: Session 3: Vocabulary enrichment, Gerda Koch

Using SKOS, concepts can be identified using URIs, labeled with lexical strings in one or more natural languages…

The SKOS Core Vocabulary is an application of the Resource Description Framework (RDF), that can be used to express a concept scheme as an RDF graph. Using RDF allows data to be linked to and/or merged with other data, enabling data sources to be distributed across the web, but still be meaningfully composed and integrated.
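To make the idea of an RDF graph more concrete, the sketch below models triples as plain Python tuples and shows how two small graphs merge by set union. It deliberately avoids any RDF library. The `http://example.org/thesaurus/` namespace, the concept identifiers `c1` and `c2` and the choice of `skos:narrower` between them are assumptions made for illustration only.

```python
SKOS = "http://www.w3.org/2004/02/skos/core#"
BASE = "http://example.org/thesaurus/"  # hypothetical namespace

# A triple is (subject, predicate, object).
# Literal objects are (text, language) pairs; other objects are URIs.
graph_en = {
    (BASE + "c1", SKOS + "prefLabel", ("Archaeology", "en")),
    (BASE + "c1", SKOS + "narrower", BASE + "c2"),  # assumed relation
    (BASE + "c2", SKOS + "prefLabel", ("Archaeological excavations", "en")),
}
graph_es = {
    (BASE + "c1", SKOS + "prefLabel", ("Arqueología", "es")),
}

# Because both sources use the same URI for the concept,
# merging is just the union of the two sets of triples.
merged = graph_en | graph_es


def pref_labels(graph, concept):
    """Collect the preferred labels of a concept, keyed by language code."""
    return {
        obj[1]: obj[0]
        for subj, pred, obj in graph
        if subj == concept and pred == SKOS + "prefLabel"
    }


print(pref_labels(merged, BASE + "c1"))
```

The shared URI is what lets labels maintained by different partners end up attached to the same concept after the merge.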

2 Vocabularies

Page 14: Session 3: Vocabulary enrichment, Gerda Koch

Vocabulary examples (SKOS Format)

• DDC Dewey Decimal Classification

• Library of Congress’ vocabularies

• VIAF person authorities

• UKAT UK Archival Thesaurus

• UNESCO Thesaurus

• …

Example….

2 Vocabularies

Page 15: Session 3: Vocabulary enrichment, Gerda Koch

[Figure: a SKOS example shown as XML and as a graph. Concept URIs: C00213, C00206, C00207. Predicates: skos:prefLabel, skos:narrower, skos:related. Labels: "Archaeology", "Arqueología", "Archaeological dating", "Archaeological excavations". Each statement is annotated as Subject, Predicate, Object.]
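The XML side of such an example can be approximated with the standard library alone. The sketch below serialises one concept as SKOS in RDF/XML. The function name `concept_to_rdfxml` and the `http://example.org/thesaurus/` URIs are hypothetical, and the output is a simplified illustration rather than the exact XML shown on the slide.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
SKOS = "http://www.w3.org/2004/02/skos/core#"
XML = "http://www.w3.org/XML/1998/namespace"

# Use the conventional prefixes in the serialised output.
ET.register_namespace("rdf", RDF)
ET.register_namespace("skos", SKOS)


def concept_to_rdfxml(uri, labels, narrower=()):
    """Serialise one skos:Concept with language-tagged preferred labels."""
    root = ET.Element(f"{{{RDF}}}RDF")
    concept = ET.SubElement(root, f"{{{SKOS}}}Concept", {f"{{{RDF}}}about": uri})
    for lang, text in labels.items():
        label = ET.SubElement(
            concept, f"{{{SKOS}}}prefLabel", {f"{{{XML}}}lang": lang}
        )
        label.text = text
    for target in narrower:
        ET.SubElement(
            concept, f"{{{SKOS}}}narrower", {f"{{{RDF}}}resource": target}
        )
    return ET.tostring(root, encoding="unicode")


# Hypothetical URIs; the narrower link is an assumption for illustration.
xml_text = concept_to_rdfxml(
    "http://example.org/thesaurus/c1",
    {"en": "Archaeology", "es": "Arqueología"},
    narrower=["http://example.org/thesaurus/c2"],
)
print(xml_text)
```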

Page 16: Session 3: Vocabulary enrichment, Gerda Koch

Using a vocabulary for classification

Semantic Net presentation

• Graph presentation (Topic Map)

• Tree presentation

Object, Predicate, Subject
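A tree presentation can be derived from the broader/narrower relations of a vocabulary. The sketch below renders an indented tree from a mapping of each term to its narrower terms. The `render_tree` helper and the sample hierarchy are hypothetical (both sample terms are treated as narrower terms of "Archaeology" purely for illustration), and the code assumes the hierarchy contains no cycles.

```python
def render_tree(narrower, root, depth=0):
    """Return the hierarchy below `root` as a list of indented lines.

    `narrower` maps a term to the list of its narrower terms.
    Assumes the hierarchy is acyclic.
    """
    lines = ["  " * depth + root]
    for child in sorted(narrower.get(root, [])):
        lines.extend(render_tree(narrower, child, depth + 1))
    return lines


# Hypothetical hierarchy used only for this example.
NARROWER = {
    "Archaeology": ["Archaeological excavations", "Archaeological dating"],
}

tree_lines = render_tree(NARROWER, "Archaeology")
print("\n".join(tree_lines))
```

A graph presentation would additionally show non-hierarchical links such as skos:related, which a tree cannot express.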

2 Vocabularies

Page 17: Session 3: Vocabulary enrichment, Gerda Koch

Usage of SKOS within the Europeana Data Model (EDM)

Contextual Classes (EDM)

Object, Predicate, Subject

2 Vocabularies

Page 18: Session 3: Vocabulary enrichment, Gerda Koch

Provided Cultural Heritage Object (EDM)

Object

Predicate

Subject

2 Vocabularies

Page 19: Session 3: Vocabulary enrichment, Gerda Koch

Vocabulary management tool examples

Import and export thesauri

ThManager 2.0, developed by the University of Zaragoza

UNESCO Thesaurus

3 Voc. Tool

Page 20: Session 3: Vocabulary enrichment, Gerda Koch

Vocabulary management tool examples

View and browse thesauri

List view

Tree view

Search terms

3 Voc. Tool

Page 21: Session 3: Vocabulary enrichment, Gerda Koch

Vocabulary management tool examples

Edit thesauri

3 Voc. Tool

Page 22: Session 3: Vocabulary enrichment, Gerda Koch

Vocabulary Management – Business Process/Workflow

The process of editing and managing a vocabulary involves communication between different persons and the technical system, e.g. finding a term, requesting a candidate term, accepting the new term, updating the vocabulary, etc.

User uses web service for finding a vocabulary term

Term exists in the vocabulary

SaaS

Crowdsourcing

Term does not exist in the vocabulary

Use crowdsourcing for finding a new candidate term
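The workflow on this slide (find a term, request a candidate term, accept it, update the vocabulary) can be sketched as a small state model. The `Vocabulary` class, its method names and the sample terms are hypothetical and only illustrate the flow. They are not the design of the LoCloud tool.

```python
from dataclasses import dataclass, field


@dataclass
class Vocabulary:
    """Minimal model: accepted terms plus a pool of candidate terms."""
    terms: set = field(default_factory=set)
    candidates: set = field(default_factory=set)

    def find(self, term):
        """Step 1: check whether the term is already in the vocabulary."""
        return term in self.terms

    def request_candidate(self, term):
        """Step 2: record a missing term as a candidate for review."""
        if term not in self.terms:
            self.candidates.add(term)

    def accept(self, term):
        """Steps 3 and 4: an editor accepts the candidate and the
        vocabulary is updated. Returns False if there was no such candidate."""
        if term in self.candidates:
            self.candidates.remove(term)
            self.terms.add(term)
            return True
        return False


def lookup(vocab, term):
    """User-facing entry point: use the term if it exists, else propose it."""
    if vocab.find(term):
        return "found"
    vocab.request_candidate(term)
    return "candidate requested"


vocab = Vocabulary(terms={"Archaeology"})
print(lookup(vocab, "Archaeology"))
print(lookup(vocab, "Hillfort"))
print(vocab.accept("Hillfort"))
print(lookup(vocab, "Hillfort"))
```

Keeping candidates separate from accepted terms is one way to support the "suggest candidate terms" model, where not every user may add terms directly.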

3 Voc. Tool

Page 23: Session 3: Vocabulary enrichment, Gerda Koch

Outcomes of Copenhagen - WS1

Access to the vocabulary:

• Online access point (see TGN example)

• Integration into local application (see Genre example)

Import of existing vocabularies into the LoCloud experimental vocabulary application (prerequisite: skosified, multilingual, open access)

• Subject vocabularies

• Object Types

• Geographic Names

• Suggested vocabularies: UNESCO Thesaurus, UKAT

(http://www.heritagedata.org/blog/vocabularies-provided/ ? UK)

Page 24: Session 3: Vocabulary enrichment, Gerda Koch

Outcomes of Copenhagen

General usage of the tool

• Everybody allowed to enter new terms vs. Suggest candidate terms

• Multilinguality: language translations assigned to partners

Other wishes?

• Crowdsourcing?

• Automated Vocabulary enrichment

Page 25: Session 3: Vocabulary enrichment, Gerda Koch

Thank you!

[email protected]

Page 26: Session 3: Vocabulary enrichment, Gerda Koch

LoCloud is funded by the European Commission's ICT Policy Support Programme

The views and opinions expressed in this presentation are the sole responsibility of the authors and do not necessarily reflect the views of the European Commission.

Funding