locloud micro services and the digitisation workflow

39
XIIth Annual International Conference for Professionals in Cultural Heritage LoCloud Micro Services and the Digitisation Workflow Walter Koch ([email protected]) AIT-Angewandte Informationstechnik Forschungs GmbH, Graz - Austria LoCloud is funded by the European Commission's ICT Policy Support Programme

Upload: locloud

Post on 15-Apr-2017

610 views

Category:

Internet


0 download

TRANSCRIPT

Page 1: LoCloud Micro Services and the Digitisation Workflow

XIIth Annual International Conference for Professionals in Cultural HeritageLoCloud Micro Services and the Digitisation Workflow

Walter Koch ([email protected])AIT-Angewandte Informationstechnik

Forschungs GmbH, Graz - Austria

LoCloud is funded by the European Commission's ICT Policy Support Programme

Page 2: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 2

Content

• LoCloud Project (relevant activities)• Vocabulary Microservice• Microservice used in the OpenUp! Project• Digitisation Workflow (Books)• Digital Curation (Long Term Preservation)

Page 3: LoCloud Micro Services and the Digitisation Workflow

LoCloud Project(relevant activities)

2015-11-08 Walter Koch (AIT) 3

Page 4: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 4

locoud.eu

About:– LoCloud is a Best Practice Network co-funded under the

CIP ICT-PSP programme of the European Commission.– LoCloud is supporting small and medium-sized

institutions in making their content and metadata available to Europeana.

– A cloud-based technology infrastructure will enable the aggregation of local content, and

– a number of micro-services will help to reduce technical, semantic and skills barriers and to render the content more discoverable and interoperable.

Page 5: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 5

Microservices

Six development teams: AIT, Athena RC, AVINET, IPCHS, UPV/EHU, VUKFC

All the services are implemented on virtual machines in a cloud testlab (using the OpenNebula cloud computing platform)

API documentation is available for each serviceEnd-user interfaces are provided for three microservicesEach service has been integrated in MORe

Page 6: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 6

LoCloud Microservices

http://support.locloud.eu/

Cloud-based micro-services for metadata enrichment and capture: •Geo-location API•Vocabulary service•Historic Place Names service•Geo-coding application

• Vocabulary matching service • Background linking service• Wikimedia application

Page 7: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 7

LoGeo API• Uses NER-Name Entity Recognition• Recognizes and returns one or more place names

candidates accompanied by geographical coordinates.

GPLIPCHS

Page 8: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 8

Vocabulary service• Based on “TemaTres”• Incorporates 29 standard

thesauri • allows users to add and

create vocabularies, and add terms or translations

• add thesauri terms to all items of a metadata package

• Integrated in MORE• Exploited by the LoCloud

vocabulary matching microservice via its API

GPLAIT

Page 9: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 9

Historic Place Names service

• A semi-automatic historical geo-information management function and web service.

• Enables local cultural institutions to collaborate in developing the Thesaurus of HPN and enriching ingested metadata.

• Enables export of selected LoCloud HPN Thesaurus data sets.

GPLVUKF

Page 10: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 10

Geo-coding application• End-user oriented application• allows institutions to add geographical locations to existing content in

a controlled crowd-sourcing environment and then • load the augmented data back into their original production

databases.

GPLAVINET

Page 11: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 11

Vocabulary matching service

• Builds on work in the PATHS project (FP7)• Automatically assigns relevant concepts and terms to item

records • Retrieves terms from the Vocabulary serviceUPV/EHU Apache License 2.0

Page 12: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 12

Background linking service• Based on work in PATHS• Follows the Named Entity

Disambiguation approach• Enriches items with links to

external contextual information in DBpedia and Wikipedia

Apache License 2.0UPV/EHU

Page 13: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 13

Wikimedia application• A web service that uses a REST interface to communicate with

Wikimedia commons • The main functionalities are harvesting content form

Wikimedia, parsing the harvested content, and identifying useful entities that can be mapped to the ESE or EDM metadata schemas.

• The mapped ESE / EDM records are then sent to MORe to be delivered to Europeana.

Wikimedia Commons LicensingAthena RC

Page 14: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 14

The LoCloudAggregation Structure

EnrichmentServices

Euro

pean

a Cl

oud

Testlab

Page 15: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 15

About: Microservice

Wikipedia: https://en.wikipedia.org/wiki/Microservices (2015-09-15)

[...] microservices is a • software architecture style in which • complex applications are composed

of small, independent processes • communicating with each other using language-

agnostic APIs.• These services are small, highly decoupled and

focus on doing a small task,facilitating a modular approach to system-building.

Page 16: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 16

Evolution of Architecturespwc: Technology Forecast: Rethinking Integration, Issue 1, 2014, page 3

Page 17: LoCloud Micro Services and the Digitisation Workflow

Vocabulary Microservice

2015-11-08 Walter Koch (AIT) 17

Page 18: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 18

http://vocabulary.locloud.eu/• Vocabulary Microservices • This is the entry page for the

LoCloud vocabulary application. • LoCloud uses a variety of standard vocabularies

during the metadata enrichment process.These vocabularies have been established with the TemaTres tool and are available in the LoCloud cloud testlab.

• In addition, it is possible to feed the tool with your own vocabularies, establish new vocabularies or add languages to existing vocabularies.

Page 19: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 19

Vocabulary WebService20+ Functions, API

Parameters

Example

Page 20: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 20

Example: SearchUNESCO-Thesaurus

http://test113.ait.co.at/tematres/unesco/services.php?task=search&arg=peace

Page 21: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 21

SPARQL-ExampleDISMARC-Genres

• Query: "Find all Broader Term for 'Acid Jazz‘“• SPARQL End Point: http://test113.ait.co.at/tematres/vocab/sparql.php

QueryResult

ThesaurusEntry

Page 22: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 22

Service Call („fetchTopTerms“)embedded in a test application

Micro- Service

Vocabulary Web Service

Page 23: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 23

Test Application DevelopmentDesigner View

Development of a Drop Down List

Development of a Drop Down List

Page 24: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 24

The Test Application:Part of Data Entry

Path to the Thesaurus Item

„Candidate Term“ Crowdsourcing activity

Next Slide(Vocabulary Management)

Page 25: LoCloud Micro Services and the Digitisation Workflow

Microservice used in the OpenUp! Project (open-up.eu)

„Common Name Service“

2015-11-08 Walter Koch (AIT) 25

Page 26: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 26

OpenUp! - Natural HistoryAggregator for Europeana

Micro Service:Common Name

Service

Micro Service:Common Name

Service

Page 27: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 27

OpenUp!Transformation Workflow

Invoke Micro Service:

Common Name Service

Next Slide

Page 28: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 28

OpenUp!Transformation Parameters

Micro Service:Common Name Service

Service End Point (Cache)

Page 29: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 29

Result: OpenUp!Enriched Meta Data Set

Latin Name

Common Names

Page 30: LoCloud Micro Services and the Digitisation Workflow

Digitisation Workflow(Books)

2015-11-08 Walter Koch (AIT) 30

Page 31: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 31

Bibliotheca Alexandrinahttp://www.bibalex.org

Points for MicroService Integration

Page 32: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 32

Digital Curation(Long Term Preservation)

Page 33: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 33

Digital CurationState-of-the-Art Examples (2010)

• Archivematica– Linux VM based toolset allows producers and archivists

to implement a small scale workflow

• UC3 / CDL Merritt– Micro-Services specifications – implement building blocks for the Digital Curation

Lifecycle

Page 34: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 34

ArchivmaticaMicro Services

Linux

Toolsuite

Micro-

Services

Categories

of Micro-

Services

Page 35: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 35

California Digital LibraryUC3 Merritt

• California Digital Library UC Curation Center• Independant, Interoperable Micro-Services• Complex curation function through combination of

atomistic services

UC3 Model

Knowledge Enhancement

Phases

Micro-

Services

Annotation

Page 36: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 36

Micro-Services(Merritt vs. Archivematica)

CDL/UC3-MERRITT ARCHIVEMATICA

Micro-Services

for Pre-Ingest

Page 37: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 37

High-Level Workflow

• OAIS activities providea sound basis

• Archivematica Use Cases

• Merritt specifications• Micro-Services are the

building blocks

Producer

OAIS Activity

Archivist

Micro-Services

Page 38: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 38

CDLCuration Micro Services

1) „An Emergent Micro-Services Approach to Digital Curation Infrastructure“, IPRES 2009https://escholarship.org/uc/item/5313h6k9

2) UC Curation Center / California Digital Library, merritt: An Emergent Micro-services Approach to Digital Curation Infrastructure; Rev. 0.6, 2010-03-25

1) Annotation service 2)

The Annotation service provides a mechanism by which the consumers of managed digital content can enrich that content through additional description, association, or related content. The service supports the following methods:

• Annotate

• Submit.

Page 39: LoCloud Micro Services and the Digitisation Workflow

2015-11-08 Walter Koch (AIT) 39

Contact

Walter Koch, [email protected]

http://www.ait.co.at