linked open data for the cultural sector

27
Cultural Linked Open Data 2014-02-06 Lars Marius Garshol, [email protected], http://twitter.com/larsga 1

Upload: lars-marius-garshol

Post on 07-May-2015

908 views

Category:

Technology


2 download

DESCRIPTION

About how

TRANSCRIPT

Page 1: Linked Open Data for the Cultural Sector

1

Cultural Linked Open Data2014-02-06Lars Marius Garshol, [email protected], http://twitter.com/larsga

Page 2: Linked Open Data for the Cultural Sector

2

The importance of data

• Most web sites are data-driven– if you have the data, you can add

functionality– if you don’t have the data, you’re stuck

• Example: Google Maps– imagine you have the application, the

server farm, the scaling and monitoring, etc

– but you don’t have the actual map data– not only are you stuck, but creating the

data is much harder than making the service

Page 3: Linked Open Data for the Cultural Sector

3

Page 4: Linked Open Data for the Cultural Sector

4 Research project by SINTEF and Computas

Page 5: Linked Open Data for the Cultural Sector

5

Data sources

Research project by SINTEF and Computas

Page 6: Linked Open Data for the Cultural Sector

6

Must be at meeting at 1345. Three transport alternatives.

Research project by SINTEF and Computas

Page 7: Linked Open Data for the Cultural Sector

7

Data is raw material for

building services!

Page 8: Linked Open Data for the Cultural Sector

8

Possible users of cultural data• Any kind of web store

– publishers– streaming services– ...

• Travel businesses– public sector, hotels, tour organizers, event

organizers, ...• Media

– newspapers, broadcasting, ...• Lots of public sector uses

– education, ...• Many things none of us can’t imagine

now

Page 9: Linked Open Data for the Cultural Sector

9

Page 10: Linked Open Data for the Cultural Sector

10

Only linked data is usableNRK/Skole

Cappelen Damm

Page 11: Linked Open Data for the Cultural Sector

11

Linked Open Data

• Movement to publish open data online– in machine-readable form– linked to other data sets

• Based on some key technologies– URLs for identifiers– RDF for data

• Gaining a lot of traction in the cultural sector– BBC– Europeana– Smithsonian Institution– ...

Page 12: Linked Open Data for the Cultural Sector

12

The technology

• Provides simple data representation– graph model (RDF)– has ready-made formats (XML, text, JSON, ...)– standard query language (SPARQL)– lots of RDF databases available

• Allows anyone to refer to anything– a museum can say explicitly that one object

in their collection has a specific relation to an object in another collection

– liberation from the ID scheme confusion• Can reuse terminology from other

authorities– can also easily extend that terminology

Page 13: Linked Open Data for the Cultural Sector

13 http://lod-cloud.net/

Page 14: Linked Open Data for the Cultural Sector

14

Page 15: Linked Open Data for the Cultural Sector

http://dbpedia.org/resource/Knut_Faldbakken

• Globally unique– across all systems and organizations

• Distributed– if you have a domain, you can make

URIs

• Self-documenting– just follow the link to find documentation

• Can be used anywhere– anyone can point at anything

Page 16: Linked Open Data for the Cultural Sector

16

Today

• Flat, unlinked data• No navigation• No connections• Poor characterization– doesn’t say what it is

Page 17: Linked Open Data for the Cultural Sector

17

As linked data

nv:Photographrdf:type

“Bergliot Ibsen”dc:title

1903dc:date

edm:ProvidedCHO

dc:subject

“Bergliot Ibsen”

rdfs:label

1953-02-02dbp:died

1869-06-10

dbp:born

foaf:Person

http://dbpedia.org/resource/Bergliot_Ibsen

Europeana Data Model

nv:provider

“Aulestad”

61.2173 10.265952http://dbpedia.org/resource/Aulestad

rdfs:label

grs:point

Page 18: Linked Open Data for the Cultural Sector

Choice of tools

Triple stores

APIsRedland RDF Libraries

Reasoners

pellet

Modelling

Page 19: Linked Open Data for the Cultural Sector

19

Great, but how can we actuallylink the data?

Page 20: Linked Open Data for the Cultural Sector

20

Page 21: Linked Open Data for the Cultural Sector

21

“Do they have Knut Faldbakken in here?”

http://data.deichman.no/sparql

Page 22: Linked Open Data for the Cultural Sector

22

Yes, but not connected to anything ...

...can we do anything about that?

Page 23: Linked Open Data for the Cultural Sector

23

Record linkage to the rescue

• Active research field– dating back to the 1940s

• Can connect data without common IDs

– measure similarity instead• Tools exist, with

– value cleaning– statistical analysis– sophisticated comparators– fast search backends

• One example is Duke– http://code.google.com/p/duke/– Java and open source

Page 24: Linked Open Data for the Cultural Sector

24

Connect to DBpedia

http://dbpedia.org/resource/Knut_Faldbakken

NAME: Knut FaldbakkenBIRTHDATE: 1941-08-31

http://data.deichman.no/...dbakken_Knut_1941-

NAME: Faldbakken, KnutLIFESPAN: 1941-NATIONALITY: n

http://code.google.com/p/duke/wiki/DeichmanLink

Complete recipe here

Page 25: Linked Open Data for the Cultural Sector

25

Training with genetic algorithm

http://www.garshol.priv.no/blog/262.html

Page 26: Linked Open Data for the Cultural Sector

26

Conclusion

• Linked Open Data has tremendous potential– vastly easier reuse of data– hugely empowering for consumers– also opens new possibilities for data

owners• Growing use in cultural sector– both internationally and in Norway

• To learn more– http://www.slideshare.net/larsga/linked-

open-data-14964163– http://data.norge.no/veiledning– http://linkeddatabook.com/editions/1.0/

Page 27: Linked Open Data for the Cultural Sector

27

Hafslund SESAM