Copyright 2011 Digital Enterprise Research Institute. All rights reserved.
Digital Enterprise Research Institute www.deri.ie
Enabling Networked Knowledge
Linked Data: opportunities and challenges
Dr. Michael Hausenblas, DERI, NUI Galway
Open Science Data Cloud NSF PIRE Workshop, Edinburgh, UK, 18 July 2012
Linked Data 101
Linked Data principles
① Use URIs to identify the “things” in your data
② Use HTTP URIs so people & machines can look them up
③ When a URI is looked up, return a description of the thing in a structured format (RDF)
④ Link to related things to provide context
http://www.w3.org/DesignIssues/LinkedData.html
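Principles ② and ③ can be tried out with plain HTTP content negotiation. The following is a minimal Python sketch using only the standard library. The helper names build_lookup and look_up are made up for illustration, and the example URI and media type are taken from the curl examples in this deck.

```python
from urllib.request import Request, urlopen


def build_lookup(uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare an HTTP GET that asks for a structured description of `uri`."""
    # The Accept header tells the server which RDF serialization we prefer.
    return Request(uri, headers={"Accept": media_type})


def look_up(uri: str, media_type: str = "text/turtle", timeout: float = 10.0) -> str:
    """Dereference `uri` over the network; urlopen follows redirects by default."""
    with urlopen(build_lookup(uri, media_type), timeout=timeout) as response:
        return response.read().decode("utf-8")


# Build (but do not send) a request for the DBpedia resource used in this deck.
request = build_lookup("http://dbpedia.org/resource/Edinburgh", "application/rdf+xml")
print(request.get_method(), request.full_url, request.get_header("Accept"))
```

Calling look_up with the same arguments would send the request. What comes back depends on the server and may differ from the output shown on the slides.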
HTTP URIs
http://sw-app.org/mic.xhtml#i
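The URI above is a hash URI: the part after # is a fragment identifier, which an HTTP client removes before sending the request, so the document is fetched and the fragment is resolved on the client side. A small Python sketch of that split, using only the standard library (the variable names are illustrative):

```python
from urllib.parse import urldefrag

thing_uri = "http://sw-app.org/mic.xhtml#i"

# urldefrag separates the document URL from the fragment identifier.
document_url, fragment = urldefrag(thing_uri)

print(document_url)  # the URL that is actually requested over HTTP
print(fragment)      # the part that identifies the "thing" within that document
```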
HTTP URIs
http://dbpedia.org/resource/Edinburgh
curl -L -H "Accept: application/rdf+xml" http://dbpedia.org/resource/Edinburgh
<?xml version="1.0" encoding="utf-8" ?>
<rdf:RDF
  xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
  xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
  xmlns:owl="http://www.w3.org/2002/07/owl#"
  xmlns:foaf="http://xmlns.com/foaf/0.1/"
  xmlns:geo="http://www.w3.org/2003/01/geo/wgs84_pos#"
  xmlns:dbpedia-owl="http://dbpedia.org/ontology/"
  xmlns:dcterms="http://purl.org/dc/terms/"
  xmlns:dbpprop="http://dbpedia.org/property/"
  xmlns:ns10="http://dbpedia.org/property/start/" >
  <rdf:Description rdf:about="http://dbpedia.org/resource/Firrhill_High_School">
    <dbpedia-owl:city rdf:resource="http://dbpedia.org/resource/Edinburgh" />
  </rdf:Description>
  <rdf:Description rdf:about="http://dbpedia.org/resource/Murrayfield_Stadium">
    <dbpedia-owl:location rdf:resource="http://dbpedia.org/resource/Edinburgh" />
    <dbpprop:location rdf:resource="http://dbpedia.org/resource/Edinburgh" />
  </rdf:Description>
  <rdf:Description rdf:about="http://dbpedia.org/resource/Stewart%27s_Melville_College">
    <dbpedia-owl:city rdf:resource="http://dbpedia.org/resource/Edinburgh" />
    <dbpprop:city rdf:resource="http://dbpedia.org/resource/Edinburgh" />
  </rdf:Description>
HTTP URIs
http://data.ordnancesurvey.co.uk/id/7000000000017765
curl -L -H "Accept: text/turtle" http://data.ordnancesurvey.co.uk/id/7000000000017765
<http://data.ordnancesurvey.co.uk/doc/7000000000017765>
    rdf:type foaf:Document, dctype:Text ;
    foaf:primaryTopic <http://data.ordnancesurvey.co.uk/id/7000000000017765> ;
    dct:title "Linked Data for The County of Hampshire" ;
    dct:hasFormat <http://data.ordnancesurvey.co.uk/doc/7000000000017765.rdf> ,
        <http://data.ordnancesurvey.co.uk/doc/7000000000017765.html> ,
        <http://data.ordnancesurvey.co.uk/doc/7000000000017765.json> ,
        <http://data.ordnancesurvey.co.uk/doc/7000000000017765.ttl> .
<http://data.ordnancesurvey.co.uk/id/7000000000017636> rdfs:label "Tadley" ; skos:prefLabel "Tadley" .
<http://data.ordnancesurvey.co.uk/id/7000000000017510> rdfs:label "Newton Valence" ; skos:prefLabel "Newton Valence" .
<http://data.ordnancesurvey.co.uk/id/7000000000017817> rdfs:label "Ashmansworth" ; skos:prefLabel "Ashmansworth" .
HTTP URIs
http://bio2rdf.org/genbank:AC008393
curl -L -H "Accept: text/turtle" http://bio2rdf.org/genbank:AC008393
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix dc: <http://purl.org/dc/elements/1.1/> .
<http://bio2rdf.org/genbank:AC008393>
    a <http://bio2rdf.org/genbank_resource:Sequence> ;
    rdfs:label "Homo sapiens chromosome 5 clone CTC-241N9, complete sequence [genbank:AC008393]" ;
    owl:sameAs <http://bio2rdf.org/genbank:ac008393> ;
    dc:title "Homo sapiens chromosome 5 clone CTC-241N9, complete sequence" ;
    dc:modified "26-FEB-2002" ;
    <http://bio2rdf.org/bio2rdf_resource:length> "166847" ;
    <http://bio2rdf.org/bio2rdf_resource:linkedToFrom>
        <http://bio2rdf.org/iproclass:A8MPR0> ,
        <http://bio2rdf.org/iproclass:B5MCC3> .
HTTP URIs
http://bnb.data.bl.uk/doc/resource/009468944
curl -L -H "Accept: text/turtle" http://bnb.data.bl.uk/doc/resource/009468944
@prefix rdf: <http://www.w3.org/1999/02/22-rdf-syntax-ns#> .
@prefix rdfs: <http://www.w3.org/2000/01/rdf-schema#> .
@prefix dct: <http://purl.org/dc/terms/> .
@prefix blterms: <http://www.bl.uk/schemas/bibliographic/blterms#> .
@prefix elements: <http://iflastandards.info/ns/isbd/elements/> .
@prefix bibo: <http://purl.org/ontology/bibo/> .
@prefix owl: <http://www.w3.org/2002/07/owl#> .
@prefix foaf: <http://xmlns.com/foaf/0.1/> .
@prefix linked-data: <http://purl.org/linked-data/api/vocab#> .
@prefix void: <http://rdfs.org/ns/void#> .
<http://bnb.data.bl.uk/id/resource/009468944>
    dct:language <http://lexvo.org/id/iso639-3/eng> ;
    rdfs:seeAlso <http://www4.wiwiss.fu-berlin.de/bookmashup/books/0859761541> ;
    elements:P1053 "vii,147p."@en ;
    rdfs:label "William Wallace / Andrew Fisher" ;
    blterms:bnb "GB8714157" ;
    dct:creator <http://bnb.data.bl.uk/id/person/FisherAndrew1935-> ;
    bibo:isbn10 "0859761541" ;
    dct:title "William Wallace" ;
    rdf:type bibo:Book , dct:BibliographicResource ;
    dct:subject <http://bnb.data.bl.uk/id/concept/ddc/e19/941.1020924> .
Linked Open Data
Figure: LOD cloud diagrams for 2007, 2008, 2009 and 2010
Linked Open Data cloud
Over 300 open data sets with 40 billion facts, interlinked by 500 million typed links.
http://lod-cloud.net/
Linked Open Data cloud stats
Charts: triples distribution; links distribution
http://lod-cloud.net/state/
… cost and benefits
Linked Data life cycles
Figure: Linked Data life cycle, annotated with tools and resources: LOD cloud, 5stardata.info, Neologism, Schema.org, Google Refine, D2RQ, FYN, LATC DSI, LATC 24/7, data-gov.ie
http://linked-data-life-cycles.info
Modeling
Neologism
http://neologism.deri.ie/
Neologism
http://vocab.data.gov/
Schema.org – Linked Data
Publishing
Google Refine extension
http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
RDB2RDF – D2RQ
http://d2rq.org/
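The general idea behind relational-to-RDF mapping can be sketched in a few lines of Python. This is not D2RQ's mapping language or API: the function row_to_triples, the URI pattern under http://example.org/, and the example row are all hypothetical and only illustrate how one table row might be turned into N-Triples statements.

```python
def row_to_triples(base_uri, table, key_column, row):
    """Turn one relational row (a dict) into N-Triples lines.

    Hypothetical URI scheme: the subject is <base/table/key> and each
    non-key column becomes a predicate <base/vocab/table_column>.
    """
    subject = f"<{base_uri}{table}/{row[key_column]}>"
    triples = []
    for column, value in row.items():
        if column == key_column or value is None:
            continue  # the key is already in the subject URI; NULLs yield no triple
        predicate = f"<{base_uri}vocab/{table}_{column}>"
        # Escape characters that are not allowed raw inside an N-Triples literal.
        literal = (
            str(value)
            .replace("\\", "\\\\")
            .replace('"', '\\"')
            .replace("\n", "\\n")
            .replace("\r", "\\r")
        )
        triples.append(f'{subject} {predicate} "{literal}" .')
    return triples


# Made-up example row, as it might come out of a "city" table.
example_row = {"id": 7, "name": "Edinburgh", "country": None}
for line in row_to_triples("http://example.org/", "city", "id", example_row):
    print(line)
```

A real mapping tool also has to deal with foreign keys (which become links between resources), datatypes, and vocabulary reuse, none of which this sketch attempts.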
Discovery
Follow-Your-Nose
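Follow-your-nose discovery can be pictured as a breadth-first walk over the links found in each description. The Python sketch below is an illustration under stated assumptions: follow_your_nose and dereference are hypothetical names, and instead of doing HTTP lookups it uses a small in-memory table holding three links copied from the British National Bibliography example earlier in this deck.

```python
from collections import deque

START = "http://bnb.data.bl.uk/id/resource/009468944"

# Stand-in for the Web: subject URI -> list of (predicate, target URI) pairs.
LINKS = {
    START: [
        ("dct:language", "http://lexvo.org/id/iso639-3/eng"),
        ("rdfs:seeAlso", "http://www4.wiwiss.fu-berlin.de/bookmashup/books/0859761541"),
        ("dct:creator", "http://bnb.data.bl.uk/id/person/FisherAndrew1935-"),
    ],
}


def dereference(uri):
    """Return the outgoing links of `uri`; a real client would do an HTTP lookup here."""
    return LINKS.get(uri, [])


def follow_your_nose(start, dereference, max_depth=2):
    """Visit URIs breadth-first, following links up to `max_depth` hops from `start`."""
    seen = {start}
    queue = deque([(start, 0)])
    visited = []
    while queue:
        uri, depth = queue.popleft()
        visited.append(uri)
        if depth == max_depth:
            continue  # do not expand links beyond the depth limit
        for _predicate, target in dereference(uri):
            if target not in seen:
                seen.add(target)
                queue.append((target, depth + 1))
    return visited


for uri in follow_your_nose(START, dereference):
    print(uri)
```

The depth limit and the seen set matter in practice, because the Web of Data contains cycles and a crawl without bounds would not terminate.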
Dataset discovery
http://dsi.lod-cloud.net/
Integration
Why linking?
Central Contractor Registration (CCR)
Geonames
http://webofdata.wordpress.com/2011/05/22/why-we-link/
Effort distribution
Fixed overall data integration effort, shared among:
Publisher's effort
Consumer's effort
Third-party effort
LATC – Interlinking Platform
http://latc-project.eu/platform
http://www4.wiwiss.fu-berlin.de/latc/toollibrary/screencast.html
Conclusion
Opportunities:
Use the LOD cloud as test-bed (experiments)
Benefit from LOD cloud in apps (context)
Contribute to make your data more valuable
Challenges:
Large-scale processing of Linked Data
Distributed/federated SPARQL queries
Quality of links and the data
Resources
Tutorials, technologies, specifications:
http://linkeddatabook.com
http://lod-cloud.net
http://linkeddata.org
http://linkeddata-specs.info
http://schema.rdfs.org
Videos:
http://ted.com/talks/tim_berners_lee_on_the_next_web.html - Tim Berners-Lee's TED talk
http://www.youtube.com/watch?v=GKfJ5onP5SQ - Linked Data (and the Web of Data)
http://www.youtube.com/watch?v=4x_xzT5eF5Q - What is Linked Data?
http://vimeo.com/36752317 - Linked Open Data (by Europeana)