![Page 1: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/1.jpg)
Michalis Vafopoulos NTUA & www.publicspending.net
www.vafopoulos.org
Linked Data in a nutshell
summer school NCSR, IRSS-2013
![Page 2: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/2.jpg)
Welcome to the data era
![Page 3: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/3.jpg)
Data: Open, big, linked
Open: access (everyone can use and republish as she wishes)
Big: scale (high volume, velocity and variety)
Linked: use (publish once, use many times)
![Page 4: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/4.jpg)
Is it working?
• Current Employee Names, Salaries, and Position Titles
• The Open Database Of The Corporate World
• Crime map
• NHS efficiency savings: the role of prescribing analytics
• Where public money goes worldwide
![Page 5: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/5.jpg)
How is it working?
Linked data in a nutshell
Sources: T. Heath, J. Sequeda, the Web
![Page 6: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/6.jpg)
The Web of Documents
• Analogy: a global file system
• Designed for: human consumption
• Primary objects: documents
• Links between: documents (or sub-parts of)
• Degree of structure in objects: fairly low
• Semantics of content and links: implicit (left to humans)
(Tom Heath)
The web = the internet + links + documents
![Page 7: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/7.jpg)
The Web of Documents
• Simple, big and unstructured
• Organized in silos
But humans are interested in:
• Things, not documents, and
• these Things might be in documents or elsewhere
• Humans: limited capacity to extract meaning...
![Page 8: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/8.jpg)
Limited SEARCH capacity
Search for: football players who went to the University of Texas at Austin and played for the Dallas Cowboys as cornerback
(Juan F. Sequeda)
![Page 9: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/9.jpg)
Google, Bing, Yahoo!: irrelevant results
![Page 10: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/10.jpg)
Wikipedia through LD: relevant
![Page 11: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/11.jpg)
The Web of Data
• Analogy: a global file system --> a global database
• Designed for: human consumption --> machines first, humans later
• Primary objects: documents --> things (or descriptions of things)
• Links between: documents --> things
• Degree of structure in objects: fairly low --> high
• Semantics of content and links: implicit --> explicit
(Tom Heath)
![Page 12: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/12.jpg)
The Modigliani Test
• Show me all the locations of all the original paintings of Modigliani
• Daniel Koller (@dakoller) showed that you can find this with a SPARQL query on DBpedia
Thanks Richard MacManus - ReadWriteWeb
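A query in the spirit of the Modigliani Test might look like the sketch below. It is not Daniel Koller's original query: the properties `dbo:author` and `dbo:museum`, the resource name `dbr:Amedeo_Modigliani`, and the endpoint address are assumptions about how DBpedia exposes paintings. The code only builds the request URL and does not send it.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Assumed address of the public DBpedia SPARQL endpoint.
ENDPOINT = "https://dbpedia.org/sparql"

# Hypothetical query: paintings attributed to Modigliani and the museum holding each.
QUERY = """
PREFIX dbo: <http://dbpedia.org/ontology/>
PREFIX dbr: <http://dbpedia.org/resource/>
SELECT ?painting ?museum WHERE {
  ?painting dbo:author dbr:Amedeo_Modigliani ;
            dbo:museum ?museum .
}
"""

def build_request_url(endpoint: str, query: str) -> str:
    """Encode a SPARQL query in the 'query' parameter of an HTTP GET URL."""
    return endpoint + "?" + urlencode({"query": query.strip()})

url = build_request_url(ENDPOINT, QUERY)
```

Fetching `url` with any HTTP client would return the bindings for `?painting` and `?museum`, provided the assumed properties are populated for the paintings in question.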
![Page 13: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/13.jpg)
![Page 14: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/14.jpg)
Results of the Modigliani Test
• Atanas Kiryakov from Ontotext
• Used LDSR – Linked Data Semantic Repository
– DBpedia
– Freebase
– Geonames
– UMBEL
– WordNet
Published April 26, 2010: http://www.readwriteweb.com/archives/the_modigliani_test_for_linked_data.php
![Page 15: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/15.jpg)
![Page 16: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/16.jpg)
The Web of Data: why?
– encourages reuse
– reduces redundancy
– maximises its (real and potential) inter-connectedness
– enables network effects to add value to data
![Page 17: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/17.jpg)
The Web of Data: how?
– current state on the Web
• Relational Databases
• APIs
• XML
• CSV
• XLS
Computers can’t consume data because:
• Different formats & models
• Not inter-connected
![Page 18: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/18.jpg)
The Web of Data: how?
– we need to create a standard way of publishing Data on the Web (like HTML for docs)
This is the Resource Description Framework (RDF)
![Page 19: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/19.jpg)
Resource Description Framework (RDF)
• A data model
– A way to model data
– Inspired by relational databases and logic
• RDF is a triple data model
• Labeled graph (semantic networks)
• Subject, Predicate, Object
<Chios> <is part of> <Greece>
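The triple model can be sketched in a few lines of Python. This is only an illustration of the subject/predicate/object idea, not an RDF library: the `Triple` type and the `objects` helper are hypothetical names, and the second statement is invented for the example.

```python
from typing import NamedTuple

class Triple(NamedTuple):
    """One statement: subject, predicate, object."""
    subject: str
    predicate: str
    obj: str

# A graph is just a set of statements.
graph = {
    Triple("Chios", "is part of", "Greece"),   # from the slide
    Triple("Greece", "is part of", "Europe"),  # hypothetical extra statement
}

def objects(graph, subject, predicate):
    """Return every object linked to `subject` by `predicate`, sorted."""
    return sorted(t.obj for t in graph
                  if t.subject == subject and t.predicate == predicate)

parts = objects(graph, "Chios", "is part of")
```

Because every statement has the same three-part shape, graphs from different sources can be combined by a plain set union.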
![Page 20: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/20.jpg)
Example: Document on the Web
![Page 21: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/21.jpg)
Databases back up documents
| Isbn | Title | Author | PublisherID | ReleasedDate |
|---|---|---|---|---|
| 978-0-596-15381-6 | Programming the Semantic Web | Toby Segaran | 1 | July 2009 |
| … | … | … | … | … |

| PublisherID | PublisherName |
|---|---|
| 1 | O’Reilly Media |
| … | … |

This is a THING: a book titled “Programming the Semantic Web” by Toby Segaran, …
THINGS have PROPERTIES: a Book has a Title, an Author, …
![Page 22: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/22.jpg)
Data representation in RDF
<book> <title> "Programming the Semantic Web"
<book> <isbn> "978-0-596-15381-6"
<book> <author> "Toby Segaran"
<book> <publisher> <Publisher>
<Publisher> <name> "O’Reilly"
(Derived from the Book and Publisher table rows shown on the previous slide.)
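The step from table rows to a graph can be sketched as below. The row values come from the slide; the identifier scheme (`book:<isbn>`, `publisher:<id>`) and the function name `rows_to_triples` are assumptions made for illustration, and the release date is left out because the slide's graph omits it too.

```python
# Rows taken from the slide's Book and Publisher tables.
books = [{
    "Isbn": "978-0-596-15381-6",
    "Title": "Programming the Semantic Web",
    "Author": "Toby Segaran",
    "PublisherID": 1,
}]
publishers = [{"PublisherID": 1, "PublisherName": "O'Reilly Media"}]

def rows_to_triples(books, publishers):
    """Turn each table cell into a (subject, predicate, object) statement."""
    triples = []
    for pub in publishers:
        # Hypothetical identifier scheme: "publisher:<id>".
        triples.append((f"publisher:{pub['PublisherID']}", "name", pub["PublisherName"]))
    for book in books:
        subject = f"book:{book['Isbn']}"  # hypothetical identifier scheme
        triples.append((subject, "isbn", book["Isbn"]))
        triples.append((subject, "title", book["Title"]))
        triples.append((subject, "author", book["Author"]))
        # The foreign key becomes a link to another thing, not a plain value.
        triples.append((subject, "publisher", f"publisher:{book['PublisherID']}"))
    return triples

triples = rows_to_triples(books, publishers)
```

The design point is the last statement in the loop: a join column in the relational model turns into an edge between two nodes in the graph.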
![Page 23: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/23.jpg)
Everything on the web is identified by a URI!
![Page 24: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/24.jpg)
link the data to other data
<http://…/isbn978> <title> "Programming the Semantic Web"
<http://…/isbn978> <isbn> "978-0-596-15381-6"
<http://…/isbn978> <author> "Toby Segaran"
<http://…/isbn978> <publisher> <http://…/publisher1>
<http://…/publisher1> <name> "O’Reilly"
![Page 25: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/25.jpg)
consider the data from Revyu.com
<http://…/isbn978> <hasReview> <http://…/review1>
<http://…/review1> <description> "Awesome Book"
<http://…/review1> <reviewer> <http://…/reviewer>
<http://…/reviewer> <name> "Juan Sequeda"
![Page 26: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/26.jpg)
start to link data
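<http://isbn978> <title> "Programming the Semantic Web"
<http://isbn978> <isbn> "978-0-596-15381-6"
<http://isbn978> <author> "Toby Segaran"
<http://isbn978> <publisher> <http://publisher1>
<http://publisher1> <name> "O’Reilly"
<http://isbn978> <sameAs> <http://isbn978> (the book node of each dataset)
<http://isbn978> <hasReview> <http://review1>
<http://review1> <description> "Awesome Book"
<http://review1> <hasReviewer> <http://reviewer>
<http://reviewer> <name> "Juan Sequeda"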
![Page 27: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/27.jpg)
Juan Sequeda publishes data too
<http://juansequeda.com/id> <name> "Juan Sequeda"
<http://juansequeda.com/id> <livesIn> <http://dbpedia.org/Austin>
![Page 28: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/28.jpg)
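Let’s link more data
<http://…/isbn978> <hasReview> <http://…/review1>
<http://…/review1> <description> "Awesome Book"
<http://…/review1> <hasReviewer> <http://…/reviewer>
<http://…/reviewer> <name> "Juan Sequeda"
<http://…/reviewer> <sameAs> <http://juansequeda.com/id>
<http://juansequeda.com/id> <name> "Juan Sequeda"
<http://juansequeda.com/id> <livesIn> <http://dbpedia.org/Austin>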
![Page 29: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/29.jpg)
And more
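<http://…/isbn978> <title> "Programming the Semantic Web"
<http://…/isbn978> <isbn> "978-0-596-15381-6"
<http://…/isbn978> <author> "Toby Segaran"
<http://…/isbn978> <publisher> <http://…/publisher1>
<http://…/publisher1> <name> "O’Reilly"
<http://…/isbn978> <sameAs> <http://…/isbn978> (the book node of each dataset)
<http://…/isbn978> <hasReview> <http://…/review1>
<http://…/review1> <description> "Awesome Book"
<http://…/review1> <hasReviewer> <http://…/reviewer>
<http://…/reviewer> <name> "Juan Sequeda"
<http://…/reviewer> <sameAs> <http://juansequeda.com/id>
<http://juansequeda.com/id> <name> "Juan Sequeda"
<http://juansequeda.com/id> <livesIn> <http://dbpedia.org/Austin>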
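The effect of the sameAs links can be sketched as follows: three datasets are merged by set union, and sameAs statements collapse equivalent URIs into one node so that a question can be answered across sources. The `books.example` and `reviews.example` URIs are hypothetical stand-ins for the elided addresses on the slides, and `canonicalize` and `reviewer_homes` are illustrative helpers, not part of any standard.

```python
BOOK = "http://books.example/isbn978"          # hypothetical
REVYU_BOOK = "http://reviews.example/isbn978"  # hypothetical
REVIEW = "http://reviews.example/review1"      # hypothetical
REVIEWER = "http://reviews.example/reviewer"   # hypothetical
JUAN = "http://juansequeda.com/id"
AUSTIN = "http://dbpedia.org/Austin"

book_data = {(BOOK, "title", "Programming the Semantic Web"),
             (BOOK, "author", "Toby Segaran")}
review_data = {(REVYU_BOOK, "hasReview", REVIEW),
               (REVIEW, "description", "Awesome Book"),
               (REVIEW, "hasReviewer", REVIEWER),
               (REVIEWER, "name", "Juan Sequeda")}
personal_data = {(JUAN, "name", "Juan Sequeda"), (JUAN, "livesIn", AUSTIN)}
links = {(BOOK, "sameAs", REVYU_BOOK), (REVIEWER, "sameAs", JUAN)}

def canonicalize(triples):
    """Replace every URI by one representative of its sameAs group."""
    parent = {}

    def find(x):
        parent.setdefault(x, x)
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for s, p, o in triples:
        if p == "sameAs":
            a, b = find(s), find(o)
            if a != b:
                parent[max(a, b)] = min(a, b)  # smallest string is the representative
    return {(find(s), p, find(o)) for s, p, o in triples if p != "sameAs"}

def reviewer_homes(graph, book):
    """Where do the reviewers of `book` live?"""
    homes = set()
    for s, p, review in graph:
        if s == book and p == "hasReview":
            for s2, p2, reviewer in graph:
                if s2 == review and p2 == "hasReviewer":
                    homes.update(o for s3, p3, o in graph
                                 if s3 == reviewer and p3 == "livesIn")
    return homes

merged = canonicalize(book_data | review_data | personal_data | links)
unlinked = canonicalize(book_data | review_data | personal_data)
```

With the links in place, `reviewer_homes(merged, BOOK)` reaches the Austin URI; on `unlinked` the same question finds nothing, because no single dataset holds the whole path.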
![Page 30: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/30.jpg)
Linked data = internet + HTTP + RDF
![Page 31: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/31.jpg)
Linked Data Principles
1. Use URIs as names for things.
2. Use HTTP URIs so that people can look up (dereference) those names.
3. When someone looks up a URI, provide useful information.
4. Include links to other URIs so that they can discover more things.
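Looking up a name usually means an HTTP GET with content negotiation: the client asks for a machine-readable format through the `Accept` header. The sketch below only prepares such a request with the standard library and does not send it; the DBpedia URI and the `text/turtle` media type are example choices, and `build_lookup` is a hypothetical helper name.

```python
from urllib.request import Request

def build_lookup(uri: str, media_type: str = "text/turtle") -> Request:
    """Prepare an HTTP GET that asks for an RDF serialization of `uri`."""
    return Request(uri, headers={"Accept": media_type})

# Example URI, assumed to identify Chios on DBpedia.
request = build_lookup("http://dbpedia.org/resource/Chios")
```

Passing `request` to `urllib.request.urlopen` would perform the lookup; a server that follows the principles is expected to answer with a description of the thing and links to further URIs.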
![Page 32: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/32.jpg)
Web as a database
• Linked Data makes the web exploitable as ONE GIANT HUGE GLOBAL DATABASE!
• Is there any query language like SQL? SPARQL…
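What a SPARQL engine does with a basic graph pattern can be imitated in plain Python: each pattern is a triple in which terms starting with `?` are variables, and all patterns must match with consistent bindings. This is a toy matcher for intuition only, not SPARQL itself; the function names and the statements about Lesvos and Europe are made up for the example.

```python
graph = [
    ("Chios", "is part of", "Greece"),
    ("Lesvos", "is part of", "Greece"),  # hypothetical extra statement
    ("Greece", "is part of", "Europe"),  # hypothetical extra statement
]

def match(pattern, triple, binding):
    """Extend `binding` so that `pattern` equals `triple`, or return None."""
    new = dict(binding)
    for term, value in zip(pattern, triple):
        if term.startswith("?"):
            # A variable must keep the value it was bound to earlier.
            if new.setdefault(term, value) != value:
                return None
        elif term != value:
            return None
    return new

def query(graph, patterns):
    """Return every variable binding that satisfies all patterns."""
    bindings = [{}]
    for pattern in patterns:
        extended = []
        for binding in bindings:
            for triple in graph:
                result = match(pattern, triple, binding)
                if result is not None:
                    extended.append(result)
        bindings = extended
    return bindings

# "Which things are part of a country that is part of Europe?"
results = query(graph, [("?island", "is part of", "?country"),
                        ("?country", "is part of", "Europe")])
islands = sorted(b["?island"] for b in results)
```

The shared variable `?country` plays the role of a join condition in SQL, which is why the two-pattern query behaves like a self-join over the statement table.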
![Page 33: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/33.jpg)
The LOD cloud: May 2007
![Page 34: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/34.jpg)
Mar 2008
![Page 35: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/35.jpg)
Sept 2008
![Page 36: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/36.jpg)
Mar 2009
![Page 37: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/37.jpg)
![Page 38: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/38.jpg)
Fujitsu and DERI Revolutionize Access to Open Data by Jointly Developing Technology for Linked Open Data
![Page 39: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/39.jpg)
What is a Linked Data application/service?
A software system that makes use of data on the Web from multiple datasets and benefits from the links between those datasets
![Page 40: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/40.jpg)
Characteristics of Linked Data Applications
• Consume data that is published on the web following the Linked Data principles: an application should be able to request, retrieve and process the accessed data
• Discover further information by following the links between different data sources
• Combine the consumed linked data with data from other sources (not necessarily Linked Data)
• Expose the combined data back to the web following the Linked Data principles
• Offer value to end-users
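The "consume" and "discover" characteristics amount to following links from one description to the next. A minimal sketch, assuming a `lookup` function that returns the triples published at a URI: here the Web is mocked by a dictionary so the example runs offline, and the `reviews.example` URI and the `label` statement are hypothetical.

```python
from collections import deque

# Mock Web: URI -> triples served when that URI is looked up.
WEB = {
    "http://reviews.example/reviewer": [
        ("http://reviews.example/reviewer", "sameAs", "http://juansequeda.com/id"),
    ],
    "http://juansequeda.com/id": [
        ("http://juansequeda.com/id", "name", "Juan Sequeda"),
        ("http://juansequeda.com/id", "livesIn", "http://dbpedia.org/Austin"),
    ],
    "http://dbpedia.org/Austin": [
        ("http://dbpedia.org/Austin", "label", "Austin"),
    ],
}

def crawl(start, lookup, max_uris=10):
    """Breadth-first link following: look up a URI, collect its triples, queue new URIs."""
    seen, queue, collected = set(), deque([start]), []
    while queue and len(seen) < max_uris:
        uri = queue.popleft()
        if uri in seen:
            continue
        seen.add(uri)
        for triple in lookup(uri):
            collected.append(triple)
            obj = triple[2]
            if obj.startswith("http://") and obj not in seen:
                queue.append(obj)
    return collected

triples = crawl("http://reviews.example/reviewer", lambda uri: WEB.get(uri, []))
```

Swapping the lambda for a function that performs real HTTP lookups would turn the same loop into a small crawler; the `max_uris` cap is there because the real Web of Data has no natural end.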
![Page 41: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/41.jpg)
The 5 stars of open linked data
★ make your stuff available on the Web (whatever format)
★★ make it available as structured data (e.g. Excel instead of an image scan of a table)
★★★ use a non-proprietary format (e.g. CSV instead of Excel)
★★★★ use URLs to identify things, so that people can point at your stuff
★★★★★ link your data to other people’s data to provide context
http://lab.linkeddata.deri.ie/2010/star-scheme-by-example/
![Page 42: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/42.jpg)
Ideas for projects
1. Think of interesting questions
2. Search for related datasets
And start “playing” with:
• Interconnections: links to other datasets
• Statistical analysis
• Economic/business analysis
• Public policy analysis
![Page 43: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/43.jpg)
Interesting questions
• Where does public money go in a specific sector?
• Environment, education?
• To which companies?
![Page 44: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/44.jpg)
Questions??
![Page 45: Michali s Vafopoulos NTUA &](https://reader036.vdocuments.mx/reader036/viewer/2022062501/56816512550346895dd78cd1/html5/thumbnails/45.jpg)
More info
• Twitter: @vafopoulos
• [email protected]
• www.vafopoulos.org
• www.publicspending.net
• www.youtube.com/websciencegr