towards 5-star data in the e-university
DESCRIPTION
Several information of interest regarding data openness in the context of the (semantic) Web. Different world-wide and local examples are also included.TRANSCRIPT
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Dr. Sabin BuragaFaculty of Computer Science, “A. I. Cuza” of Iasi, Romania
www.purl.org/net/busaco @busaco4web
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
open participation
open data
open software
open app development
open web
open cloud
open (computing) hardware
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
World Wide Web = “a common information space
in which we communicate by sharing information”
Tim Berners-Lee (2013)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
URL – Uniform Resource Identifier
addressability
for example: http://www.slideshare.net/busaco/presentations/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
HTTP – HyperText Transfer Protocol
access to resources
a browser asks a Web server to provide a resource representation
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Client Web application Storage
(user interface) server/framework (data persistence)
Internet
(Web)
HTML (HyperText Markup Language), JSON, PDF, PNG,…
representation(s) of a resource
a Web page includes URLs to other resourceshypermedia
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Reusing & sharing data available on the Web
data access via a Web service
usually, by using an API
(Application Programming Interface)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Web servicespublic APIsmash-ups
www.programmableweb.com
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
coopen data
“A piece of content or data is open
if anyone is free to use, reuse, and redistribute it.”
http://opendefinition.org/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
> “If you have access to the data,
then you can achieve continuity
even if you don’t have access to
the underlying source of the application.
Open data is more important than open source. […]
Data persists, open data endures.”
Ian Davis, 2009
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
legal/technical openness
availability & access
reusing & sharing
universal participation
inter-operability
opendatahandbook.org
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Reusing data available on the Web
necessity of adopting a (re)use license
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
coReusing data available on the Web
necessity of adopting a (re)use license
fair use
public domain
copyleft
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
coReusing data available on the Web
necessity of adopting a (re)use license
Creative Commons
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
openness, transparency, respect
https://creativecommons.org/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Data availability
on the Web
as “opaque” document
(usually, using a proprietary format)
does not refer – via current Web technologies –
other resources of interest
Tom Health (2007)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Data availability
in the Web
assuring discoverability via hypermedia
uses open data models/formats
(e.g., HTML, XML, JSON, CSV, RDF etc.)
platform independent
Tom Health (2007)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Can we evaluate the data openness?
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
5 ★ Open Data
Tim Berners-Lee (2009)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
1-star data
the content is available on the Web – by using any
format – according to an open license
http://opendefinition.org/licenses/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
users can view, print, locally store,
and – eventually – modify the document
the document itself can be shared on the Internet
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
a PDF containing a scanned image ☹
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
the document could be easily published on the Web
in order to reuse the data kept into the document,
additional processing might be necessary
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
2-star data
additionally, the content must be available
as structured data (e.g., relations between entities)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
users can process the document by using, in most cases,
a proprietary software application
the document can be exported
into another (structured) format
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
a proprietary format
containing structured data ☹
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
the document can be easily published on the Web
data is still “locked” into the document +
processing is depending by a specific application
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
3-star open data
using an open (non-proprietary) format
to make data available on the Web
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
same content as HTML document ☺
<section><p>10:15 – 11:00</p><p>Towards 5-Star Data in the E-university</p><p>Presenter: Sabin Buraga</p>
</section>
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
data can be managed (viewed, processed, filtered,
converted, shared, reused, etc.) in any manner
important aspect: platform independence
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
the document is still rather simple to be published on Web
exporting data into a proprietary format
could be problematic
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
4-star open data
each “thing” (entity) of interest from the document
is denoted by a Web address – URL
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
data, information, and knowledge are identified via URLs
in order to be accessed and (re)used
RDF model (Resource Description Framework)W3C standards (1998, 2004, 2014)
www.w3.org/standards/semanticweb/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
machine-friendly RDF assertions ☺
<div resource="#busaco" typeof="Person"><a property="url" href="http://purl.org/net/busaco">
<span property="name">Sabin Buraga</span></a>
</div>
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
content publishing could be much difficult, employing
the adoption of the semantic Web – or Web of Data –
technologies, tools, and methodologies
data in the Weblong term implications
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
5-star open data
additionally, data is inter-connected to other
datasets, according to the linked data initiative
http://linkeddata.org/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
inter-connecting open datasets ☺
http://lod4all.net/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
possibility to discover other (related) data of interest
while consuming the datanetwork effect
other advantage: Web-based automatic reasoning
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
difficulties:
assuring data/knowledge consistency
problems related to slow adoption
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
5stardata.info
Michael Hausenblas (2012)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co★make your stuff available on the Web
(whatever format) under an open license
★★make it available as structured data
e.g., Excel instead of image scan of a table
★★★use non-proprietary formats
e.g., CSV (Comma Separated Values) instead of Excel
★★★★use Web addresses (URLs) to denote things,
so that people can point at your stuff
★★★★★link your data to other data – see http://datahub.io/ –
to provide context
Ed Summers (2010)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Several real-life examples
(in the academic context)?
Dr.
Sab
in B
ura
ga
www.purl.org/net/busa
co
augmenting the current Web search activities
Google knowledge graph – see schema.org & rdfa.info
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
freely available knowledge bases: DBpedia & Freebase
http://en.lodlive.it/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
open e-science – see myexperiment.org
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
open e-government
access to official data according to the openness scorehttp://data.gov.uk/data/search?openness_score=5
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
copromoting open data & open software
student workshops & contests co-organized by
Faculty of Computer Science – UAIC Romania
Design Jam Iasi (3 editions), Firefox OS App Day, Firefox
OS Hackathon, Open Source Iasi, Winter Web Workshopand many others
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
open access to various student projects & initiatives
Faculty of Computer Science – UAIC Romania
http://profs.info.uaic.ro/~stefan.negru/studentprojects/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
open access to various
educational resources
Faculty of Computer Science
UAIC Romania
http://profs.info.uaic.ro/~busaco/teach/
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co“Software – as a service or not – is just a container.
What makes software valuable has always been what
it does to data. Now, in the same spirit of SOA (Service
Oriented Architecture) and SaaS (Software As A Service),
a new concept is emerging, Data-as-a-Service – DaaS.”
Pete Soderling (2010)
Dr. S
abin
-Cor
nel
iuBura
ga–
ww
w.p
url.o
rg/n
et/b
usa
co
Dr. Sabin BuragaFaculty of Computer Science, “A. I. Cuza” of Iasi, Romania
www.purl.org/net/busaco @busaco4web