Visualizing Statistical Linked Data with Plone
5 stars open data
* on the web, open license
** structured
*** non-proprietary open format
**** uses URIs to denote things
***** provides context through links to other data
http://5stardata.info
Linked data
dereferenceable HTTP URIs
content negotiation
http://dbpedia.org/resource/Bucharest
http://dbpedia.org/data/Bucharest.jsonld (application/ld+json)
http://dbpedia.org/page/Bucharest (text/html)
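Content negotiation means one resource URI can lead to different representations depending on the Accept header. A minimal sketch using only the Python standard library, assuming the DBpedia URIs listed above still behave as described; the helper names `negotiated_request` and `fetch` are illustrative, not part of any library:

```python
from urllib.request import Request, urlopen

RESOURCE = "http://dbpedia.org/resource/Bucharest"


def negotiated_request(uri, media_type):
    """Build a GET request that asks for one representation of `uri`."""
    return Request(uri, headers={"Accept": media_type})


def fetch(uri, media_type, timeout=10):
    """Send the request and return (final URL, Content-Type header).

    Needs network access. urlopen follows redirects, so the final URL is
    whatever document the server chose for the requested media type.
    """
    with urlopen(negotiated_request(uri, media_type), timeout=timeout) as resp:
        return resp.geturl(), resp.headers.get("Content-Type")
```

Calling `fetch(RESOURCE, "application/ld+json")` and `fetch(RESOURCE, "text/html")` should, if the server still negotiates as on the slide, end at the data and page URLs respectively.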
Open Data Portals
Linked data producers
by research projects and web enthusiasts
conversion of existing data to RDF
increasingly adopted by the primary data producers
Open Data Portals
most of them based on CKAN (by OKFN)
open-data.europa.eu
catalog.data.gov
data.gov.ccTLD
datahub.io
http://lod-cloud.net
Statistical data
structurally different from other [linked] data
concerned with measures, indicators, series, time periods, statistical/geographical regions
multiple dimensions and measures
generic - place, time
or domain-specific
slices, aggregations, totals, denominators
typically distributed as datasets
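The dimension/measure structure above can be sketched with plain Python dictionaries. The observations, dimension names and figures below are invented for illustration only; they do not come from any real dataset:

```python
from collections import defaultdict

# Hypothetical observations: three dimensions (area, year, indicator)
# and one measure (value). All figures are made up.
OBSERVATIONS = [
    {"area": "RO", "year": 2013, "indicator": "ind_a", "value": 10.0},
    {"area": "RO", "year": 2014, "indicator": "ind_a", "value": 12.0},
    {"area": "BG", "year": 2013, "indicator": "ind_a", "value": 8.0},
    {"area": "BG", "year": 2014, "indicator": "ind_a", "value": 9.0},
]


def slice_cube(observations, **fixed):
    """Keep the observations whose dimensions match every fixed value."""
    return [o for o in observations
            if all(o[dim] == val for dim, val in fixed.items())]


def aggregate(observations, by, measure="value"):
    """Sum the measure for each value of the `by` dimension."""
    totals = defaultdict(float)
    for o in observations:
        totals[o[by]] += o[measure]
    return dict(totals)
```

Here `slice_cube(OBSERVATIONS, year=2013)` fixes the time dimension and leaves two observations, while `aggregate(OBSERVATIONS, by="area")` gives `{"RO": 22.0, "BG": 17.0}`. Summing only makes sense for additive measures; ratios and percentages need their denominators to be aggregated correctly.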
various RDF vocabularies for modeling datasets
XML-based standards
SDMX
Statistical Data and Metadata eXchange
probably the most widely used standard for statistical data exchange
adopted by major producers of statistical data (ECB, Eurostat, IMF, OECD, UNSD, UNESCO, World Bank)
http://ec.europa.eu/eurostat/web/sdmx-web-services
DSPL
Dataset Publishing Language
can be processed by Google Public Data Explorer
RDF vocabularies
Dublin Core Terms
DCAT http://www.w3.org/TR/vocab-dcat
catalogues, dataset metadata, distribution
VoID http://www.w3.org/TR/void
linked datasets
RDF Data Cube Vocabulary http://www.w3.org/TR/vocab-data-cube
built on top of existing vocabularies
focused on statistical data
integrates dataset metadata, structure, codelists and observations
uses the core SDMX Information Model
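To make the Data Cube model concrete, the sketch below writes a single observation as N-Triples-style lines. The `qb:` and SDMX terms are the published ones; the `http://example.org/` namespace, the function names and any values passed in are assumptions for illustration:

```python
QB = "http://purl.org/linked-data/cube#"
SDMX_DIM = "http://purl.org/linked-data/sdmx/2009/dimension#"
SDMX_MEASURE = "http://purl.org/linked-data/sdmx/2009/measure#"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
XSD_DECIMAL = "http://www.w3.org/2001/XMLSchema#decimal"
EX = "http://example.org/"  # hypothetical namespace for the observation URI


def observation_triples(obs_id, dataset, area, period, value):
    """Describe one observation: its dataset, two dimensions, one measure."""
    s = f"<{EX}obs/{obs_id}>"
    return [
        (s, f"<{RDF_TYPE}>", f"<{QB}Observation>"),
        (s, f"<{QB}dataSet>", f"<{dataset}>"),
        (s, f"<{SDMX_DIM}refArea>", f"<{area}>"),
        (s, f"<{SDMX_DIM}refPeriod>", f"<{period}>"),
        (s, f"<{SDMX_MEASURE}obsValue>", f'"{value}"^^<{XSD_DECIMAL}>'),
    ]


def to_ntriples(triples):
    """Serialise (subject, predicate, object) strings, one triple per line."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)
```

A real dataset would also carry a `qb:DataStructureDefinition` and code lists; they are left out here to keep the sketch short.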
Semantic interoperability
SPARQL - native query language of RDF knowledge bases
http://worldbank.270a.info/sparql
select * where
{
?s ?p <http://dbpedia.org/resource/Temperature>
}
<http://worldbank.270a.info/classification/variable/tas>
<http://www.w3.org/2004/02/skos/core#exactMatch>
http://worldbank.270a.info/classification/variable/tas
http://worldbank.270a.info/classification/variable
world-bank-climates/month-average-historical.html
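The lookup query above can be sent to the endpoint over the SPARQL Protocol, which accepts the query text as a `query` parameter on a GET request. A minimal sketch with the standard library, assuming the endpoint honours the standard JSON results media type; `sparql_get_request` is an illustrative name:

```python
from urllib.parse import urlencode, urlsplit, parse_qs
from urllib.request import Request

ENDPOINT = "http://worldbank.270a.info/sparql"

QUERY = """
SELECT * WHERE {
  ?s ?p <http://dbpedia.org/resource/Temperature>
}
"""


def sparql_get_request(endpoint, query):
    """Build a SPARQL Protocol GET request that asks for JSON results."""
    url = endpoint + "?" + urlencode({"query": query})
    return Request(url, headers={"Accept": "application/sparql-results+json"})
```

Passing the returned request to `urllib.request.urlopen` performs the actual call; that step needs network access and is left out of the sketch.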
SPARQL
PREFIX qb: <http://purl.org/linked-data/cube#>
PREFIX sdmx-dimension: <http://purl.org/linked-data/sdmx/2009/dimension#>
PREFIX owl: <http://www.w3.org/2002/07/owl#>
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX property: <http://worldbank.270a.info/property/>
PREFIX d-climates: <http://worldbank.270a.info/dataset/world-bank-climates/>
select ?value where {
  ?o a qb:Observation;
     qb:dataSet d-climates:month-average-historical;
     sdmx-dimension:refArea [owl:sameAs <http://dbpedia.org/resource/Romania>];
     property:variable [skos:exactMatch <http://dbpedia.org/resource/Temperature>];
     property:recurring-interval <http://reference.data.gov.uk/def/intervals/October>;
     property:month-average ?value
}
"10.191671" ^^<http://www.w3.org/2001/XMLSchema#decimal>
Observation page
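The typed literal returned by the query above arrives, in the W3C SPARQL JSON results format, as a binding with a `datatype` and a string `value`. The sketch below converts such bindings to `Decimal`. The sample payload is hand-written from the value on the slide, not captured from the endpoint, and `decimal_values` is an illustrative name:

```python
import json
from decimal import Decimal

XSD_DECIMAL = "http://www.w3.org/2001/XMLSchema#decimal"

# Hand-written sample in the SPARQL JSON results shape (an assumption,
# not a recorded response from the World Bank endpoint).
SAMPLE = """{"head": {"vars": ["value"]}, "results": {"bindings": [
  {"value": {"type": "literal",
             "datatype": "http://www.w3.org/2001/XMLSchema#decimal",
             "value": "10.191671"}}
]}}"""


def decimal_values(results_json, var="value"):
    """Return the xsd:decimal bindings of `var` as Decimal objects."""
    doc = json.loads(results_json)
    values = []
    for binding in doc["results"]["bindings"]:
        term = binding.get(var)
        # Skip unbound variables and literals of any other datatype.
        if term and term.get("datatype") == XSD_DECIMAL:
            values.append(Decimal(term["value"]))
    return values
```

Using `Decimal` rather than `float` keeps the value exactly as published.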
http://digital-agenda-data.eu
European Commission
Directorate General for Communications Networks, Content & Technology
Digital Agenda, Europe 2020 strategy
https://ec.europa.eu/digital-agenda/
supports emerging technologies
Technology stack
Plone 4.3 https://github.com/eaudeweb/scoreboard.buildout
Content types: dataset, visualisation
Chart configurator (Backbone.js)
SPARQL query generator uses eea.sparql and sparql-client
visualization library: Highcharts.js
triple store: Virtuoso Open Source
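The idea behind a SPARQL query generator can be sketched as a function that turns filter selections into a Data Cube query. This is not the project's actual generator (that code is in the repository linked above); the function names and the simple IRI check are assumptions made for illustration:

```python
def _iri(uri):
    """Wrap a URI in angle brackets, rejecting characters IRIs forbid."""
    if any(ch in uri for ch in '<>"{}|^`\\') or any(ch.isspace() for ch in uri):
        raise ValueError(f"not a safe IRI: {uri!r}")
    return f"<{uri}>"


def build_observation_query(dataset_uri, filters, measure_uri):
    """Build a SELECT query for the measure of matching observations.

    `filters` maps a dimension property URI to the selected value URI.
    """
    lines = [
        "PREFIX qb: <http://purl.org/linked-data/cube#>",
        "SELECT ?value WHERE {",
        "  ?o a qb:Observation ;",
        f"     qb:dataSet {_iri(dataset_uri)} ;",
    ]
    # Sort the filters so the same selection always yields the same text.
    for dimension, value in sorted(filters.items()):
        lines.append(f"     {_iri(dimension)} {_iri(value)} ;")
    lines.append(f"     {_iri(measure_uri)} ?value .")
    lines.append("}")
    return "\n".join(lines)
```

Generating the query server-side from a fixed template is one way to meet the "no SPARQL" requirement: users pick values in filters and never see the query text.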
User requirements
target statistical data / Data Cube Vocabulary
non-technical audience
simple and intuitive navigation
lots of explanatory notes, labels and metadata
100+ indicators inside a single large dataset
no SPARQL
embedding, export, share, bookmark, etc.
moderated comments
good looking charts
Basic CMS features
content types (dataset and chart)
publish, comment and moderation
theme
custom pages
Dataset metadata
Dataset structure
Sample column chart
Visualisations
Chart configurator
type of chart
single/multiple selection for dimensions
series
layout of filters
order and grouping in each filter
sorting of values
chart titles, tooltips, explanatory texts
legend and metadata from code lists
and more...
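The settings listed above end up as options for the charting library. A minimal sketch of that last step, assuming a simplified configuration; the option keys (`chart.type`, `title.text`, `xAxis.categories`, `series`) are standard Highcharts options, while the function name and the sample figures in the usage note are hypothetical:

```python
import json


def highcharts_options(chart_type, title, categories, series):
    """Build a Highcharts options dictionary.

    `series` maps a series name to a list of values aligned with
    `categories`.
    """
    return {
        "chart": {"type": chart_type},
        "title": {"text": title},
        "xAxis": {"categories": list(categories)},
        "series": [{"name": name, "data": list(data)}
                   for name, data in series.items()],
    }


def to_json(options):
    """Serialise the options for the JavaScript side."""
    return json.dumps(options)
```

For example, `highcharts_options("column", "Demo", ["RO", "BG"], {"2014": [12.0, 9.0]})` yields a dictionary that `to_json` can hand to the page for rendering a column chart with one series.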
Chart configurator
Hierarchical code list
Sample column chart
Sample line charts
Additional explanations and user interaction
Navigation widget
http://digital-agenda-data.eu
Digital Agenda Key Indicators
137 indicators, 518k observations, 4.2M triples
Digital Economy and Society Index
53 indicators, 6k observations, 51k triples
Lead Indicators for DG Connect policy priorities
31 indicators, 35k observations, 280k triples
Digital Economy and Society Index
Other resources
Use Cases and Lessons for the Data Cube Vocabulary
http://www.w3.org/TR/vocab-data-cube-use-cases
Technical information
http://digital-agenda-data.eu/documentation
Try it
test.digital-agenda-data.eu
https://github.com/eaudeweb/scoreboard.buildout
https://github.com/tenforce/vagrant-digital-agenda-scoreboard