visualising statistical linked data with plone

Post on 12-Apr-2017


Visualizing Statistical Linked Data with Plone

andrei.melis@eaudeweb.ro

5 stars open data

* on the web, open license

** structured

*** non-proprietary open format

**** uses URIs to denote things

***** provides context through links to other data

http://5stardata.info

Linked data

dereferenceable HTTP URIs

content negotiation

http://dbpedia.org/resource/Bucharest

http://dbpedia.org/data/Bucharest.jsonld (application/ld+json)

http://dbpedia.org/page/Bucharest (text/html)
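Content negotiation for the URIs above can be sketched in Python as follows. This is a minimal illustration using only the standard library; the helper name `negotiated_request` is hypothetical, and the requests are built but not sent, so nothing here depends on how the DBpedia server currently responds.

```python
from urllib.request import Request


def negotiated_request(uri, media_type):
    """Build (but do not send) a GET request asking for one representation."""
    return Request(uri, headers={"Accept": media_type})


# One resource URI, two requested representations.
RESOURCE = "http://dbpedia.org/resource/Bucharest"
json_ld_request = negotiated_request(RESOURCE, "application/ld+json")
html_request = negotiated_request(RESOURCE, "text/html")

# Passing either request to urllib.request.urlopen() would send it; a server
# that supports content negotiation is then expected to answer with, or
# redirect to, the document matching the Accept header.
```

Only the `Accept` header differs between the two requests, which is the point of the technique: the URI names the thing, the header picks the format.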

Linked data producers

by research projects and web enthusiasts

conversion of existing data to RDF

increasingly adopted by the primary data producers

Open Data Portals

most of them based on CKAN (by OKFN)

open-data.europa.eu

catalog.data.gov

data.gov.ccTLD

datahub.io

http://lod-cloud.net

Statistical data

structurally different from other [linked] data

concerned with measures, indicators, series, time periods, statistical/geographical regions

multiple dimensions and measures

generic - place, time

or domain-specific

slices, aggregations, totals, denominators

typically distributed as datasets

various RDF vocabularies for modeling datasets

XML-based standards
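The dimension and measure model described on this slide can be illustrated with a small Python sketch. Everything in it is an assumption made for illustration: the dimension names, the codes and the numbers are made-up placeholders, not real statistics, and the helpers `slice_cube` and `total_by` are hypothetical.

```python
from collections import defaultdict

# Each observation has a value for every dimension plus one measure.
# All codes and numbers are made-up placeholders, not real statistics.
OBSERVATIONS = [
    {"indicator": "ind_a", "area": "RO", "year": 2012, "value": 10.0},
    {"indicator": "ind_a", "area": "RO", "year": 2013, "value": 12.0},
    {"indicator": "ind_a", "area": "BG", "year": 2013, "value": 8.0},
    {"indicator": "ind_b", "area": "RO", "year": 2013, "value": 3.0},
]


def slice_cube(observations, **fixed):
    """Return the observations whose dimensions match every fixed value."""
    return [o for o in observations
            if all(o[dim] == val for dim, val in fixed.items())]


def total_by(observations, dimension):
    """Sum the measure over the observations, grouped by one dimension."""
    totals = defaultdict(float)
    for o in observations:
        totals[o[dimension]] += o["value"]
    return dict(totals)


# A slice: fix indicator and area, leaving a time series.
series = slice_cube(OBSERVATIONS, indicator="ind_a", area="RO")

# An aggregation: totals per area for one indicator and year.
totals = total_by(slice_cube(OBSERVATIONS, indicator="ind_a", year=2013), "area")
```

Fixing some dimensions and letting the others vary is what a slice means here; the dataset formats listed above mainly differ in how they serialise this structure.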

SDMX

Statistical Data and Metadata eXchange

probably the most widely used standard for statistical data exchange

adopted by major producers of statistical data (ECB, Eurostat, IMF, OECD, UNSD, UNESCO, World Bank)

http://ec.europa.eu/eurostat/web/sdmx-web-services

DSPL

Dataset Publishing Language

can be processed by Google Public Data Explorer

RDF vocabularies

Dublin Core Terms

DCAT http://www.w3.org/TR/vocab-dcat

catalogues, dataset metadata, distribution

VoID http://www.w3.org/TR/void

linked datasets

RDF Data Cube Vocabulary http://www.w3.org/TR/vocab-data-cube

built on top of existing vocabularies

focused on statistical data

integrates dataset metadata, structure, codelists and observations

uses the core SDMX Information Model

Semantic interoperability

SPARQL - native query language of RDF knowledge bases

http://worldbank.270a.info/sparql

select * where

{

?s ?p <http://dbpedia.org/resource/Temperature>

}

<http://worldbank.270a.info/classification/variable/tas>

<http://www.w3.org/2004/02/skos/core#exactMatch>

http://worldbank.270a.info/classification/variable/tas

http://worldbank.270a.info/classification/variable

world-bank-climates/month-average-historical.html
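As a sketch of how the first query above could be sent to the endpoint, the SPARQL Protocol allows a query to be passed as the `query` parameter of a GET request. The helper name `sparql_get_url` is hypothetical, and the code only builds the URL; whether the endpoint is still reachable is not assumed.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

ENDPOINT = "http://worldbank.270a.info/sparql"  # endpoint named on the slide

QUERY = """
select * where {
  ?s ?p <http://dbpedia.org/resource/Temperature>
}
"""


def sparql_get_url(endpoint, query):
    """Encode a query as a SPARQL Protocol GET request URL."""
    return endpoint + "?" + urlencode({"query": query.strip()})


url = sparql_get_url(ENDPOINT, QUERY)

# Round trip: decoding the URL gives back the original query text.
decoded = parse_qs(urlsplit(url).query)["query"][0]
```

The result format is normally chosen with an `Accept` header such as `application/sparql-results+json`, in the same way as the content negotiation shown earlier.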

SPARQL

PREFIX qb: <http://purl.org/linked-data/cube#>

PREFIX sdmx-dimension: <http://purl.org/linked-data/sdmx/2009/dimension#>

PREFIX owl: <http://www.w3.org/2002/07/owl#>

PREFIX skos: <http://www.w3.org/2004/02/skos/core#>

PREFIX property: <http://worldbank.270a.info/property/>

PREFIX d-climates: <http://worldbank.270a.info/dataset/world-bank-climates/>

select ?value where {

?o a qb:Observation;

qb:dataSet d-climates:month-average-historical;

sdmx-dimension:refArea [owl:sameAs <http://dbpedia.org/resource/Romania>];

property:variable [skos:exactMatch <http://dbpedia.org/resource/Temperature>];

property:recurring-interval <http://reference.data.gov.uk/def/intervals/October>;

property:month-average ?value

}

"10.191671"^^<http://www.w3.org/2001/XMLSchema#decimal>

Observation page
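The typed literal returned by the observation query can be turned into a number on the client side. The sketch below assumes the endpoint is asked for the SPARQL 1.1 Query Results JSON format; the sample payload is hand-written from the value shown on the slide, not captured from the endpoint, and `decimal_values` is a hypothetical helper.

```python
import json
from decimal import Decimal

XSD_DECIMAL = "http://www.w3.org/2001/XMLSchema#decimal"

# Hand-written sample in the SPARQL 1.1 Query Results JSON format,
# built from the value on the slide (not a captured server response).
SAMPLE = json.dumps({
    "head": {"vars": ["value"]},
    "results": {"bindings": [
        {"value": {"type": "literal",
                   "datatype": XSD_DECIMAL,
                   "value": "10.191671"}},
    ]},
})


def decimal_values(payload, var):
    """Extract the xsd:decimal literals bound to `var` as Decimal objects."""
    doc = json.loads(payload)
    values = []
    for binding in doc["results"]["bindings"]:
        term = binding.get(var)
        if term and term.get("datatype") == XSD_DECIMAL:
            values.append(Decimal(term["value"]))
    return values


values = decimal_values(SAMPLE, "value")
```

`Decimal` is used instead of `float` so that the lexical value of the literal is preserved exactly.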

http://digital-agenda-data.eu

European Commission

Directorate General for Communications Networks, Content & Technology

Digital Agenda, Europe 2020 strategy

https://ec.europa.eu/digital-agenda/

supports emerging technologies

Technology stack

Plone 4.3 https://github.com/eaudeweb/scoreboard.buildout

Content types: dataset, visualisation

Chart configurator (Backbone.js)

SPARQL query generator uses eea.sparql and sparql-client

visualization library: Highcharts.js

triple store: Virtuoso Open Source
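To give an idea of what a SPARQL query generator does, here is a minimal Python sketch that turns a set of selected dimension values into a Data Cube query. It is not the project's implementation and does not use the eea.sparql or sparql-client APIs; `build_observation_query` is a hypothetical helper, and the example URIs are taken from the World Bank query shown earlier.

```python
PREFIXES = "PREFIX qb: <http://purl.org/linked-data/cube#>\n"


def build_observation_query(dataset_uri, filters, measure_uri):
    """Build a SPARQL query selecting one measure from a Data Cube dataset.

    `filters` maps dimension property URIs to the URI of the selected value.
    URIs are inserted as-is; a real generator would validate or escape them.
    """
    lines = [
        PREFIXES + "SELECT ?value WHERE {",
        "  ?o a qb:Observation ;",
        "     qb:dataSet <%s> ;" % dataset_uri,
    ]
    # Sorted so that the same selection always yields the same query text.
    for dimension, value in sorted(filters.items()):
        lines.append("     <%s> <%s> ;" % (dimension, value))
    lines.append("     <%s> ?value ." % measure_uri)
    lines.append("}")
    return "\n".join(lines)


query = build_observation_query(
    "http://worldbank.270a.info/dataset/world-bank-climates/month-average-historical",
    {"http://worldbank.270a.info/property/recurring-interval":
         "http://reference.data.gov.uk/def/intervals/October"},
    "http://worldbank.270a.info/property/month-average",
)
```

Each filter chosen in the user interface becomes one triple pattern, which is how the audience can explore the data without writing SPARQL themselves.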

User requirements

target statistical data / Data Cube Vocabulary

non-technical audience

simple and intuitive navigation

lots of explanatory notes, labels and metadata

100+ indicators inside a single large dataset

no SPARQL

embedding, export, share, bookmark, etc.

moderated comments

good looking charts

Basic CMS features

content types (dataset and chart)

publish, comment and moderation

theme

custom pages

Dataset metadata

Dataset structure

Sample column chart

Visualisations

Chart configurator

type of chart

single/multiple selection for dimensions

series

layout of filters

order and grouping in each filter

sorting of values

chart titles, tooltips, explanatory texts

legend and metadata from code lists

and more...
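The settings listed above end up as options for the charting library. As a rough sketch, query results could be grouped into a Highcharts-style options object like this; the function `highcharts_options` is hypothetical, the rows are made-up placeholders, and only the widely used `chart.type`, `title.text`, `xAxis.categories` and `series` options are assumed.

```python
import json


def highcharts_options(rows, chart_type, title):
    """Group (series, category, value) rows into Highcharts-style options."""
    categories = sorted({category for _, category, _ in rows})
    by_series = {}
    for series, category, value in rows:
        by_series.setdefault(series, {})[category] = value
    return {
        "chart": {"type": chart_type},
        "title": {"text": title},
        "xAxis": {"categories": categories},
        "series": [
            # Missing points become None (null in JSON), i.e. a gap.
            {"name": name, "data": [points.get(c) for c in categories]}
            for name, points in sorted(by_series.items())
        ],
    }


# Made-up placeholder rows, not real statistics.
ROWS = [("RO", "2012", 10.0), ("RO", "2013", 12.0), ("BG", "2013", 8.0)]
options = highcharts_options(ROWS, "column", "Placeholder indicator")
options_json = json.dumps(options)  # ready to hand to the browser
```

Keeping the chart definition as plain data is one way a configurator can store it in the CMS and replay it for every visitor.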

Hierarchical code list

Sample column chart

Sample line charts

Additional explanations and user interaction

Navigation widget

http://digital-agenda-data.eu

Digital Agenda Key Indicators

137 indicators, 518k observations, 4.2 M triples

Digital Economy and Society Index

53 indicators, 6k observations, 51k triples

Lead Indicators for DG Connect policy priorities

31 indicators, 35k observations, 280k triples

Other resources

Use Cases and Lessons for the Data Cube Vocabulary

http://www.w3.org/TR/vocab-data-cube-use-cases

Technical information

http://digital-agenda-data.eu/documentation

Try it

test.digital-agenda-data.eu

https://github.com/eaudeweb/scoreboard.buildout

https://github.com/tenforce/vagrant-digital-agenda-scoreboard
