lod2 review meeting

Download Lod2 review meeting

If you can't read please download the document

Upload: andreea-bonea

Post on 16-Apr-2017

6.058 views

Category:

Technology


0 download

TRANSCRIPT

Introducing a New Product

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page

http://lod2.eu

[email protected]
[email protected]@i2g.pl

Luxembourg, Sep 14, 2012WP9: Use Case 3: LOD2 for Citizens

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

Year 2 Deliverables (OKFN)D 9.1.1. Report on first release of the Publicdata.eu website Improvements to Publicdata.eu during the past year

D 9.3.1. Presentation on publishing Linked Data

D 9.3.2. Guide to publishing Linked Data

D 9.4 Report on publication of eGovernment Linked Open Data

Addressing Y1 Review Comments

Next steps

D 9.2.1. Further technical improvements to Publicdata.eu (personalization features)

Community engagement with Publicdata.eu

Year 2 Deliverables (Serbian CKAN, Instytut Informatyki Gospodarczej)D 9.5.1. Establishment of the Serbian CKAN

D 9.6. Requirements and Resources used by the Polish Ministry of Economy

Next stepsD 9.7.1. Adaptation of the LOD2 stack for Polish Ministry of Economy

Agenda

Hello everyone,My name is . and I am the new PM overseeing some sections of WP9 from OKFN's side. This presentation includes the work that was spearheaded by OFKFN, as well as Serbian CKAN (9.5.) and the the Requirements and Resources for the Polish Ministry of Economy (9.6.)The slides were drafted to highlight some of the work that was done for WP9 during the course of this year. Slide 2 showcases the deliverables that are part of this WP

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

WP9 Objectives

The purpose of this PublicData.eu use case is to increase public access to high-value, machine-readable datasets generated by the European, national as well as regional governments and public administrations.

Hello everyone,My name is . and I am the new PM overseeing some sections of WP9 from OKFN's side. This presentation includes the work that was spearheaded by OFKFN, as well as Serbian CKAN (9.5.) and the the Requirements and Resources for the Polish Ministry of Economy (9.6.)The slides were drafted to highlight some of the work that was done for WP9 during the course of this year. Slide 2 showcases the deliverables that are part of this WP

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page

http://lod2.eu

Year 2 Deliverables (OKFN)

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

D 9.1.1. First release of Publicdata.eu

Submitted a thorough report summarizing our work on Publicdata.eu it's existing features, previous launches and plans for future improvements: http://svn.aksw.org/lod2/D9.1.1/

Year 2 Deliverables

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

Year 2 Deliverables

Publicdata.eu Overview

PublicData.eu is pan-European data catalogue and federation mechanism, developed by OKF as part of WP9. Based on the CKAN open-source data portal software, the site is a use case for the citizen aiming to make data as accessible and re-usable as possible. It is a read-only aggregation of both official and community data portals across the EU.






Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

6

Year 2 Deliverables

Key Stats

Publicdata.eu provides robust search, filtering and previewing tools

It currently houses 17027 data sets, harvested from 18 data catalogues, and it provides the option to browse data sets by top level categories

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

6

Technical improvements to Publicdata.eu during the past year

In March 2012 we upgraded PublicData.eu to CKAN version 1.6, adding the data preview functionality (powered by Recline), improvements to search, interface improvements to dataset pages, newly added resource (file) pages and group pages.

We also re-ran all the harvesters to have the most up to date set of datasets. Some catalogues have been migrated to groups on thedatahub.org and therefore can't currently be harvested without also including non EU datasets. In the future we may resolve this by extending the harvester to allow us to specify which groups or tags should be harvested. This would allow us to import relevant datasets from thedatahub.org without importing non EU datasets.

Many new CKAN instances have recently been launched by various countries, which we plan to include in publicdata.eu for the intermediate launch (August 2013)

Year 2 Deliverables

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

6

Year 2 Deliverables

Amount of RDF data in publicdata.euWe added the Serbian CKAN to Publicdata.eu bringing their RDF data

Since Publicdata.eu is currently only a read-only portal, we have focused on encouraging the source catalogs to increase their RDF dataour deliverables of 9.3.1 and 9.3.2 facilitate this (presentation & guide on publishing linked open data)

worked with consortium partners to produce more RDF data for the eGovernment report (will cover in next slides)

In future launches we will allow users to add data, meaning we can add converted datasets to be accessible through publicdata.eu

Additionally we plan to improve our harvesting, allowing us to harvest groups to increase the opportunities for what datasets can be added to publicdata.eu (including groups on thedatahub.org)

Addressing Year 1 Review meeting comments





Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

6

Year 2 Deliverables

D 9.3.1. Presentation on publishing Linked Open Data





Overview

In Feb 2012, OKFN put together a best practices presentation regarding the publishing, linking and utilizing Open Data. The presentation is easy accessible for the non technical eye and details the economic, transparency, policy, and efficiency benefits for Governments to publish open data. Other aspects such as licensing, registering and getting the data online are also included in this presentation.

This is aimed to be a detailed resource that anybody can use when referring to Linked Open Data.

Current stateThe presentation can be found here:

[http://svn.aksw.org/lod2/D9.3/D9.3.1/D9.3.1-presentation.pdf]

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

6

Year 2 Deliverables

OverviewThe Guide [http://svn.aksw.org/lod2/D9.3/D9.3.2/] is written in clear, non-technical language and introduces the reader to the concepts, rationale and tools of Linked Open Data, as well as providing a high-level overview of the publishing process. It has a particular focus on public-sector data, and aims to arm decision-makers with an understanding of Linked Data and the steps necessary to start publishing it.

Expected Impact

The guide will be published on the OKFN website [http://lod2.okfn.org/] where it is hoped that it will become a standard reference document, helping organizations that need to make decisions about whether and how to publish Linked Open Data.

D 9.3.2. Guide to publishing Linked Open Data





Section 1 rehearses the arguments of Open Data (how Governments are moving towards making their data available freely) whereas section 2 provides a full non technicalexplanation of Linked Data (concepts such as the 5 stars of LOD are presented) . Section 3 refers to the LOD life cycle (explains high level concepts such RDF, schemas, triple stores aso). Section 4 describes a step by step way to publish LOD and describes the tools in the LOD2 Stack. Step 5 presents some of the case studies of LOD2. Most known are the EC Financial Transparency System (all grants from the EC since 2007), the Global Health Observatory data set (stats for monitoring public health), Digital Agenda Scoreboard shows progression of countries in relation to DAE, Legal Thesauri by Wolters Kluewer (commercial publisher of legal info)

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

6

Year 2 Deliverables

D 9.4. Report on publication of eGovernment Linked Open Data





OverviewThe report [http://svn.aksw.org/lod2/D9.4/] summarizes our assessment of the current state of linked data publishing by European Governments and organizations;

Additionally it highlights some of the work that LOD2 partners have been doing to publish more linked data (Publink initiative, Guides and Documentation)

The report details some of the benefits of publishing linked open data as well as the current technical and legal barriers preventing the publishing of more linked data and our proposed approach to increasing the amount of high quality linked data published during the next phase of the LOD2 project.

The report contains two Appendices:

Appendix A - a collection of 9 use cases, showcasing the benefits of LODAppendix B - presenting theLODStats system developed for high performance statistical analysis

Several public authorities(such as: UK Government White Paper, EC commissioner Neelie Kroes) are acknowledging the benefits of LOD (Organizations meet the transparency requirements, and more meeting is provided to data sets by placing them in context with other datasets); in this respect the EC also funded the LATC project that converted approx 20 sets over the past years.

DatasetProject URLTriples

WHOs Global Health Observatoryhttp://gho.aksw.org/273k

European Digital Agenda Scoreboardhttp://data.lod2.eu/scoreboard/127k

National Accounts Linked Data for the UK and Serbiahttp://ukstatistics.lod2.eu/

http://rs.ckan.net/dataset/rzs-national-accounts645k (UK)

10 million (Serbia)

World Bank Data as Linked Datahttp://csarven.ca/statistical-linked-dataspaces165 million

German Labour Law & Courts Thesaurihttp://vocabulary.wolterskluwer.de/150k

German Federal Ministry of Financehttp://data.lod2.eu/gfmf/ 2 million

UK public data setshttp://thedatahub.org/en/dataset/uk-gdp-since-1948http://thedatahub.org/en/dataset/epims-lod2http://thedatahub.org/en/dataset/uk-criminal-justice12 million (total)

LinkedGeoDatahttp://linkedgeodata.org 20 billion

Wiktionaryhttp://wiktionary.dbpedia.org 100 million

Czech tender datahttp://ld.opendata.cz:8900/sparql 1071859

Appendix A Open Data releases

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page

http://lod2.eu

Next Steps (OKFN)

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

Next Steps

D 9.2.1. Further technical improvements to Publicdata.eu

Improvements scheduled for the Dec 2012 release (Further personalization features)

Datasets ratings

Allow users to add/revise their own data sets

User tools to enable mash-ups and visualization of data

App marketplace for users to upload their own visualizations, stories and apps

Allow user commenting on datasets

Activity streams and follow support (i.e. allowing users to subscribe to activity updates)

Social / sharing buttons

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 . Page

http://lod2.eu

Next Steps

D 9.2.1. Further technical improvements to Publicdata.eu

Improvements scheduled for the Aug 2013 interim release

CKAN core technology improvements (Harvesting)Optimize & automate the harvesting process

Add further harvesters (to increase number of data and coverage)

Ability to only harvest changed data

Ability to harvest part of a site (e.g. a particular group vs whole catalog)

Additional featuresAdding more advanced multilingual capabilities to the portal to support its Europe-wide coverage

Add upgraded triple store and SPAQRL endpoint

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 10 Page

http://lod2.eu

Next Steps

Community Engagement for Publicdata.eu

We can divide our community building and engagement strategy around PublicData.eu into two main clusters: supply & demand

Objectives on the supply side:

Engage more with data publishers by building (a) a stronger community of official representatives and data catalogue maintainers around PublicData.eu and (b) consensus around key legal and technical standards (e.g. making metadata explicitly open, enabling data catalog interoperability)

Establish datacatalogs.org as the de facto place to go to find out about data catalogs around the world - and encourage data catalog maintainers and other official contact points to maintain up to date information about national, regional and local catalogs, and lists of catalogs

Creating Knowledge out of Interlinked Data

LOD2 Event . 06.09.2010 11 Page

http://lod2.eu

Next Steps

Community Engagement for Publicdata.eu

Objectives on the demand side:

To build a stronger and better connected community of open data re-users from across EU27 around PublicData.eu

Continue to identify and pursue opportunities to engage with the Linked Data community and to use the LOD2 Stack to publish Linked Data derived from PublicData.eu.

We are hoping to achieve this by performing the following activities:

-Organize 2-4 OKFN Labs sprints per year on a variety of different topics and disseminateresults via press releases, media contacts and partners -Promote PublicData.eu at events, workshops and hackdays across EU27 -Dissemination via blogs, guest posts and articles on third party sites, and press releases

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page

http://lod2.eu

Year 2 Deliverables (Serbian CKAN, Instytut Informatyki Gospodarczej)

SORSServer 1

LOD2

Online
dissemination
DB

XSLT

Server 2Serbian CKAN

RDFCKAN

publishing

search

Server 3Publicdata.euCKAN

import

search

Code listsLOD2

http://elpo.stat.gov.rs/lod2/RS-DATAhttp://elpo.stat.gov.rs/lod2/RS-DIC http://rs.ckan.net search

D 9.5.1 Establishing Serbian CKAN - Infrastructure for Public Sector Information

Year 2 Deliverables

National accounts

Prices

Usage of ICT

Science, Technology and Innovations

Goal

Identify the requirements of Polish Ministry of Economy for publication of the data

Analyze changes of data over time, temporal and topical scope

Prepare for adoption of LOD2 Stack for publication of Ministry data

Status

Delivered on time for M20

Work continues on Task 9.7

D9.6 Requirements and resources used by the Polish Ministry of Economy

Year 2 Deliverables

http://data.gov.pl Current State

Year 2 Deliverables

Need for data

Requirements Querying

INSIGOSInternet System for Business Information

Access to statistical data concerning economy and foreign trade

POLGOS - presentation of comparative data concerning Polish economy

HZ - information about Polish foreign trade

ENERGY mission: energy security

Challenges - Multidimensional database (data ware House) - Possible linking to source for drilling-down - Not up to date probably needs a supplementing process - ENERGY many files in ugly-formatted Excel files

D9.6 Key Points

Year 2 Deliverables

CEIDGCentral Register and Information on Economic Activity

access to data concerning natural persons businesses

references to other registries

ca. 2.9 million records

Challenges -data is not clean -available via API -dynamic data set: ~1000/1000 applications for de-/registration daily -snapshots evolution phase of LOD2 LifecyclePublic procurement data - Pulished in XML, volume of data in 2011 alone: 828MB

D9.6 Key Points

Year 2 Deliverables

No sophisticated tools used at MoE

Groups of requirements

Data Acquisition 7 requirements

Data Processing/Transformation 2 requirements

Publication 3 requirements

Data Analysis 2 requirements

Alignment with LOD2 Life Cycleall 8 phases seem to be important but

Alignment with LOD2 Stackcrucial components identified

D2R/Triplify, Virtuoso, CKAN, PoolParty, Ontowiki, Silk, Sigma

D9.6 Requirements - Summary

Year 2 Deliverables

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page

http://lod2.eu

Next Steps (Instytut Informatyki Gospodarczej)

Task 9.7 Adoption of the LOD2 Stack for Polish economy data (I2G)

Goaladaptation of the LOD2 Stack to the requirements of Polish Ministry of Economy

identification of crucial components and how to configure and link them

Statusfirst deliverable D9.7.1 scheduled for M30

identification of existing functionalities in the working infrastructure

first vocabularies linked using Silk Workbench

Next stepsfinishing and cleaning vocabulary

design of data model using SDMX vocabulary

filling in the model

Establishing the Polish CKAN

Next Steps

Creating Knowledge out of Interlinked Data

LOD2 Presentation . 02.09.2010 . Page

http://lod2.eu

Thank you for your attention!

EU-FP7 LOD2 WP10 22.-23.9.2011. 02.09.2010 . Page http://lod2.eu

Creating Knowledge out of Interlinked Data

EU-FP7 LOD2 WP6 13.-14.09.2012. Page http://lod2.eu

Creating Knowledge out of Interlinked Data

Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline LevelSeventh Outline LevelEighth Outline Level

Ninth Outline LevelMastertextformat bearbeitenZweite Ebene Dritte Ebene

Vierte Ebene

Fnfte Ebene

Click to edit the title text formatMastertitelformat bearbeiten

EU-FP7 LOD2 Project Overview . Page http://lod2.eu

Creating Knowledge out of Interlinked Data

EU-FP7 LOD2 Project Overview . Page http://lod2.eu

Creating Knowledge out of Interlinked Data

Click to edit the outline text formatSecond Outline LevelThird Outline LevelFourth Outline LevelFifth Outline LevelSixth Outline LevelSeventh Outline LevelEighth Outline Level

Ninth Outline LevelMastertextformat bearbeitenZweite Ebene Dritte Ebene

Vierte Ebene

Fnfte Ebene

Click to edit the title text formatMastertitelformat bearbeiten