LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck

Creating Knowledge out of Interlinked Data The Open Government Data Stakeholder Survey Michael Martin, University of Leipzig (Germany) Martin Kaltenböck, Semantic Web Company (Austria) Sören Auer (University of Leipzig), Helmut Nagy (Semantic Web Company) OKCon2011 - Berlin, 30.06. 2011 These slides are published under : http://creativecommons.org/licenses/by/3.0


DESCRIPTION

Slides of the presentation by Michael Martin (ULEI, INFAI) and Martin Kaltenböck (Semantic Web Company) at the OKCon2011 in Berlin on 30th of June 2011: The LOD2 Open Government Data Stakeholder Survey

TRANSCRIPT

Page 1: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck

Creating Knowledge out of Interlinked Data

The Open Government Data Stakeholder Survey Michael Martin, University of Leipzig (Germany)

Martin Kaltenböck, Semantic Web Company (Austria) Sören Auer (University of Leipzig), Helmut Nagy (Semantic Web Company)

OKCon2011 - Berlin, 30.06. 2011

These slides are published under : http://creativecommons.org/licenses/by/3.0

Page 2: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck

LOD2 OGD Stakeholder Survey 30.06. 2011 http://lod2.eu

Agenda

• LOD2 Project & the OGD Stakeholder Survey

• Results of the OGD Stakeholder Survey

• Publishing the Stakeholder Survey as LOD

• Conclusion & Outreach - What's next

Page 3: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


OGD Stakeholder Survey: Results

LOD2: Creating Knowledge out of Interlinked Data

Page 4: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


LOD2 in a Nutshell

Creating Knowledge out of Interlinked Data

Research focus
• Very large RDF data management
• Enrichment & Interlinking
• Fusion & Information Quality
• Adaptive User Interfaces

3 Use Cases
• Media & Publishing
• Linked Enterprise Data
• Open Government Data

10 partners from 7 countries
• University of Leipzig, Germany
• DERI Galway, Ireland
• FU Berlin, Germany
• Semantic Web Company, Austria
• OpenLink Software, UK
• TenForce, Belgium
• Exalead, France
• Wolters Kluwer, Germany
• Open Knowledge Foundation, UK
• CWI, Netherlands

Page 5: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Page 6: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


3 Use Cases in LOD2

ENTERPRISE APPLICATIONS (Exalead, France)
Objective: Applying Linked Data technologies in an enterprise stack to support Human Resources (HR) related issues.

MEDIA & PUBLISHING (Wolters Kluwer Germany)
Objective: Supporting content-related production workflows in the media & publishing industry.

OPEN GOVERNMENT DATA (Open Knowledge Foundation, UK)
Objective: Improving accessibility, findability & reusability of Open Government Data in Europe: publicdata.eu

Page 7: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Open Government Data: an ideal testbed for Linked Data?

Close cooperation with W3C eGov IG, OKFN’s OpenEUdata, PSI & grassroots efforts

CKAN.org | semic.eu | EIF European Interoperability Framework | ICT2010 Networking Session

UIs and Personalization

Individual mashups of data with other sources

Notification/subscription service based on personal preferences

Transparency wishlists, upload revisions, derivatives

Create and publish queries, reports and visualisations

Single Point of Access: European registry & collaboration for open government data

Outreach & involve data providers - local, regional, national and European

Page 8: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


LOD2 Open Government Data Use Case: publicdata.eu

Page 9: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


The LOD2 Open Government Data (OGD) Stakeholder Survey

Page 10: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


WHY

• Involve OGD community in Europe (& worldwide) in publicdata.eu process

• Ask for their needs & requirements in the area of OGD & publicdata.eu

• Use results for requirements elicitation for the LOD2 use case: publicdata.eu

HOW

• Set up by OKFN & SWC with support from DERI, Wolters Kluwer and ULEI

• Easy-to-use online survey tool (surveygizmo.com)

• Promoted via blogs, mailings, mailing lists and additional viral marketing channels, as well as at related events in Europe & by the EC

• Duration: 5 weeks

• 329 participants

• Results available since May 2011: http://survey.lod2.eu/

• Published in HTML, PDF & raw survey data in CSV & RDF for re-use under CC-BY license

The LOD2 OGD Stakeholder Survey

Page 11: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Interest in "domains of data"

Across the complete results, "Geospatial information", "Scientific data" and "Environment" data are the top-ranked domains.

Page 12: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Currently used formats

Regarding data formats, the results show that the current "traditional" formats such as HTML, PDF, CSV/XLS and DOC/RTF are preferred.


Page 13: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Requested (future) formats

In general, the results also show that formats like "XML", "RDF" and "APIs" are expected to become more important in future. All user types appear to consider DOC, RTF and PDF unsuitable for a future Open Government Data infrastructure. Because JSON, RSS and YAML were not offered as options in this list, respondents asked for them in the free comment field.


Page 14: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


From where should the data come?

The results show that "national" data is most important followed by "regional", "EU-wide" and "worldwide" data.


Page 15: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


What is important for quality assurance?

The complete results show that "provenance/source of data" ranks highest, followed by "format of data", "completeness of meta data", "ranking / comments by users" and "official certificates".


Page 16: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


What features does an EU data catalogue need?

The complete results show that "providing raw datasets", "information about versioning of data sets" and "searching, exploring, grouping and clustering of data sets" are the features which are "expected to have" while "crowd sourcing mechanism (e.g. data repair)", "alerts on regional information" and "analysis and visualisation tools" have the highest rating in the "like to have" category.


Page 17: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


What is also expected on publicdata.eu?

For this question, "white papers & best practice", "news on Open Government Data" and "use cases & success stories" are "expected to have", while "ideas for apps", "events" and again "use cases & success stories" are ranked highest in the "like to have" category.


Page 18: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Publishing the Survey as RDF

Page 19: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


- Questionnaire and results of the survey are published as HTML, CSV and PDF

- These formats are meant only for human readers and represent use-case-specific views

Publishing the Survey as RDF


Page 20: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


- Publishing in RDF gives further benefits:

- Aggregating data to enable other user(-generated) aspects (SPARQL)

- Interlinking the data with other datasets …

- … which enables (advanced) users to create more complex queries (aggregating resources from non-local information spaces like DBpedia)

- Schema and data can be queried together
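As a rough illustration of the last point, the sketch below keeps schema statements (a question and its label) and data statements (answers) in one set of triples and aggregates over both with the same pattern-matching function. It is a toy stand-in for a SPARQL query, not the project's actual setup: the prefixed names (`sv:`, `q:`, `a:`), the question label and the three sample answers are all invented for the example.

```python
from collections import Counter

# All identifiers and values below are hypothetical; the slides do not
# spell out the terms of the real survey vocabulary.
TRIPLES = {
    # Schema-level statements (the questionnaire)
    ("q:domain", "rdf:type", "sv:Question"),
    ("q:domain", "rdfs:label", "Which domains of data interest you?"),
    # Data-level statements (the responses)
    ("a:1", "sv:toQuestion", "q:domain"),
    ("a:1", "sv:value", "Geospatial information"),
    ("a:2", "sv:toQuestion", "q:domain"),
    ("a:2", "sv:value", "Environment"),
    ("a:3", "sv:toQuestion", "q:domain"),
    ("a:3", "sv:value", "Geospatial information"),
}


def match(s=None, p=None, o=None):
    """Return all triples matching a pattern; None acts as a wildcard."""
    return [
        t for t in TRIPLES
        if (s is None or t[0] == s)
        and (p is None or t[1] == p)
        and (o is None or t[2] == o)
    ]


def answer_counts_by_question_label():
    """Join schema (question labels) with data (answers) and count values."""
    counts = {}
    for question, _, label in match(p="rdfs:label"):
        values = Counter()
        for answer, _, _ in match(p="sv:toQuestion", o=question):
            for _, _, value in match(s=answer, p="sv:value"):
                values[value] += 1
        counts[label] = dict(values)
    return counts


if __name__ == "__main__":
    print(answer_counts_by_question_label())
```

Because the question label and the answers live in the same graph, one lookup path leads from the human-readable question text to the counted answer values, which is not possible with the CSV export alone.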

Publishing the Survey as RDF


Page 21: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


RDF-Schema creation to represent Surveys

Publishing the Survey as RDF

http://ns.aksw.org/survey/


Page 22: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Transformation of the Data

Questionnaire delivered as PDF containing 5 survey sections, 60 questions and 221 options for single-/multiple-choice questions -> modeled with the survey vocabulary

Result set of the survey tool delivered as CSV, which is transformed with PHP

Creating RDF resources for every:
- Participant: Response (329)
- Answers to Questions (12891)

Overall ~ 70,000 triples
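The transformation step described above can be sketched roughly as follows. The slides state that the real conversion was done in PHP; this Python version only illustrates the idea of creating one resource per response and one per answer, serialized as N-Triples. The base URI, the vocabulary terms (`Response`, `Answer`, `inResponse`, `toQuestion`, `value`), the column names and the two sample rows are assumptions made for the example and are not taken from the survey vocabulary at http://ns.aksw.org/survey/.

```python
import csv
import io

RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
# Hypothetical namespaces; the real term names are not given in the slides.
VOCAB = "http://example.org/survey-vocab#"
BASE = "http://example.org/ogd-survey/"

# Tiny invented stand-in for the survey tool's CSV export.
SAMPLE_CSV = """response_id,q1_domain,q2_format
1,Geospatial information,CSV/XLS
2,Environment,RDF
"""


def literal(value):
    """Escape a string as a plain N-Triples literal."""
    escaped = (
        value.replace("\\", "\\\\")
        .replace('"', '\\"')
        .replace("\n", "\\n")
        .replace("\r", "\\r")
    )
    return f'"{escaped}"'


def csv_to_ntriples(text):
    """Create one resource per response and one per answered question."""
    triples = []
    for row in csv.DictReader(io.StringIO(text)):
        response = f"<{BASE}response/{row['response_id']}>"
        triples.append(f"{response} <{RDF_TYPE}> <{VOCAB}Response> .")
        for column, value in row.items():
            if column == "response_id" or not value:
                continue  # skip the key column and unanswered questions
            answer = f"<{BASE}response/{row['response_id']}/answer/{column}>"
            question = f"<{BASE}question/{column}>"
            triples.append(f"{answer} <{RDF_TYPE}> <{VOCAB}Answer> .")
            triples.append(f"{answer} <{VOCAB}inResponse> {response} .")
            triples.append(f"{answer} <{VOCAB}toQuestion> {question} .")
            triples.append(f"{answer} <{VOCAB}value> {literal(value)} .")
    return triples


if __name__ == "__main__":
    for line in csv_to_ntriples(SAMPLE_CSV):
        print(line)
```

In this sketch each answer costs four triples and each response one, so the triple count grows with the number of answered questions rather than with the number of CSV columns.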

Publishing the Survey as RDF


Page 23: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Deployment of the Data

Upload the knowledge base into Virtuoso
- Questionnaire and survey results as one model
- Universal Server with multiple RDF/LOD functionalities

Adding metadata to the OGD-Survey-model (contributor, publisher, license)

Publishing the Survey as RDF


Page 24: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Publishing the model via OntoWiki
- Human- and machine-friendly
- HTML/JS web interface to explore and maintain the data
- SPARQL editor / endpoint
- LOD client and server
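For readers who have not used a SPARQL endpoint before, the sketch below shows how a client might build a query request following the standard SPARQL protocol (HTTP GET with a `query` parameter). The endpoint URL is a placeholder, since the slides do not give the actual endpoint address, and the query is a generic one that assumes nothing about the survey vocabulary. The request is only constructed here, not sent.

```python
from urllib.parse import urlencode
from urllib.request import Request

# Placeholder address; the actual endpoint URL is not stated in the slides.
ENDPOINT = "http://example.org/sparql"

# Generic query that lists a few arbitrary triples from the store.
QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 10"


def build_request(endpoint, query):
    """Build a SPARQL protocol GET request asking for JSON results."""
    url = f"{endpoint}?{urlencode({'query': query})}"
    return Request(url, headers={"Accept": "application/sparql-results+json"})


if __name__ == "__main__":
    request = build_request(ENDPOINT, QUERY)
    print(request.get_method(), request.full_url)
```

Passing the returned object to `urllib.request.urlopen` would send the query; that step is left out so the example runs without network access.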

Publishing the Survey as RDF


Page 25: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Publishing the Survey as RDF

http://data.lod2.eu/2010/ogd-survey/


Page 26: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Conclusion & Outreach

Page 27: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


• Analyzing the 329 responses showed the importance of facilitating Open Government Data

• Geospatial information, Scientific & Environmental Data are the top-ranked requested domains

• National & regional data seem to be most important for users

• The results show a shift from currently used formats to new formats

• The source of a dataset is the most important indicator for quality assurance

• White Papers, Best Practice and Success Stories are requested as additional information on publicdata.eu

Conclusion


Page 28: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


• LOD2 PUBLINK Consultancy 2010/2011
  • Greater London Authority, UK
  • City of Vienna, Austria
  • Umweltbundesamt, Austria
  • Historische Kommission, Germany
  • Parliament of Finland, Finland
  • Instituto Canario de Estadística (ISTAC)

• Next PUBLINK call Sept 2011 (LOD consumption)

• LOD2 Webinar Series

• Results of LOD2 OGD Stakeholder Survey 2010; a new survey is planned for late 2011

• LOD2 Technology Stack coming soon (autumn 2011)

• Open Data Camp 2011 powered by LOD2 in Warsaw, Poland - around 21st of October 2011

http://lod2.eu http://blog.lod2.eu http://survey.lod2.eu

LOD2 Outreach- What’s next in LOD2

Page 29: LOD2 Open Government Data Stakeholder Survey, Michael Martin and Martin Kaltenböck


Thank you for your attention!

Martin Kaltenböck, Semantic Web Company
Web: http://www.semantic-web.at
Blog: http://blog.semantic-web.at
Mail: [email protected]
Phone: +43 - 1 - 402 12 35 – 25

Michael Martin, University of Leipzig
Web: http://aksw.org
Blog: http://blog.aksw.org
Mail: [email protected]
Phone: +49 341 97-32322

LOD2 Project: http://lod2.eu

LOD2 Blog: http://blog.lod2.eu

LOD2 OGD Stakeholder Survey: http://survey.lod2.eu/

LOD2 OGD Stakeholder Survey data: http://data.lod2.eu/2010/ogd-survey/

PUBLINK LOD Consultancy: http://lod2.eu/Article/Publink.html

These slides are published under : http://creativecommons.org/licenses/by/3.0