qb'er demonstration

23
Tool for converting and linking statistical datasets to a cloud of interconnected historical datasets. QB’er - Demonstration Ashkan Ashkpour, IISH – CLARIAH WP4 07-10-2016

Upload: clariah

Post on 18-Jan-2017

36 views

Category:

Software


0 download

TRANSCRIPT

Page 1: QB'er demonstration

Tool for converting and linking statistical datasets to a cloud of interconnected historical datasets.

QB’er - Demonstration

Ashkan Ashkpour, IISH – CLARIAH WP407-10-2016

Page 2: QB'er demonstration

GOAL OF THIS PRESENTATIONFrom CSV files and structured statistical data to (harmonized) Interlinked data on the Web

Data Tooling Interlinked Datasets on the web

Page 3: QB'er demonstration

• Gather and enter own data• Find data on multiple repositories• Download• Clean and reshape• Merge• Clean and reshape…• Analyse

PROBLEM - Today’s Workflow

Page 4: QB'er demonstration

PROBLEMDisconnected data and efforts

We keep repeating ourselves and do this repeatedly for the same datasets

Comparability across time and datasets

Page 5: QB'er demonstration

https://blog.gaijinpot.com/knowledge-sharing-economy/

Page 6: QB'er demonstration

LOSS OFF.. Provenance Cleaning efforts (sometimes up to 60% of the work) Valuable mappings (discarding time consuming prior work) Expert decisions Discoverability

Page 7: QB'er demonstration

SOLUTION: INTEGRATE DISSIMILAR DATA IN FLEXIBLE AND ACCOUNTABLE WAYS

Page 8: QB'er demonstration

HARMONIZATION AND RDF What we want is harmonization by way of;

Standardization and Classification

Flexible approach while providing accountability

Page 9: QB'er demonstration
Page 10: QB'er demonstration
Page 11: QB'er demonstration
Page 12: QB'er demonstration

QB’EREmpower individual researchers to:

Code and harmonize individual datasets according to best practices of thecommunity (e.g. HISCO, SDMX, Worldbank, etc.) or against their colleagues

Share their own code lists with fellow researchers

Align code lists across datasets

Publish their standards-compliant datasets on a Structured Data Hub

Collaborative growing of a graph of interconnected datasets

Page 13: QB'er demonstration

INPUT

Page 14: QB'er demonstration

INPUT

Page 15: QB'er demonstration

INPUT

Page 16: QB'er demonstration

INPUT

Page 17: QB'er demonstration
Page 18: QB'er demonstration

DEMO EXAMPLE Nieuwkomers in de Utrechtse volkstelling van 1829 en 1839

http://hdl.handle.net/10622/KMAJLE

Page 19: QB'er demonstration

Utrecht 1829

Page 20: QB'er demonstration

Utrecht 1839

Variables

Values

Page 21: QB'er demonstration

DEMO

Qb’er Demonstration Video

Page 22: QB'er demonstration

TO CONCLUDE…• Generic, domain-independent tool• Uploading of a dataset and extraction of variables and value Frequencies• Mapping of variable values to codes (while preserving the originals!)• Publishing of dataset structure as Linked Data• Align codes and identifiers across datasets• Provenance of all assertions to the SDH traceable to time and person• Crowd-based production of code lists and mappings• Sharing / Reuse other people’s work (or stand on the shoulders of giants)• No disposable research

Page 23: QB'er demonstration

QUESTIONS ?

QB’er - Demonstration

Ashkan Ashkpour – CLARIAH WP407-10-2016