Provenance Information in the Web of Data

Post on 10-May-2015

DESCRIPTION

The slides for my presentation at the Linked Data Workshop at WWW 2009.

TRANSCRIPT

Provenance Information in the Web of Data

Olaf Hartig, Humboldt-Universität zu Berlin

http://olafhartig.de/foaf.rdf#olaf

Olaf Hartig - Provenance Information in the Web of Data 2

● Provenance of a data item: information about the history

Outline

Towards a model of Web data provenance

Provenance information in the Web of data today

Upcoming tasks

Existing Provenance Research

● Main research areas: (scientific) workflows, DBMSs
● General focus: data creation

Web data provenance comprises two dimensions:

● Data Creation
● Data Access

Basics of the Provenance Model

● Provenance graph describes provenance of a data item
● Nodes: provenance elements (pieces of provenance information)
● Edges: relate provenance elements to each other
● Subgraphs for related data items possible

● Provenance model defines:
  ● Types of provenance elements
  ● Relationships
● High level of abstraction (only main element types)

● General differentiation:
  ● Actors
  ● Executions
  ● Artifacts
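The basics above (a graph whose nodes are provenance elements, whose edges relate them, and whose elements are actors, executions, or artifacts) can be sketched in a few lines of Python. This is only an illustrative sketch, not the author's model: the class names, the edge labels "performed" and "used", and the example data are assumptions that do not appear in the slides.

```python
from dataclasses import dataclass, field
from enum import Enum


class ElementType(Enum):
    """The three general kinds of provenance elements named on the slide."""
    ACTOR = "actor"
    EXECUTION = "execution"
    ARTIFACT = "artifact"


@dataclass(frozen=True)
class ProvenanceElement:
    """A node of the provenance graph: one piece of provenance information."""
    name: str
    kind: ElementType


@dataclass
class ProvenanceGraph:
    """Provenance of one data item: elements (nodes) plus labeled edges."""
    data_item: str
    edges: list = field(default_factory=list)

    def relate(self, source, label, target):
        """Add an edge that relates two provenance elements."""
        self.edges.append((source, label, target))

    def elements(self):
        """Return every element that takes part in at least one edge."""
        found = set()
        for source, _, target in self.edges:
            found.update((source, target))
        return found


# Hypothetical example data and edge labels, not taken from the slides.
creator = ProvenanceElement("Alice", ElementType.ACTOR)
creation = ProvenanceElement("creation run", ElementType.EXECUTION)
source = ProvenanceElement("raw measurements", ElementType.ARTIFACT)

graph = ProvenanceGraph(data_item="http://example.org/data/item1")
graph.relate(creator, "performed", creation)
graph.relate(creation, "used", source)

actors = sorted(e.name for e in graph.elements() if e.kind is ElementType.ACTOR)
print(actors)
```

Keeping the element type as a field rather than a class hierarchy mirrors the "high level of abstraction" bullet: only the main types are fixed, and finer distinctions can be added later.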

Data Access Dimension

[Diagram: provenance elements of the data access dimension]

● Elements: Data Access; Data Item; Information Resource; Relation to the provided Information Resource; Data Providing Service (Non-Human); Data Publisher (Human); Service Provider; Data Accessor (Non-Human); Access Time
● Edge labels: contains, uses, controls
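As a rough illustration of the data access dimension, a consumer could record a few of these elements each time it retrieves a document. The sketch below is an assumption-laden example, not part of the presentation: the field names, the `record_access` helper, the URL, and the service and accessor names are all hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime, timezone


@dataclass(frozen=True)
class DataAccess:
    """One data access execution with some elements from the diagram."""
    information_resource: str   # document that contains the data item
    providing_service: str      # non-human service that served the document
    accessor: str               # non-human agent that retrieved it
    access_time: datetime


def record_access(resource_url, service, accessor, now=None):
    """Build an access record; `now` can be injected for reproducible runs."""
    when = now if now is not None else datetime.now(timezone.utc)
    return DataAccess(resource_url, service, accessor, when)


# Hypothetical values for illustration only.
fixed_time = datetime(2009, 1, 1, 12, 0, tzinfo=timezone.utc)
access = record_access(
    "http://example.org/data/doc.rdf",
    "example.org web server",
    "my-linked-data-crawler/0.1",
    now=fixed_time,
)
print(access.access_time.isoformat())
```

The publisher and the service provider are left out here because a consumer usually cannot observe them at access time; they would have to come from metadata supplied by the provider.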

Data Access Dimension cont.

[Diagram: integrity assurance for signed data]

● Elements: Integrity Assurance; (Signed) Artifact; Digital Signature; Signer; Public Key; Verification Result; Relation to the signed Data
● Edge labels: owns, signs

Data Creation Dimension

[Diagram: provenance elements of the data creation dimension]

● Elements: Data Creation; Data Item; (Encompassing) Data Item; Source Data; Relation to the created Data; Creation Time; Creation Guidelines; Provenance Information (shown three times in the diagram)
● Data Creator (Human or Non-human), {complete, disjoint}:
  ● Data Creating Service (e.g. Software Agent)
  ● Data Creating Entity (e.g. Person, Group, Organization)
  ● Data Creating Device (e.g. Sensor)
● Edge labels: responsible for, part of
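One way to read the data creation dimension is as a chain: each data item is produced by a creation step that may draw on source data, which was itself created earlier. The following sketch follows such a chain back to all direct and indirect sources. It is a simplified illustration under assumptions: the class and field names, the `all_sources` helper, and the sample items are hypothetical, and the enum only approximates the {complete, disjoint} constraint by giving each creator exactly one kind.

```python
from dataclasses import dataclass
from datetime import date
from enum import Enum


class CreatorKind(Enum):
    """Each creator is exactly one of these kinds."""
    SERVICE = "data creating service"   # e.g. software agent
    ENTITY = "data creating entity"     # e.g. person, group, organization
    DEVICE = "data creating device"     # e.g. sensor


@dataclass(frozen=True)
class DataCreator:
    name: str
    kind: CreatorKind


@dataclass(frozen=True)
class DataCreation:
    """One creation step with some elements from the diagram."""
    data_item: str
    creator: DataCreator
    creation_time: date
    source_data: tuple = ()
    guidelines: str = ""


def all_sources(creations, item):
    """Collect the direct and indirect source data of `item`."""
    seen = []
    stack = [item]
    while stack:
        creation = creations.get(stack.pop())
        if creation is None:
            continue  # no creation record known for this item
        for src in creation.source_data:
            if src not in seen:
                seen.append(src)
                stack.append(src)
    return seen


# Hypothetical sample data for illustration only.
creations = {
    "readings": DataCreation(
        "readings", DataCreator("thermometer-7", CreatorKind.DEVICE), date(2009, 1, 1)),
    "summary": DataCreation(
        "summary", DataCreator("aggregator", CreatorKind.SERVICE), date(2009, 1, 2),
        source_data=("readings",)),
    "report": DataCreation(
        "report", DataCreator("Alice", CreatorKind.ENTITY), date(2009, 1, 3),
        source_data=("summary",), guidelines="hypothetical style guide"),
}
print(all_sources(creations, "report"))
```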

Provenance information in the Web of data today

Provenance-related Vocabularies

● DC – Dublin Core Metadata Terms

● FOAF – Friend of a Friend

● SIOC – Semantically-Interlinked Online Communities

● SWP – Semantic Web Publishing vocabulary

● WOT – Web of Trust schema

● OMV – Ontology Metadata Vocabulary

● PML – Proof Markup Language

● Changeset vocabulary

● Ouzo Provenance Ontology

DC – Dublin Core Metadata Terms

● dc:creator
● dc:contributor
● dc:source
● dc:created
● dc:modified
● dc:publisher – “an entity responsible for making the resource available”
● dc:provenance

[Diagram excerpt from the data access dimension: Data Access; Data Providing Service (Non-Human); Data Publisher (Human); Service Provider; edge labels: uses, controls]
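To make the Dublin Core properties concrete, the sketch below writes a few of them as N-Triples statements about a data set. It assumes that the "dc:" prefix on the slide stands for the DCMI Metadata Terms namespace, http://purl.org/dc/terms/, since properties such as created and modified are defined there. The `ntriple` helper and all example.org URIs are hypothetical, and the literal escaping is deliberately minimal.

```python
# Namespace of the DCMI Metadata Terms (assumed meaning of the "dc:" prefix).
DCTERMS = "http://purl.org/dc/terms/"


def ntriple(subject, prop, value, is_uri=False):
    """Format one statement as an N-Triples line (minimal escaping only)."""
    if is_uri:
        obj = f"<{value}>"
    else:
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        obj = f'"{escaped}"'
    return f"<{subject}> <{DCTERMS}{prop}> {obj} ."


# Hypothetical data set and values, for illustration only.
dataset = "http://example.org/dataset"
lines = [
    ntriple(dataset, "creator", "http://example.org/people/alice", is_uri=True),
    ntriple(dataset, "publisher", "http://example.org/org/acme", is_uri=True),
    ntriple(dataset, "source", "http://example.org/raw-dump", is_uri=True),
    ntriple(dataset, "created", "2009-02-07"),
]
print("\n".join(lines))
```

In practice an RDF library would handle serialization and escaping; plain string formatting is used here only to keep the example free of dependencies.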

DC – Dublin Core Metadata Terms

Property         Occurrences*
dc:creator       about 24,284
dc:contributor   476
dc:source        about 3,631
dc:created       about 82,720
dc:modified      about 12,020
dc:provenance    7

* Measured by querying Sindice on Feb. 7, 2009 (at that time, Sindice indexed about 48.99 million documents).
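To put these counts in proportion, they can be set against the number of documents Sindice had indexed. The sketch below does that arithmetic. It treats the rounded figures as exact and assumes at most one occurrence per document, which the slide does not state, so the resulting shares are rough indications only.

```python
# Figures from the slide; "about" values are treated as exact here.
INDEXED_DOCUMENTS = 48_990_000

occurrences = {
    "dc:creator": 24_284,
    "dc:contributor": 476,
    "dc:source": 3_631,
    "dc:created": 82_720,
    "dc:modified": 12_020,
    "dc:provenance": 7,
}


def share(count, total=INDEXED_DOCUMENTS):
    """Occurrences relative to the number of indexed documents."""
    return count / total


# Print the properties from most to least frequent.
for prop, count in sorted(occurrences.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{prop:<15} {count:>7,}  {share(count):.4%}")

most_used = max(occurrences, key=occurrences.get)
```

Even the most frequent property, dc:created, comes out at under 0.2 percent of the indexed documents under this assumption.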

Main Issues Today

● Vocabularies:
  ● Partly unsuitable
  ● Lack of certain features
  ● Coverage of provenance model impossible
● General lack of provenance-related metadata on the Web of data

Possible Reasons

● Lack of suitable vocabularies
● Lack of usable tools
● Ignorance / lack of sensitization

Upcoming tasks

Address the Issues

● Let's develop a vocabulary for Web data provenance
  ● Proposal: refine the presented provenance model
  ● Integrate existing vocabularies for specific types of provenance elements
● Let's develop usable tools for data providers
  ● Edit and publish provenance-related metadata
  ● Automatic generation if possible
● Let's raise awareness among data providers
  ● Probably the hardest task
  ● Maybe voiD can help

Olaf Hartig, Humboldt-Universität zu Berlin

http://olafhartig.de/foaf.rdf#olaf

Thank you!

These slides have been created by Olaf Hartig

http://olafhartig.de

This work is licensed under a Creative Commons Attribution-Share Alike 3.0 License

(http://creativecommons.org/licenses/by-sa/3.0/)

Attribution:
● http://www.flickr.com/photos/adrenalin/3032734/
● http://www.hasslefreeclipart.com
● http://www.flickr.com/photos/dullhunk/428079229/
● http://www.flickr.com/photos/darwinbell/1337963794/
● http://www.flickr.com/photos/alandd/2780700767/
● http://www.flickr.com/photos/simeon_barkas/2872099696/
● http://www.flickr.com/photos/robinh00d/122544491/
● http://www.flickr.com/photos/adrenalin/3032747/
