the web of data - tom heath

Post on 30-Nov-2014

1.597 Views

Category:

Education

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

Tom Heath's presentation at SSSW 2012

TRANSCRIPT

shared innovation

The Web of Data

Tom Heath

Senior Research ScientistTalis Education Ltd

July 2012, SSSW2012, Cercedilla, Spain

shared innovation

This talk is about the basics!

shared innovation

Overview

• Why a Web of Data is not just about data on the Web

• Building blocks of a Web of Data (i.e. Linked Data fundamentals)

shared innovation

Data on the Web

shared innovation

Data on the Web

• Takes many different forms

• Falls on a continuum of accessibility/usability, structure, meaning and connectedness...

shared innovation

Data on the Web

PDF → HTML → Microformats → CSV/XML → Microdata →RDF

shared innovation

shared innovation

shared innovation

shared innovation

shared innovation

shared innovation

shared innovation

Schema.org

• The Semantic Web has gone mainstream!

• But they're trying to create one ontology for everything!

• Is it linked to anything else?

shared innovation

Issues, PDF through Microdata

• Structure, parseability– How do we validate what we're extracting?– Is there an underlying data model?

• How do we know what the data means?

• How do you relate parts of the data together?

• How do consumers discover related data?

shared innovation

Data on the Web != A Web of Data

shared innovation

If there are no links, it's not a Web!

shared innovation

The Web of Data

shared innovation

What is the Web of Data?

• A boundless graph of interconnected data available online...

shared innovation

What is a Graph?

Tom

Enrico

Talis Education

Spain

SSSW2012

Cercedilla

Italy

Madrid

2012-07-08

employedBy

studentOf

tutorAt

startedOn

bornIn

locatedIn

England

near

capitalOf

locatedIn

shared innovation

One Graph, Many Documents

Tom

Enrico

Talis Education

Spain

SSSW2012

Cercedilla

Italy

Madrid

2012-07-08

employedBy

studentOf

tutorAt

startedOn

bornIn

locatedIn

England

near

capitalOf

lost Euro2012Final

locatedIn

won

shared innovation

SubjectObject

Predicate

The RDF Data Model: Triples

shared innovation

TomTalis

employedBy

The RDF Data Model: Triples

shared innovation

Why I RDF

• Clear underlying data model

• Linkability baked into the data model

• Ways to express what the data means

• Many different ways to write it (for different use cases)

• Very easily parsed (in some cases)

shared innovation

RDF Serialisation Formats

• RDFa

• RDF/XML

• N-Triples

• Turtle

• RDF/JSON

shared innovation

Identifying Things in the Graph

• Use URIs

• But not any old URIs

• HTTP URIs– (almost completely) distributed ownership– anyone (with a credit card) can create/own them– can identify anything– lookup mechanism is baked in

shared innovation

http://tomheath.com/id/mehttp://www.talis.com/id/company

http://example.org/employedBy

HTTP URIs for Things

shared innovation

Retrieving Humans over the Web

shared innovation

Linked Data Hands-On Session

• Goal:– Give practical experience of working with Linked Data from

the Web– Give exposure to different application architectures

• Scenario– Building a travel guide application

• Tools– Curl, LDSpider, Fuseki, Rapper, Squin

shared innovation

Questions?

• Contact Info– tom.heath@talis.com– http://tomheath.com/id/me– http://www.talis.com/– @tommyh (twitter)

• Slides– http://tomheath.com/slides/2012-07-cercedilla-

web-of-data.pdf

shared innovation

top related