foundations of semantic web technologies · tu dresden foundations of semantic web technologies...
TRANSCRIPT
FOUNDATIONS OF SEMANTICWEB TECHNOLOGIES
Linked Data and Applications
Sebastian Rudolph
Agenda
1 Linked (Open) Data
2 Semantic Web and HTMLRDFaMicroformatsGoogle Knowledge Graph
3 OWL ApplicationsOWL DL Application EDF EnergyOWL Profile Application BBC World CupSemantic Technologies in the Pharmaceutical Industry
4 Summary
TU Dresden Foundations of Semantic Web Technologies slide 2 of 50
Agenda
1 Linked (Open) Data
2 Semantic Web and HTMLRDFaMicroformatsGoogle Knowledge Graph
3 OWL ApplicationsOWL DL Application EDF EnergyOWL Profile Application BBC World CupSemantic Technologies in the Pharmaceutical Industry
4 Summary
TU Dresden Foundations of Semantic Web Technologies slide 3 of 50
Data in the Web• more and more data is available in the Web for programmatic access• often specified using Semantic Web Standards, e.g., the following Linking
Open Data (LOD) Initiativehttp://www.w3.org/wiki/SweoIG/TaskForces/CommunityProjects/LinkingOpenData
• using APIs, e.g. via JSON/REST
• Semantic Web technologies simplify the integration of data from differentsources
• combination of data leads to deeper insights
TU Dresden Foundations of Semantic Web Technologies slide 4 of 50
Linked Data in the Web 01.05.2007
Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
TU Dresden Foundations of Semantic Web Technologies slide 5 of 50
Linked Data in the Web 31.03.2008
SWConference
Corpus
DBpedia RDF Book Mashup
DBLPBerlin
Revyu
Project Guten-berg
FOAFprofiles
Geo-names
Music-brainz
Magna-tuneJamendo
World Fact-book
DBLPHannover
SIOCprofiles
Sem-Web-
Central
Euro-stat
ECS South-ampton
BBCLater +TOTP
Doap-space
Open-Guides
Gov-Track
US Census Data
W3CWordNet
flickrwrapprWiki-
company
OpenCyc
lingvoj
Onto-world
BBCJohnPeel
Flickrexporter
Audio-Scrobbler QDOS
updated
RKB Explorer
NEW!riese
NEW!
Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
TU Dresden Foundations of Semantic Web Technologies slide 6 of 50
Linked Data in the Web 14.07.2009
As of July 2009
LinkedCTReactome
Taxonomy
KEGG
PubMed
GeneID
Pfam
UniProt
OMIM
PDB
SymbolChEBI
Daily Med
Disea-some
CAS
HGNC
InterPro
Drug Bank
UniParc
UniRef
ProDom
PROSITE
Gene Ontology
HomoloGene
PubChem
MGI
UniSTS
GEOSpecies
Jamendo
BBCProgrammes
Music-brainz
Magna-tune
BBCLater +TOTP
SurgeRadio
MySpaceWrapper
Audio-Scrobbler
LinkedMDB
BBCJohnPeel
BBCPlaycount
Data
Gov-Track
US Census Data
riese
Geo-names
lingvoj
World Fact-book
Euro-stat
flickrwrappr
Open Calais
RevyuSIOCSites
Doap-space
Flickrexporter
FOAFprofiles
CrunchBase
Sem-Web-
Central
Open-Guides
Wiki-company
QDOS
Pub Guide
RDF ohloh
W3CWordNet
OpenCyc
UMBEL
Yago
DBpediaFreebase
Virtuoso Sponger
DBLPHannover
IRIT Toulouse
SWConference
Corpus
RDF Book Mashup
Project Guten-berg
DBLPBerlin
LAAS- CNRS
Buda-pestBME
IEEE
IBM
Resex
Pisa
New-castle
RAE 2001
CiteSeer
ACM
DBLP RKB
Explorer
eprints
LIBRIS
SemanticWeb.org
Eurécom
RKBECS
South-ampton
CORDIS
ReSIST ProjectWiki
NationalScience
Foundation
ECS South-ampton
LinkedGeoData
BBC Music
Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
TU Dresden Foundations of Semantic Web Technologies slide 7 of 50
Linked Data in the Web 22.09.2010
As of September 2010
MusicBrainz
(zitgist)
P20
YAGO
World Fact-book (FUB)
WordNet (W3C)
WordNet(VUA)
VIVO UFVIVO
Indiana
VIVO Cornell
VIAF
URIBurner
Sussex Reading
Lists
Plymouth Reading
Lists
UMBEL
UK Post-codes
legislation.gov.uk
Uberblic
UB Mann-heim
TWC LOGD
Twarql
transportdata.gov
.uk
totl.net
Tele-graphis
TCMGeneDIT
TaxonConcept
The Open Library (Talis)
t4gm
Surge Radio
STW
RAMEAU SH
statisticsdata.gov
.uk
St. Andrews Resource
Lists
ECS South-ampton EPrints
Semantic CrunchBase
semanticweb.org
SemanticXBRL
SWDog Food
rdfabout US SEC
Wiki
UN/LOCODE
Ulm
ECS (RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-castle
LAAS
KISTIJISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints
dotAC
DEPLOY
DBLP (RKB
Explorer)
Course-ware
CORDIS
CiteSeer
Budapest
ACM
riese
Revyu
researchdata.gov
.uk
referencedata.gov
.uk
Recht-spraak.
nl
RDFohloh
Last.FM (rdfize)
RDF Book
Mashup
PSH
ProductDB
PBAC
Poké-pédia
Ord-nance Survey
Openly Local
The Open Library
OpenCyc
OpenCalais
OpenEI
New York
Times
NTU Resource
Lists
NDL subjects
MARC Codes List
Man-chesterReading
Lists
Lotico
The London Gazette
LOIUS
lobidResources
lobidOrgani-sations
LinkedMDB
LinkedLCCN
LinkedGeoData
LinkedCT
Linked Open
Numbers
lingvoj
LIBRIS
Lexvo
LCSH
DBLP (L3S)
Linked Sensor Data (Kno.e.sis)
Good-win
Family
Jamendo
iServe
NSZL Catalog
GovTrack
GESIS
GeoSpecies
GeoNames
GeoLinkedData(es)
GTAA
STITCHSIDER
Project Guten-berg (FUB)
MediCare
Euro-stat
(FUB)
DrugBank
Disea-some
DBLP (FU
Berlin)
DailyMed
Freebase
flickr wrappr
Fishes of Texas
FanHubz
Event-Media
EUTC Produc-
tions
Eurostat
EUNIS
ESD stan-dards
Popula-tion (En-AKTing)
NHS (EnAKTing)
Mortality (En-
AKTing)Energy
(En-AKTing)
CO2(En-
AKTing)
educationdata.gov
.uk
ECS South-ampton
Gem. Norm-datei
datadcs
MySpace(DBTune)
MusicBrainz
(DBTune)
Magna-tune
John Peel(DB
Tune)
classical(DB
Tune)
Audio-scrobbler (DBTune)
Last.fmArtists
(DBTune)
DBTropes
dbpedia lite
DBpedia
Pokedex
Airports
NASA (Data Incu-bator)
MusicBrainz(Data
Incubator)
Moseley Folk
Discogs(Data In-cubator)
Climbing
Linked Data for Intervals
Cornetto
Chronic-ling
America
Chem2Bio2RDF
biz.data.
gov.uk
UniSTS
UniRef
UniPath-way
UniParc
Taxo-nomy
UniProt
SGD
Reactome
PubMed
PubChem
PRO-SITE
ProDom
Pfam PDB
OMIM
OBO
MGI
KEGG Reaction
KEGG Pathway
KEGG Glycan
KEGG Enzyme
KEGG Drug
KEGG Cpd
InterPro
HomoloGene
HGNC
Gene Ontology
GeneID
GenBank
ChEBI
CAS
Affy-metrix
BibBaseBBC
Wildlife Finder
BBC Program
mesBBC
Music
rdfaboutUS Census
Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
TU Dresden Foundations of Semantic Web Technologies slide 8 of 50
Linked Data in the Web 19.09.2011
As of September 2011
MusicBrainz
(zitgist)
P20
Turismo de
Zaragoza
yovisto
Yahoo! Geo
Planet
YAGO
World Fact-book
El ViajeroTourism
WordNet (W3C)
WordNet (VUA)
VIVO UF
VIVO Indiana
VIVO Cornell
VIAF
URIBurner
Sussex Reading
Lists
Plymouth Reading
Lists
UniRef
UniProt
UMBEL
UK Post-codes
legislationdata.gov.uk
Uberblic
UB Mann-heim
TWC LOGD
Twarql
transportdata.gov.
uk
Traffic Scotland
theses.fr
Thesau-rus W
totl.net
Tele-graphis
TCMGeneDIT
TaxonConcept
Open Library (Talis)
tags2con delicious
t4gminfo
Swedish Open
Cultural Heritage
Surge Radio
Sudoc
STW
RAMEAU SH
statisticsdata.gov.
uk
St. Andrews Resource
Lists
ECS South-ampton EPrints
SSW Thesaur
us
SmartLink
Slideshare2RDF
semanticweb.org
SemanticTweet
Semantic XBRL
SWDog Food
Source Code Ecosystem Linked Data
US SEC (rdfabout)
Sears
Scotland Geo-
graphy
ScotlandPupils &Exams
Scholaro-meter
WordNet (RKB
Explorer)
Wiki
UN/LOCODE
Ulm
ECS (RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-castle
LAASKISTI
JISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints dotAC
DEPLOY
DBLP (RKB
Explorer)
Crime Reports
UK
Course-ware
CORDIS (RKB
Explorer)CiteSeer
Budapest
ACM
riese
Revyu
researchdata.gov.
ukRen. Energy Genera-
tors
referencedata.gov.
uk
Recht-spraak.
nl
RDFohloh
Last.FM (rdfize)
RDF Book
Mashup
Rådata nå!
PSH
Product Types
Ontology
ProductDB
PBAC
Poké-pédia
patentsdata.go
v.uk
OxPoints
Ord-nance Survey
Openly Local
Open Library
OpenCyc
Open Corpo-rates
OpenCalais
OpenEI
Open Election
Data Project
OpenData
Thesau-rus
Ontos News Portal
OGOLOD
JanusAMP
Ocean Drilling Codices
New York
Times
NVD
ntnusc
NTU Resource
Lists
Norwe-gian
MeSH
NDL subjects
ndlna
myExperi-ment
Italian Museums
medu-cator
MARC Codes List
Man-chester Reading
Lists
Lotico
Weather Stations
London Gazette
LOIUS
Linked Open Colors
lobidResources
lobidOrgani-sations
LEM
LinkedMDB
LinkedLCCN
LinkedGeoData
LinkedCT
LinkedUser
FeedbackLOV
Linked Open
Numbers
LODE
Eurostat (OntologyCentral)
Linked EDGAR
(OntologyCentral)
Linked Crunch-
base
lingvoj
Lichfield Spen-ding
LIBRIS
Lexvo
LCSH
DBLP (L3S)
Linked Sensor Data (Kno.e.sis)
Klapp-stuhl-club
Good-win
Family
National Radio-activity
JP
Jamendo (DBtune)
Italian public
schools
ISTAT Immi-gration
iServe
IdRef Sudoc
NSZL Catalog
Hellenic PD
Hellenic FBD
PiedmontAccomo-dations
GovTrack
GovWILD
GoogleArt
wrapper
gnoss
GESIS
GeoWordNet
GeoSpecies
GeoNames
GeoLinkedData
GEMET
GTAA
STITCH
SIDER
Project Guten-berg
MediCare
Euro-stat
(FUB)
EURES
DrugBank
Disea-some
DBLP (FU
Berlin)
DailyMed
CORDIS(FUB)
Freebase
flickr wrappr
Fishes of Texas
Finnish Munici-palities
ChEMBL
FanHubz
EventMedia
EUTC Produc-
tions
Eurostat
Europeana
EUNIS
EU Insti-
tutions
ESD stan-dards
EARTh
Enipedia
Popula-tion (En-AKTing)
NHS(En-
AKTing) Mortality(En-
AKTing)
Energy (En-
AKTing)
Crime(En-
AKTing)
CO2 Emission
(En-AKTing)
EEA
SISVU
education.data.g
ov.uk
ECS South-ampton
ECCO-TCP
GND
Didactalia
DDC Deutsche Bio-
graphie
datadcs
MusicBrainz
(DBTune)
Magna-tune
John Peel
(DBTune)
Classical (DB
Tune)
AudioScrobbler (DBTune)
Last.FM artists
(DBTune)
DBTropes
Portu-guese
DBpedia
dbpedia lite
Greek DBpedia
DBpedia
data-open-ac-uk
SMCJournals
Pokedex
Airports
NASA (Data Incu-bator)
MusicBrainz(Data
Incubator)
Moseley Folk
Metoffice Weather Forecasts
Discogs (Data
Incubator)
Climbing
data.gov.uk intervals
Data Gov.ie
databnf.fr
Cornetto
reegle
Chronic-ling
America
Chem2Bio2RDF
Calames
businessdata.gov.
uk
Bricklink
Brazilian Poli-
ticians
BNB
UniSTS
UniPathway
UniParc
Taxonomy
UniProt(Bio2RDF)
SGD
Reactome
PubMedPub
Chem
PRO-SITE
ProDom
Pfam
PDB
OMIMMGI
KEGG Reaction
KEGG Pathway
KEGG Glycan
KEGG Enzyme
KEGG Drug
KEGG Com-pound
InterPro
HomoloGene
HGNC
Gene Ontology
GeneID
Affy-metrix
bible ontology
BibBase
FTS
BBC Wildlife Finder
BBC Program
mes BBC Music
Alpine Ski
Austria
LOCAH
Amster-dam
Museum
AGROVOC
AEMET
US Census (rdfabout)
Linking Open Data cloud diagram, by Richard Cyganiak and AnjaJentzsch. http://lod-cloud.net/
TU Dresden Foundations of Semantic Web Technologies slide 9 of 50
Linked Data Principles*
Linked Data consists mainly of a number of principles for publishing data in theWeb:
1 Use URIs as names for things– documents, people, locations, concepts, etc.
2 Use HTTP URIs so that people can look up those names3 When someone looks up a URI, provide useful information, using the
standards (RDF*, SPARQL)4 Include links to other URIs, so that they can discover more things.
*http://www.w3.org/DesignIssues/LinkedData.html
TU Dresden Foundations of Semantic Web Technologies slide 10 of 50
5 Star Linked (Open) Data
8 Available on the web (whatever format) but with an openlicence, to be Open Data
88 Available as machine-readable structured data (e.g. excelinstead of image scan of a table)
888 as (2) plus non-proprietary format (e.g. CSV instead of excel)
8888 All the above plus, use open standards from W3C (RDF andSPARQL) to identify things, so that people can point at yourstuff
88888 All the above, plus: Link your data to other people’s data toprovide context
http://www.w3.org/DesignIssues/LinkedData.html
TU Dresden Foundations of Semantic Web Technologies slide 11 of 50
De-Referencing of an IRI• These IRIs can then be used also in other documents• For example in the document <http://ex.org/jones>:
<#denise> fam:child <#edwin>,<http://ex.org/smith#carol> .
• One can then extract the URL <http://ex.org/smith> from<http://ex.org/smith#carol> and find information about #carolfrom there
TU Dresden Foundations of Semantic Web Technologies slide 12 of 50
Connection between the IRI of a Thingand IRI of a Source
User Agent
Web Server
http://www.polleres.net/foaf.rdf#me
http://www.polleres.net/foaf.rdf
HTTPGET RDF
TU Dresden Foundations of Semantic Web Technologies slide 13 of 50
Connection between the IRI of a Thingand IRI of a Source
User Agent
Web Server
http://dbpedia.org/resource/Gordon_Brown
http://dbpedia.org/data/Gordon_Brown
http://dbpedia.org/page/Gordon_Brown
HTTPGET 303* HTTP
GET RDF
*HTTP Response Code 303: See OtherTU Dresden Foundations of Semantic Web Technologies slide 14 of 50
Representations• Information resources can have different representations• A representation is a stream of bytes in a certain format such as HTML,
RDF/XML or JPEG• Example: an invoice is an information resource that might be represented
as printable PDF or as RDF document• A single resource can have many different representations, e.g., in
different formats, resolutions or languages
TU Dresden Foundations of Semantic Web Technologies slide 15 of 50
HTTP Content Negotiation• Content Negotiation (CN, conneg) is the process of selecting the best
representation for a query if several representations are available
$ curl -I -H "Accept: application/rdf+xml"http://dbpedia.org/resource/Gordon_Brown
$ curl -I -H "Accept: text/html"http://dbpedia.org/resource/Gordon_Brown
curl – Tool to send requests to a server or receive responses-H Custom header to pass to server-I Show document info only
TU Dresden Foundations of Semantic Web Technologies slide 16 of 50
HTTP Content Negotiation
$ curl -I -H "Accept: text/html"http://dbpedia.org/resource/Gordon_Brown
HTTP/1.1 303 See OtherDate: Mon, 04 Feb 2013 10:37:10 GMTContent-Type: text/html; charset=UTF-8Content-Length: 0Connection: keep-aliveServer: Virtuoso/06.04.3132 (Linux) [...]Accept-Ranges: bytesLocation: http://dbpedia.org/page/Gordon_Brown
TU Dresden Foundations of Semantic Web Technologies slide 17 of 50
HTTP Content Negotiation
$ curl -I -H "Accept: application/rdf+xml"http://dbpedia.org/resource/Gordon_Brown
HTTP/1.1 303 See OtherDate: Mon, 04 Feb 2013 10:36:59 GMTContent-Type: application/rdf+xml; qs=0.95Content-Length: 0Connection: keep-aliveServer: Virtuoso/06.04.3132 (Linux) [...]Accept-Ranges: bytesTCN: choiceVary: negotiate,acceptContent-Location: /data/Gordon_Brown.xmlLink: <http://mementoarchive.lanl.gov/dbpedia[...]Location: http://dbpedia.org/data/Gordon_Brown.xmlTU Dresden Foundations of Semantic Web Technologies slide 18 of 50
Linked Data Applications: MinimalArchitecture
As of September 2011
MusicBrainz
(zitgist)
P20
Turismo de
Zaragoza
yovisto
Yahoo! Geo
Planet
YAGO
World Fact-book
El ViajeroTourism
WordNet (W3C)
WordNet (VUA)
VIVO UF
VIVO Indiana
VIVO Cornell
VIAF
URIBurner
Sussex Reading
Lists
Plymouth Reading
Lists
UniRef
UniProt
UMBEL
UK Post-codes
legislationdata.gov.uk
Uberblic
UB Mann-heim
TWC LOGD
Twarql
transportdata.gov.
uk
Traffic Scotland
theses.fr
Thesau-rus W
totl.net
Tele-graphis
TCMGeneDIT
TaxonConcept
Open Library (Talis)
tags2con delicious
t4gminfo
Swedish Open
Cultural Heritage
Surge Radio
Sudoc
STW
RAMEAU SH
statisticsdata.gov.
uk
St. Andrews Resource
Lists
ECS South-ampton EPrints
SSW Thesaur
us
SmartLink
Slideshare2RDF
semanticweb.org
SemanticTweet
Semantic XBRL
SWDog Food
Source Code Ecosystem Linked Data
US SEC (rdfabout)
Sears
Scotland Geo-
graphy
ScotlandPupils &Exams
Scholaro-meter
WordNet (RKB
Explorer)
Wiki
UN/LOCODE
Ulm
ECS (RKB
Explorer)
Roma
RISKS
RESEX
RAE2001
Pisa
OS
OAI
NSF
New-castle
LAASKISTI
JISC
IRIT
IEEE
IBM
Eurécom
ERA
ePrints dotAC
DEPLOY
DBLP (RKB
Explorer)
Crime Reports
UK
Course-ware
CORDIS (RKB
Explorer)CiteSeer
Budapest
ACM
riese
Revyu
researchdata.gov.
ukRen. Energy Genera-
tors
referencedata.gov.
uk
Recht-spraak.
nl
RDFohloh
Last.FM (rdfize)
RDF Book
Mashup
Rådata nå!
PSH
Product Types
Ontology
ProductDB
PBAC
Poké-pédia
patentsdata.go
v.uk
OxPoints
Ord-nance Survey
Openly Local
Open Library
OpenCyc
Open Corpo-rates
OpenCalais
OpenEI
Open Election
Data Project
OpenData
Thesau-rus
Ontos News Portal
OGOLOD
JanusAMP
Ocean Drilling Codices
New York
Times
NVD
ntnusc
NTU Resource
Lists
Norwe-gian
MeSH
NDL subjects
ndlna
myExperi-ment
Italian Museums
medu-cator
MARC Codes List
Man-chester Reading
Lists
Lotico
Weather Stations
London Gazette
LOIUS
Linked Open Colors
lobidResources
lobidOrgani-sations
LEM
LinkedMDB
LinkedLCCN
LinkedGeoData
LinkedCT
LinkedUser
FeedbackLOV
Linked Open
Numbers
LODE
Eurostat (OntologyCentral)
Linked EDGAR
(OntologyCentral)
Linked Crunch-
base
lingvoj
Lichfield Spen-ding
LIBRIS
Lexvo
LCSH
DBLP (L3S)
Linked Sensor Data (Kno.e.sis)
Klapp-stuhl-club
Good-win
Family
National Radio-activity
JP
Jamendo (DBtune)
Italian public
schools
ISTAT Immi-gration
iServe
IdRef Sudoc
NSZL Catalog
Hellenic PD
Hellenic FBD
PiedmontAccomo-dations
GovTrack
GovWILD
GoogleArt
wrapper
gnoss
GESIS
GeoWordNet
GeoSpecies
GeoNames
GeoLinkedData
GEMET
GTAA
STITCH
SIDER
Project Guten-berg
MediCare
Euro-stat
(FUB)
EURES
DrugBank
Disea-some
DBLP (FU
Berlin)
DailyMed
CORDIS(FUB)
Freebase
flickr wrappr
Fishes of Texas
Finnish Munici-palities
ChEMBL
FanHubz
EventMedia
EUTC Produc-
tions
Eurostat
Europeana
EUNIS
EU Insti-
tutions
ESD stan-dards
EARTh
Enipedia
Popula-tion (En-AKTing)
NHS(En-
AKTing) Mortality(En-
AKTing)
Energy (En-
AKTing)
Crime(En-
AKTing)
CO2 Emission
(En-AKTing)
EEA
SISVU
education.data.g
ov.uk
ECS South-ampton
ECCO-TCP
GND
Didactalia
DDC Deutsche Bio-
graphie
datadcs
MusicBrainz
(DBTune)
Magna-tune
John Peel
(DBTune)
Classical (DB
Tune)
AudioScrobbler (DBTune)
Last.FM artists
(DBTune)
DBTropes
Portu-guese
DBpedia
dbpedia lite
Greek DBpedia
DBpedia
data-open-ac-uk
SMCJournals
Pokedex
Airports
NASA (Data Incu-bator)
MusicBrainz(Data
Incubator)
Moseley Folk
Metoffice Weather Forecasts
Discogs (Data
Incubator)
Climbing
data.gov.uk intervals
Data Gov.ie
databnf.fr
Cornetto
reegle
Chronic-ling
America
Chem2Bio2RDF
Calames
businessdata.gov.
uk
Bricklink
Brazilian Poli-
ticians
BNB
UniSTS
UniPathway
UniParc
Taxonomy
UniProt(Bio2RDF)
SGD
Reactome
PubMedPub
Chem
PRO-SITE
ProDom
Pfam
PDB
OMIMMGI
KEGG Reaction
KEGG Pathway
KEGG Glycan
KEGG Enzyme
KEGG Drug
KEGG Com-pound
InterPro
HomoloGene
HGNC
Gene Ontology
GeneID
Affy-metrix
bible ontology
BibBase
FTS
BBC Wildlife Finder
BBC Program
mes BBC Music
Alpine Ski
Austria
LOCAH
Amster-dam
Museum
AGROVOC
AEMET
US Census (rdfabout)
Que
ry
Res
pons
e
TU Dresden Foundations of Semantic Web Technologies slide 19 of 50
Linked Data Summary
Semantic technologies simplify the access to data:• Facts regarding Berlin?
– http://de.dbpedia.org/resource/Berlin• Information about Queen
– BBC Music: http://www.bbc.co.uk/music/artists/0383dadf-2a4e-4d10-a46a-e9e041da8eb3
– MusicBrainz: http://musicbrainz.org/artist/0383dadf-2a4e-4d10-a46a-e9e041da8eb3.html
• Data integration gives additional benefits
TU Dresden Foundations of Semantic Web Technologies slide 20 of 50
Linked Data Tools• Tabulator Browser PlugIn/Ajax Scripts:
http://www.w3.org/2005/ajar/tab
• Semantic Web Client Library (Querying the complete Semantic Web withSPARQL): http://wifo5-03.informatik.uni-mannheim.de/bizer/ng4j/semwebclient/
• D2R Server: Accessing databases with SPARQL and as Linked Data:http://d2rq.org/d2r-server
• Data cleaning & linking to Freebase:https://github.com/OpenRefine (was Google Refine)
• RDF Export for Google Refine: http://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
TU Dresden Foundations of Semantic Web Technologies slide 21 of 50
Agenda
1 Linked (Open) Data
2 Semantic Web and HTMLRDFaMicroformatsGoogle Knowledge Graph
3 OWL ApplicationsOWL DL Application EDF EnergyOWL Profile Application BBC World CupSemantic Technologies in the Pharmaceutical Industry
4 Summary
TU Dresden Foundations of Semantic Web Technologies slide 22 of 50
RDFa Example• Integration of RDF in (X)HTML documents
All content on this site is licensed under <ahref="http://creativecommons.org/licenses/by/3.0/">a Creative Commons License</a>.
versusAll content on this site is licensed under<a rel="license"
href="http://creativecommons.org/licenses/by/3.0/">a Creative Commons License</a>.
<http://example.org/a.html><http://creativecommons.org/licenses/by/3.0/>
license
TU Dresden Foundations of Semantic Web Technologies slide 23 of 50
RDFa Example 2
<div><h2>The trouble with Bob</h2><h3>Alice</h3>...
</div>
versus
<div xmlns:dc="http://purl.org/dc/elements/1.1/"><h2 property="dc:title">The trouble with Bob</h2><h3 property="dc:creator">Alice</h3>...
</div>
TU Dresden Foundations of Semantic Web Technologies slide 24 of 50
RDFa Example 3<div>
<p>Alice Birpemswick</p><p>Email: <a href="mailto:[email protected]">
[email protected]</a></p><p>Phone: <a href="tel:+1-617-555-7332">
+1 617.555.7332</a></p></div>
versus
<div typeof="foaf:Person"xmlns:foaf="http://xmlns.com/foaf/0.1/">
<p property="foaf:name">Alice Birpemswick</p><p>Email: <a href="mailto:[email protected]"
rel="foaf:mbox">[email protected]</a></p><p>Phone: <a href="tel:+1-617-555-7332"
rel="foaf:phone">+1-617-555-7332</a></p></div>TU Dresden Foundations of Semantic Web Technologies slide 25 of 50
Applications of RDFa• Google filters, for example, RDFa terms and uses them to improve the
presentation of search results
TU Dresden Foundations of Semantic Web Technologies slide 26 of 50
Applications of RDFa• Google filters, for example, RDFa terms and uses them to improve the
presentation of search results
TU Dresden Foundations of Semantic Web Technologies slide 27 of 50
MicroformatsMicroformats are simple and open data formats based on existing standards(XHTML)
• Uses POSH (Plain Old Semantic HTML), i.e., HTML Tags that do notspecify the presentation (bold, i), but that have semantics (abbr,acronym, title, . . . )
• Use of semantic CSS class names– not: <span class="blueText">...</span>– but: <span class="submenu">...</span>
• Special vocabularies for the markup in some domains• Considers the roles and semantics of the elements
TU Dresden Foundations of Semantic Web Technologies slide 28 of 50
hRecipe – Microformat for Recipes• For the semantic annotation of web pages for recipes• Allows for
– searching for recipes with certain ingredients– automatic grouping of recipes– finding quick recipes (short preparation time)
• Mapping into RDFa exists (hrecipe-rdf)
Example<div class="hrecipe">
<h1 class="fn">French Fries</h1><p class="summary">French Fries ...</p><p>
Contributed by <span class="author">Tom</span>and the <span class="author vcard"><a class="url fn" href="...">Cooky Gang</a></span>.
</p>...
TU Dresden Foundations of Semantic Web Technologies slide 29 of 50
hRecipe – Microformat for Recipes
Example...<p>Published <span class="published">
<span class="value-title"title="2008-10-14T10:05:37-01:00"/>
14. Oct 2008</span></p><h2>Ingredients</h2><ul>
<li class="ingredient"><span class="value">500</span><span class="type">gr</span> potatoes.
</li>...
</ul>...
</div>
TU Dresden Foundations of Semantic Web Technologies slide 30 of 50
Example in the Web• The Recipe Depository http://www.therecipedepository.com/• SAPO Sabores http://sabores.sapo.pt/• ITV Food http://www.itv.com/food/
• Epicurious.com http://www.epicurious.com/
• http://foodnetwork.com/
• Plan to Eat http://www.plantoeat.com/recipe_book• essen & trinken http://www.essen-und-trinken.de/
hRecipe-conform meta data in RDF• ...
TU Dresden Foundations of Semantic Web Technologies slide 31 of 50
TU Dresden Foundations of Semantic Web Technologies slide 32 of 50
Micro Data and schema.org• Joint project of Google, Microsoft and Yahoo!• Compromise between the extensibility of RDFa and the simplicity of
microformats• Goal: better understanding of the contents of web pages and, as a result,
a better presentation of search results
TU Dresden Foundations of Semantic Web Technologies slide 33 of 50
Google Knowledge Graph• The US version of Google now also uses structured data (from Freebase)• For the disambiguation of search terms and direct presentation of
relevant information• Considers search terms no longer as simple strings (but as designator for
things)
TU Dresden Foundations of Semantic Web Technologies slide 34 of 50
TU Dresden Foundations of Semantic Web Technologies slide 35 of 50
TU Dresden Foundations of Semantic Web Technologies slide 36 of 50
TU Dresden Foundations of Semantic Web Technologies slide 37 of 50
Agenda
1 Linked (Open) Data
2 Semantic Web and HTMLRDFaMicroformatsGoogle Knowledge Graph
3 OWL ApplicationsOWL DL Application EDF EnergyOWL Profile Application BBC World CupSemantic Technologies in the Pharmaceutical Industry
4 Summary
TU Dresden Foundations of Semantic Web Technologies slide 38 of 50
Application Areas of OWL• OWL DL mainly is used outside the Semantic Web• Many applications in medicine and life sciences• Terminologies are traditionally popular there• Keyword indexing of documents• Semantic annotations of research data (e.g., gene sequences)• Classification used in health records and for statistics
TU Dresden Foundations of Semantic Web Technologies slide 39 of 50
Example Ontologies in OWL• OBO Foundry: The Open Biological and Biomedical Ontologies• BioPortal ontologies
– Terms for the electronic patient record– Annotation of gene sequences– Research into new drugs
• GO Gene Ontology• ICD International Classification of Diseases• FMA Formal Model of Anatomy• . . .
TU Dresden Foundations of Semantic Web Technologies slide 40 of 50
Use of OWL in the EDF EnergyManagement Advisor
Weather
Consumption
Building
. . .50 parameters
EMA
TipTip
Tip
TU Dresden Foundations of Semantic Web Technologies slide 41 of 50
EMA Ontology• Ontology models the domain and situation of customers• First modelling was strictly following existing binary decision diagrams
(simplified)• Reasoner “recognises” the situation of a customer• Certain situations correspond to tips• Original ontology used nominals and role chains• Was difficult to comprehend and reasoner performance was not optimal
TU Dresden Foundations of Semantic Web Technologies slide 42 of 50
EMA Ontology Improvements• Customers are directly modelled in an ABox• Per customer only simple ABox facts are loaded• Constructors that are problematic for reasoning are avoided: nominals,
role chains• TBox modelling was simplified• Ontology now allows incremental reasoning• Customers can be classified independent of each other in different
reasoner instances• Used for about 30.000 customers in France
TU Dresden Foundations of Semantic Web Technologies slide 43 of 50
BBC Website for the Football World Cup2010
• Ontology describes how facts about the world cup relate to each other• Such meta data are saved as RDF triples• For example, “Frank Lampard” is part of “England Squad” or “England
Squad” competed in “Group C” of the “FIFA World Cup 2010”
TU Dresden Foundations of Semantic Web Technologies slide 44 of 50
BBC Website for the Football World Cup2010
“The underlying publishing framework does not author contentdirectly; rather it publishes data about the content - metadata. Thepublished metadata describes the world cup content at a fairlylow-level of granularity, providing rich content relationships andsemantic navigation. By querying this published metadata we areable to create dynamic page aggregations for teams, groups andplayers."
Jem Rayfield, Senior Technical Architect, BBC News and Knowledgehttp://www.bbc.co.uk/blogs/bbcinternet/2010/07/bbc_world_cup_2010_dynamic_sem.html
TU Dresden Foundations of Semantic Web Technologies slide 45 of 50
BBC Website for the Football World Cup2010
• OWL inference used to enrich the data (forward chaining) and SPARQLused for queries
• Ontology contains texts contributed by journalists: stories, blogs, profiles,pictures, videos and statistics
• Journalistic contributions are automatically classified (NLP techniques)and manually tagged
• Statistics and game results from other sources are imported from XMLand mapped to ontological concepts
• Web pages are automatically assembled and contain relevant links• Approach also used for Olympia 2012
TU Dresden Foundations of Semantic Web Technologies slide 46 of 50
TU Dresden Foundations of Semantic Web Technologies slide 47 of 50
Semantic Web Technologies MeetPharmaceutical DataPhil Ashworth presents at the 2. European Semantic Technology Conference:http://videolectures.net/estc08_ashworth_swtpdi/
TU Dresden Foundations of Semantic Web Technologies slide 48 of 50
Agenda
1 Linked (Open) Data
2 Semantic Web and HTMLRDFaMicroformatsGoogle Knowledge Graph
3 OWL ApplicationsOWL DL Application EDF EnergyOWL Profile Application BBC World CupSemantic Technologies in the Pharmaceutical Industry
4 Summary
TU Dresden Foundations of Semantic Web Technologies slide 49 of 50
Summary• The amount of available machine processable data grows continuously• Semantics is needed to integrate data from different sources• Query and visualisation of data provides added value
• The processing and querying data from different sources increases thetransparency and facilitate research (tests of hypotheses becomes easier)
TU Dresden Foundations of Semantic Web Technologies slide 50 of 50