LOD2 CKAN Workshop Vienna: PoolParty for semantic search and vocabulary management for CKAN, Thomas...
DESCRIPTION
Slides from the LOD2 CKAN Workshop Vienna, held on 15 June 2011 in Vienna: metadata management and semantic search in open data catalogue systems using PoolParty (http://www.poolparty.biz), by Thomas Schandl (Semantic Web Company, SWC). (License: CC-BY 3.0)
TRANSCRIPT
PoolParty for semantic search and vocabulary management for CKAN
Mag. Thomas Schandl
Semantic Web Company
Agenda
• Live demo: PoolParty Semantic Search
• Scenarios for semantic search
• The role of thesauri in semantic search
• PoolParty demo using OpenData and CKAN as an example
OGD/CKAN Challenges
• Where to search? Distributed national and international data holdings
• Which search terms to use? Inconsistent metadata and keyword tagging, different languages and terminology
• Various other catalogue systems
© Semantic Web Company – http://www.semantic-web.at/
Some thoughts on the Semantic Web
“In the Semantic Web, it is not the Semantic which is new, it is the Web which is new”
Dr. Chris Welty, IBM Watson Research Center
Some thoughts on the Semantic Web
“A little Semantics Goes a Long Way”
Prof. Jim Hendler, Rensselaer Polytechnic Institute
PoolParty Overview
• Main application areas:
– SKOS thesaurus management
– Linked Data (publishing & consuming)
– Semantic search & semantic indexing
• Connecting CKAN and PoolParty in the LOD2 project
Semantic Search Demo http://bit.ly/semantic_search
Semantic search has many faces
http://www.flickr.com/photos/techburst/2796421248/
Further semantic search scenarios
Situations in which semantic search can help
• I can't remember how to spell the search term
• I can't remember exactly what I was looking for
• I want to gain background knowledge about a certain document
• I want to know more about this entity in a certain context
• I want to see facts from different sources describing this entity
• I want to search in different languages simultaneously
• I forgot some of the names for the entity I'm looking for
• I want the software to understand what I mean by “Jaguar”
Find information faster – Auto-Complete
I can't remember how to spell the search term
To provide powerful auto-complete for enterprise search scenarios as well, you need to establish an enterprise vocabulary.
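As a rough illustration of vocabulary-driven auto-complete, the sketch below does a case-insensitive prefix lookup over a small sorted label list. The vocabulary entries and the function names `build_index` and `autocomplete` are made up for this example; they are not PoolParty's API.

```python
from bisect import bisect_left

# Hypothetical enterprise vocabulary; in practice these labels would come
# from a managed thesaurus.
VOCABULARY = [
    "selective catalytic reduction",
    "selective non catalytic reduction",
    "semantic search",
    "SNCR",
    "solar energy",
]


def build_index(labels):
    """Return (lowercased label, original label) pairs, sorted for prefix lookup."""
    return sorted((label.lower(), label) for label in labels)


def autocomplete(index, prefix, limit=5):
    """Return up to `limit` labels that start with `prefix`, ignoring case."""
    key = prefix.lower()
    # Binary search for the first entry that is >= the prefix.
    start = bisect_left(index, (key,))
    matches = []
    for lowered, original in index[start:]:
        if not lowered.startswith(key):
            break  # sorted order: no later entry can match
        matches.append(original)
        if len(matches) == limit:
            break
    return matches


index = build_index(VOCABULARY)
print(autocomplete(index, "sel"))
```

Because suggestions come only from the controlled vocabulary, every suggested term is one the search index is known to understand.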
Reveal hidden information – Status quo
Query: SNCR
Query: SNCR OR “Selective non-
I forgot some of the names for the entity I'm looking for
Reveal hidden information with query expansion
Query: SNCR, expanded to: SNCR OR "selective non catalytic reduction"
• SNCR (alternative label)
• selective non catalytic reduction (preferred label)
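The SNCR example can be sketched in a few lines: look up the concept that carries the search term as one of its labels, then OR all of that concept's labels together. The thesaurus structure and the names `quote` and `expand_query` are assumptions made for illustration; the second concept is added only to show that unrelated concepts are left out.

```python
# Hypothetical thesaurus excerpt: each concept has one preferred label and
# any number of alternative labels (modelled on SKOS prefLabel/altLabel).
THESAURUS = [
    {"pref": "selective non catalytic reduction", "alt": ["SNCR"]},
    {"pref": "selective catalytic reduction", "alt": ["SCR"]},
]


def quote(label):
    """Wrap multi-word labels in double quotes so they are searched as phrases."""
    return f'"{label}"' if " " in label else label


def expand_query(term, thesaurus):
    """Return an OR query over all labels of the concept that matches `term`."""
    wanted = term.lower()
    for concept in thesaurus:
        labels = [concept["pref"], *concept["alt"]]
        if wanted in (label.lower() for label in labels):
            others = [label for label in labels if label.lower() != wanted]
            return " OR ".join(quote(label) for label in [term, *others])
    return quote(term)  # unknown term: leave the query unexpanded


print(expand_query("SNCR", THESAURUS))
```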
Multi-lingual search based on a thesaurus
Query: clean energy, expanded to: clean energy OR energía limpia
• clean energy (preferred label @en)
• energía limpia (preferred label @es)
I want to search in different languages simultaneously
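A minimal sketch of the multilingual case, assuming each concept stores one preferred label per language tag. The data layout and the function name `expand_across_languages` are hypothetical.

```python
# Hypothetical concept list: language tag -> preferred label.
CONCEPTS = [
    {"en": "clean energy", "es": "energía limpia"},
]


def expand_across_languages(term, concepts):
    """Return the search term followed by its preferred labels in other languages."""
    wanted = term.lower()
    for labels_by_lang in concepts:
        if wanted in (label.lower() for label in labels_by_lang.values()):
            # Keep the user's own term first, then the other languages by tag.
            others = [
                label
                for _lang, label in sorted(labels_by_lang.items())
                if label.lower() != wanted
            ]
            return [term, *others]
    return [term]  # term not in the thesaurus: search for it as typed


print(" OR ".join(expand_across_languages("clean energy", CONCEPTS)))
```

The lookup works in either direction, so a Spanish query is expanded with the English label in the same way.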
Reveal hidden information and relations
Find documents or images related to any other text.
http://poolparty.punkt.at/demozone
I want to gain background knowledge about a certain document
Find more specific information with faceted search
• Facets support structured queries
• Facets help to drill down into search results and adapt dynamically
• Zero-result queries won't happen anymore
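One way to see why facets avoid zero-result queries: the facet values offered to the user are counted from the current result set, so every selectable value has at least one hit. The sketch below uses made-up records and hypothetical function names `facet_counts` and `drill_down`.

```python
from collections import Counter

# Hypothetical search results with two facet fields.
RESULTS = [
    {"title": "Energy statistics", "country": "Austria", "format": "CSV"},
    {"title": "Air quality", "country": "Austria", "format": "XML"},
    {"title": "Wind farms", "country": "UK", "format": "CSV"},
]


def facet_counts(results, fields):
    """Count how many results carry each value of each facet field."""
    return {name: Counter(r[name] for r in results) for name in fields}


def drill_down(results, field_name, value):
    """Keep only the results whose facet field has the selected value."""
    return [r for r in results if r[field_name] == value]


counts = facet_counts(RESULTS, ["country", "format"])
austria = drill_down(RESULTS, "country", "Austria")
# After drilling down, the remaining facets are recomputed from the
# narrowed result set, which is what "adapt dynamically" refers to.
austria_counts = facet_counts(austria, ["format"])
print(counts["country"])
print(austria_counts["format"])
```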
Complex queries with faceted search over linked data
“Show me all airlines whose parent company is Lufthansa”
http://dbpedia.neofonie.de/
My Energy-Dossier about
Find linked information – Status quo
I want to see facts from different sources describing this entity.
The user has to manually put together energy-related information about a country.
360° views: Find linked information
Energy-related information about countries is “mashed” automatically by using “linked data”
http://www.reegle.info/countries
The role of thesauri in semantic search
http://www.flickr.com/photos/techburst/2796421248/
SKOS – Open Standard for Thesauri
• SKOS = Simple Knowledge Organisation System(s)
• Goal …
– Simple, flexible, extensible, machine-understandable representation for…
• Thesauri
• Classification Schemes
• Taxonomies
• Subject Headings
• Other types of ‘controlled vocabulary’…
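To make the idea of a machine-understandable vocabulary concrete, here is a small in-memory sketch. SKOS itself is an RDF vocabulary; the dataclass below only mirrors three of its properties (skos:prefLabel, skos:altLabel, skos:broader). The class, the function `ancestors`, and the example.org URIs are all hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    uri: str
    pref_label: dict                                  # language tag -> skos:prefLabel
    alt_labels: list = field(default_factory=list)    # skos:altLabel values
    broader: list = field(default_factory=list)       # URIs of skos:broader concepts


def ancestors(uri, concepts):
    """Return the URIs of all broader concepts reachable from `uri`."""
    seen = []
    stack = list(concepts[uri].broader)
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.append(current)
            stack.extend(concepts[current].broader)
    return seen


BASE = "http://example.org/"  # placeholder namespace
concepts = {
    BASE + "energy": Concept(BASE + "energy", {"en": "energy"}),
    BASE + "renewable": Concept(
        BASE + "renewable", {"en": "renewable energy"}, broader=[BASE + "energy"]
    ),
    BASE + "solar": Concept(
        BASE + "solar",
        {"en": "solar energy", "es": "energía solar"},
        alt_labels=["solar power"],
        broader=[BASE + "renewable"],
    ),
}

print(ancestors(BASE + "solar", concepts))
```

Walking the broader links is what lets a search for a general concept also reach documents tagged with a narrower one.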
The role of thesauri in semantic search (contd.)
Thesaurus as the central point to control:
• labels & query expansion
• facets
• refine search mechanisms
• metadata integration
Content annotation: Traditional approach
http://www.punkt.at/
Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices.
• Apple
• application
• merchandise
• iPod touch
• iPad
• iPhone
Semantic Web approach: Concepts, NOT simply text
Apple is in the process of launching an application to allow iPhone, iPad and iPod Touch users to purchase Apple merchandise straight from their devices.
http://my.com/Apple
Apple
Apple Inc.
http://my.com/iPhone
http://my.com/iPhone3G
iPhone
iPhone 3GS
iPhone 3G
http://my.com/smartphone
PoolParty Tag Suggestions
• Support for different formats (html, doc, pdf, ppt, …)
• Thesaurus-based extraction
• Can be integrated with CMS, CRM etc.
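Thesaurus-based extraction can be sketched as a lookup from labels to concept URIs: the tagger suggests concepts rather than raw keywords. The sketch reuses the my.com URIs and the sample sentence from the slides, but the pairing of labels with URIs is an assumed reading of the diagram, and `suggest_tags` is a hypothetical function, not PoolParty's actual service.

```python
import re

# Label -> concept URI. Several labels may point to the same concept.
LABELS = {
    "Apple": "http://my.com/Apple",
    "Apple Inc.": "http://my.com/Apple",
    "iPhone": "http://my.com/iPhone",
    "iPhone 3G": "http://my.com/iPhone3G",
}

TEXT = (
    "Apple is in the process of launching an application to allow iPhone, "
    "iPad and iPod Touch users to purchase Apple merchandise straight "
    "from their devices."
)


def suggest_tags(text, labels):
    """Return the sorted concept URIs whose labels occur in `text` as whole words."""
    found = set()
    for label, uri in labels.items():
        # Case-sensitive match that must not be embedded in a longer word.
        pattern = r"(?<!\w)" + re.escape(label) + r"(?!\w)"
        if re.search(pattern, text):
            found.add(uri)
    return sorted(found)


print(suggest_tags(TEXT, LABELS))
```

"Apple" occurs twice in the text but yields a single concept, which is the difference between tagging with concepts and tagging with strings.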
Interplay between CKAN and PoolParty
CKAN UK
CKAN Norway
CKAN Austria
CKAN Netherlands
Service for tag suggestions from the thesaurus
Other data sources
Interplay between CKAN and PoolParty
CKAN UK
CKAN Norway
CKAN Austria
CKAN Netherlands
Indexing of metadata
Other data sources
PoolParty System Architecture
Search Services
Search Application
Collector
CKAN Austria
Semantic Indexer
Index
RDF Cartridge
CKAN UK
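The collector and indexer roles in this diagram can be pictured roughly as follows: gather dataset records from several catalogues, map their local tags to shared thesaurus concepts, and build one index keyed by concept. Everything here is an assumption made for illustration: the record shape is simplified, the dataset names and the tag mapping are invented, and no real CKAN or PoolParty interface is called.

```python
from collections import defaultdict

# Hypothetical, simplified dataset records per catalogue.
CATALOGUES = {
    "CKAN Austria": [
        {"name": "energiebilanz", "tags": ["Energie"]},
    ],
    "CKAN UK": [
        {"name": "energy-consumption", "tags": ["energy"]},
        {"name": "road-traffic", "tags": ["transport"]},
    ],
}

# Hypothetical mapping from local (lowercased) tags to shared concepts.
TAG_TO_CONCEPT = {"energie": "energy", "energy": "energy", "transport": "transport"}


def collect(catalogues):
    """Yield (catalogue name, dataset record) pairs from every catalogue."""
    for catalogue, datasets in catalogues.items():
        for dataset in datasets:
            yield catalogue, dataset


def build_concept_index(catalogues, tag_to_concept):
    """Map each shared concept to the (catalogue, dataset name) pairs tagged with it."""
    index = defaultdict(list)
    for catalogue, dataset in collect(catalogues):
        concepts = {
            tag_to_concept[tag.lower()]
            for tag in dataset["tags"]
            if tag.lower() in tag_to_concept
        }
        for concept in sorted(concepts):
            index[concept].append((catalogue, dataset["name"]))
    return dict(index)


concept_index = build_concept_index(CATALOGUES, TAG_TO_CONCEPT)
print(concept_index["energy"])
```

A single lookup for the concept then returns datasets from both catalogues, even though one was tagged in German and the other in English.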
Connecting CKAN and PoolParty in the LOD2 project
• Where to search? Central search across distributed systems
• Which search terms? Harmonised metadata through multilingual, semantic tags
• Further features
– Categorisation
– Autocomplete
– Recommender for similar data sources
PoolParty Demo
• Homepage: http://poolparty.punkt.at/PoolParty/
• Documentation: https://grips.punkt.at/display/POOLDOKU/
Latest Update:
Version 2.9.2, May 2011
Thank you for your attention!
Mag. Thomas Schandl
Semantic Web Company
GmbH
Lerchenfelder Gürtel 43
A-1160 Wien
Tel. +43 1 402 12 35