from taxonomies to linked data

37
Introduction to the Semantic Web From Taxonomies to Linked Open Data Andreas Blumauer Semantic Web Company www.semantic-web.at www.poolparty.biz

Upload: semantic-web-company

Post on 27-Jan-2015

107 views

Category:

Technology


3 download

DESCRIPTION

Taxonomies are well known means to create knowledge models. With the rise of Linked data, more expressive ways to create knowledge graphs can be introduced on a cost-efficient basis. Thesauri or ontologies help to create applications like Dynamic Semantic Publishing (DSP) or knowledge bases in enterprises.

TRANSCRIPT

Page 1: From Taxonomies to Linked Data

Introduction to the Semantic WebFrom Taxonomies to Linked Open Data

Andreas Blumauer

Semantic Web Companywww.semantic-web.at

www.poolparty.biz

Page 2: From Taxonomies to Linked Data

Welcome!

• CEO and managing partner of Semantic Web Company

• Master's degree in business informatics (University of Vienna)

• Experienced consultant in the areas of knowledge management systems, linked data, data analytics and semantic technologies

• Product management of PoolParty Semantic Suite

• Lecturer at several Universities of Applied Sciences and at Danube University Krems

• Editor of one the first comprehensive books in the area of the semantic web for German speaking community

Semantic Web Knowledge

Management Text Analytics Data Analytics Recommender Systems Semantic Search Taxonomies

Thesaurus Management

SKOS Enterprise Data Integration

Linked Data Social Tagging Business Intelligence Confluence Semantic Sharepoint

Page 3: From Taxonomies to Linked Data

What is the Semantic Web? What is Linked Data?

Which knowledge models are used on the Semantic Web?

Linked Data based Text MiningPoolParty Semantic Suite

Examples & Demos

Agenda

Page 4: From Taxonomies to Linked Data

Uniform Resource Identifiers

1. Give things / objects / resources a URI, not only webpages!

http://www.bbc.co.uk/nature/species/Red_Squirrel

Page 5: From Taxonomies to Linked Data

Semantic relations (‘triples’)

Sciurus vulgaris

Red squirrel

http://www.bbc.co.uk/nature/life/Sciuridae

2. Assign names to the resources and link resources to others in a meaningful way!

http://www.bbc.co.uk/nature/habitats/Taiga

is a

lives in has label

other label

Page 6: From Taxonomies to Linked Data

Establish links across data silos

3. Reuse existing information from all over the places, internally and externally!

Page 7: From Taxonomies to Linked Data

Resource Description Framework (RDF)

4. Use Semantic Web Standards to describe your resources in a machine-readable format!

Page 8: From Taxonomies to Linked Data

Mashups: Semantic Dynamic Publishing

5. Reuse assets and save production costs while providing highly dynamic content

Page 9: From Taxonomies to Linked Data

Semantic ‘Search’

6. Understand the user’s query on a conceptual level!

sciurus vulgaris search

‘sciurus vulgaris’ OR ‘red suirrel’

Show stories from the East of England dealing with mammals which also live in the Taiga

go

Page 10: From Taxonomies to Linked Data

Content recommendation

7. Provide users with recommendations from a bird’s eye perspective!

Animals with same• behaviour• habitats• family etc.

‘Similar’ news containing• related places• related animals etc.

• related videos (YouTube)• related images (Flickr)

Page 11: From Taxonomies to Linked Data

So, what exactly is ‘Linked Data’?

• ‘Linked Data’ is based on the Semantic Web standards by the World Wide Web Consortium (W3C)

• Linked Data is a way to link & integrate data based on web technologies

• Linked Data IS NOT Open Data automatically

• Linked Data can be used on the web AND on the intranet

• The Linked Open Data Cloud (LOD) consists of over 62 bio. Facts/triples (http://stats.lod2.eu/)

• Linked Data is an emerging technology for enterprise information integration

Data is no longer locked up in databases and its non-intuitive schemas.

To access linked data, simply use the web as your platform.

Graph-based models and SPARQL are in many cases much more powerful than the relational models and SQL.

Page 12: From Taxonomies to Linked Data

What is the Semantic Web? What is Linked Data?

Which knowledge models are used on the Semantic Web?Linked Data based Text Mining

PoolParty Semantic SuiteExamples & Demos

Agenda

Page 13: From Taxonomies to Linked Data

Taxonomies

Bushy-tailed squirrels

Squirrels

Rodents

Mammals

is a

is a

is a

is a

http://sws.geonames.org/2641455 (Norfolk)

England

United Kingdom

Europe

Earth

part of

part of

part of

part of

http://www.bbc.co.uk/nature/life/Red_Squirrel

Page 14: From Taxonomies to Linked Data

Thesaurus & light-weight ontology

foaf:logo

foaf:homepage

http://www.poolparty.biz/portfolio-item/poolparty-extractor/

Page 15: From Taxonomies to Linked Data

Heavy-weight ontology

Page 16: From Taxonomies to Linked Data

Querying graphs with SPARQL

RDF Store

SPARQL - endpoint

http-clientPREFIX p: <http://dbpedia.org/property/>SELECT ?film ?titleWHERE { ?film p:starring <http://dbpedia.org/resource/Melanie_Griffith> . ?film p:starring <http://dbpedia.org/resource/Bruce_Willis> . ?film <http://dbpedia.org/property/title> ?title .}

http://dbpedia.org/resource/The_Bonfire_of_the_Vanities_(film)

Page 17: From Taxonomies to Linked Data

Some advanced features of SPARQL

Shortest path between two nodes in a graph

Federated queries

Transitivities

Can also be used to query relational databases See: RDB to RDF Mapping Language

Page 18: From Taxonomies to Linked Data

What is the Semantic Web? What is Linked Data?

Which knowledge models are used on the Semantic Web?

Linked Data based Text MiningPoolParty Semantic Suite

Examples & Demos

Agenda

Page 19: From Taxonomies to Linked Data

Entity extraction based on Linked Data

http://www.bbc.co.uk/nature/life/Red_Squirrel

Bushy-tailed squirrels

Squirrels

is a

is a

http://sws.geonames.org/2641455

England

United Kingdom

part of

part of

- population: 832500- long: 1- lat: 52.66667

Page 21: From Taxonomies to Linked Data

What is the Semantic Web? What is Linked Data?

Which knowledge models are used on the Semantic Web?

Linked Data based Text MiningPoolParty Semantic Suite

Examples & Demos

Agenda

Page 22: From Taxonomies to Linked Data

PoolParty Semantic Suite

Page 23: From Taxonomies to Linked Data

PoolParty Semantic Search Server

Page 25: From Taxonomies to Linked Data

Thesaurus Server & Extractor

1. Create Thesaurus

2. Get documents annotated

3. Receive RDF for further processing

Page 26: From Taxonomies to Linked Data

PoolParty API

All RESTful services based on standards: Future-proof application development

Thesaurus as a Service: Query thesauri & embed them into your apps

Query thesauri + annotated documents: Semantic Search

Text Mining as a Service: All documents converted in RDF based on controlled vocabularies

SPARQL-endpoints for all knowledge graphs

Page 27: From Taxonomies to Linked Data

What is the Semantic Web? What is Linked Data?

Which knowledge models are used on the Semantic Web?

Linked Data based Text MiningPoolParty Semantic Suite

Examples & Demos

Agenda

Page 28: From Taxonomies to Linked Data

Example: Country Mashups

http://reegle.info/countries

Page 29: From Taxonomies to Linked Data

reegle API:3 levels of services – in 5 languages

Page 30: From Taxonomies to Linked Data

Tagging API Example

http://api.reegle.info

blog.okfn.org/2013/04/08/sustainable-energy-policy-demands-sustainable-open-data/

Page 31: From Taxonomies to Linked Data

Knowledge graph visualization

http://www.gbpn.org/databases-tools/glossary

Page 32: From Taxonomies to Linked Data

News Aggregator

http://www.gbpn.org/newsroom/news-aggregator

Page 33: From Taxonomies to Linked Data

Wolters Kluwer’s LOD strategy

http://vocabulary.wolterskluwer.de/

Page 34: From Taxonomies to Linked Data

Increased Search experiences

http://www.wienwin.at/

Page 35: From Taxonomies to Linked Data

Classification based on Wikipedia

Nanotechnology researchers use graphene in nanoelectronic chip breakthrough(Nanowerk News) Electronic components built from single molecules using chemical synthesis could pave the way for smaller, faster and more green and sustainable electronic devices. Now for the first time, a transistor made from just one molecular monolayer has been made to work where it really counts. On a computer chip.The molecular integrated circuit was created by a group of chemists and physicists from the Department of Chemistry Nano-Science Center at the University of Copenhagen and Chinese Academy of Sciences, Beijing. Their discovery has just been published online in Advanced Materials ("Ultrathin Reduced Graphene Oxide Films as Transparent Top-Contacts for Light Switchable Solid-State Molecular Junctions"). The breakthrough was made possible through an innovative use of the two dimensional carbon material graphene.Kasper Nørgaard is an associate professor in chemistry at the University of Copenhagen. He believes that the first advantage of the newly developed graphene chip will be to ease the testing of coming molecular electronic components. But he is also confident, that it represents a first step towards proper integrated molecular circuits.

Read more: http://www.nanowerk.com/news2/newsid=30975.php#ixzz2WqZ2KyHW

Page 36: From Taxonomies to Linked Data

PoolParty Enterprise Solutions

- Automatic content classification based on linked open data sources like DBpedia and Geonames

- Targeted content distribution inside the intranet support knowledge workers; provide semantic search

- Enterprise vocabulary managementunderstand the context, enrich content with definitions etc.

- Semantic knowledge basesUse PoolParty as a linked data management system and provide knowledge graphs in enterprise systems like Sharepoint

Several integrations with Sharepoint, Confluence, Drupal, Oracle Text, andWordPress were implemented.