dbpedia-past-present-future
TRANSCRIPT
past, present & future
2007
2007
2014
???
Get me all soccer players, who played as goalkeeper for a club that has a stadium with more than 40.000 seats and who are born in a country with more than 10 million inhabitants
Structure in Wikipedia
bnjmbn
...
Infoboxes
???
How it all started
- 2006 - Sören Auer (busy with his PhD) asking people: “Wikipedia fact tables look like triples, don’t you want to write some extractor?”
- 6 months later: Sören wrote the extractor himself and asked Jens Lehmann to help with writing a paper
- Chris Bizer : “We are extracting people and place information from Wikipedia too – lets join efforts and call it DBpedia.”
- Kingsley Idehen: “I need a showcase for my Virtuoso triple store.”
Infobox Extraction
Wikitext
RDF
Taking a closer look
at heterogeneity…
- DBpedia Mappings wiki
Milestones
- 2008: DBpedia Live- 2009: Scala-Based framework- 2009: Mappings wiki- 2011: Internationalization- 2011: DBpedia Spotlight- 2014: DBpedia Association
Now
DBpedia 2014 (English):4.58 mio. entities and 583 mio. triples
131,2 mio. fact assertions (derived from infoboxes)
168,5 mio. triples representing Wikipedia structure
57,1 mio. links to external datasets
Localized DBpedia version for 125 languages, built from corresponding Wikipedia versions
12 DBpedia language chapters
Now
X
DBpedia has to evolve
- Fusion- Validation- NLP- Enterprize
Fusion
Validation
NLP
- Exploit the text…- Let different NLP tools & approaches
compete for the best quality (in a certain language)
- Need to define the interface (help needed)
Every Enterprise needs its DBpedia
- Represent common sense knowledge (DBpedia and other LOD datasets) as well as the specific enterprise knowledge
- Crystallization points for Linked Data intranets – an addition to SOA facilitating enterprise-wide data linking & integration
- Slicing & Dicing
Other ideas?
EU Projects
http://aligned-project.eu
http://smartdataweb.de/ http://www.freme-project.eu/
http://stack.lod2.eu/
http://geoknow.eu/
The soccer players (for the curious)
Thank you
Big thanks to Sören Auer & Markus Ackermann for slide contributions
Big thanks to Sören Auer & Markus Ackermann for slide contributions