8th dbpedia meeting / california 2016
TRANSCRIPT
DBpedia state of affairs8th DBpedia Meeting, Oct 27th 2016, Sunnyvale CA
Dimitris Kontokostas, DBpedia Association & AKSW/KILThttp://dbpedia.org
Contents - Technical updates
- Overview & updates
- What we’re busy with
- Organizational updates
- DBpedia Association
- Support & funding
Contents - Technical updates
- Overview
- What we’re busy with
- Organizational updates
- DBpedia Association
- Support & funding
2016-04 Release (highlights)
- 9.5 billion facts
- 130 languages, Commons, Wikidata
- Citations & cited facts (exp.)
- Raw duplicate property cleanup
- Wikidata-based view
Data Quality- 38% improvement in schema conformant data (2015-04 -> 2016-04)
- Measured with RDFUnit
- Major data cleanup processes
- Mapping validation tool (RDFUnit & RML)
○ From 18h to 30s
○ Mapping refinements
○ Ontology cleanup
Data Access- Main SPARQL endpoint see here for details
- Now 11M Hits/Day (09/16) - Fun fact: Hoteli Maestral one of our very high traffic users
- DataID- DCAT / VOID ++ extension- Dynamic Download page- Statistics generation- Dockerization
NLP / NIF CorpusProvide NIF Corpus (text & links) with each release
- Start with abstracts plan for whole page
- Fact extraction challenges
NIF: NLP Interchange Format (RDF vocabulary)
Other things we are busy with- Shortening time-to-release (~5 months)
- Wikidata data transform & integration
- Global (inter-wikipedia) pagerank (to be announced soon)
- Github-based Backlinking (to be announced soon)
- Mapping alignment (UPM)
- Mapping discovery / inference & provenance
Integration of (other) sources (Immediate) Goals:
- Unify data from all Wikimedia projects ○ Start by basing on Wikidata IDs○ Mint our own IDs and go beyond Wikimedia
- Company dataset (in progress)- Media, publishing, law & legislation (in discussion)
Contents - Technical updates
- Overview & updates
- What we’re busy with
- Organizational updates
- DBpedia Association
- Support & funding
DBpedia Association (non-profit)- Founded in 2014
- Operational since January 2016
- Support from DBpedia founding members (Soeren, Chris & Kinsley)
- Draft charter online http://wiki.dbpedia.org/dbpedia-association
○ All data published by the DBpedia Association should be made available free of charge under
a license equivalent to CC-0 or CC-BY without further restriction on commercial use and
redistribution.
Membership type Joined Applying
Students 18 6
individual/Self-Employed 1 18
SME, Research Institute 1 2
Start-up/Small Research Group 4
Voluntary Payment Option for membership fee
- Applicants can apply for reduction- Board approval needed- Fees are reduced to 20€ per year- Member can pay full fee voluntarily
DBpedia Members
Association goals (for 2017)- Official language & topic chapters
- E.g. Dutch, German, Media, Law & Legislation, ...
- Better community synchronization
- Improve services and secure funding
Funding (Why) http://wiki.dbpedia.org/why-is-dbpedia-so-important
- Basic side-financing via H2020 and German national funding for
- Core staff
- Some event & PR
- However:
- no funding for hosting
- no sustainable funding (bound to projects)
- Services are stable, but lot’s of “what if’s”
We need organisations to join and provide support, links, backlinks & data.
Funding (What)- Increase quality of hosting and data
- Provide support for community issues
- Better publicity and exploitation
- Merge community contributions
- Systematically develop & improve DBpedia’s public data, software and
services
Funding Strategies4 main strategies developed during the DBpedia board discussion:
● Public fundraising (donation campaigns)● Direct fundraising (direct company contact)● Membership fees● Community/Project fundraising (i.e. H2020, COST, ITN)
Would love to hear your ideas to make DBpedia self-sustainable
Thank you for your attention!
Questions?
(Available until Tuesday to meet & discuss / ping me!!!)
More information: http://dbpedia.orgJoin as a member: http://dbpedia.org/membership
Slide contributions by Julia Holze & Sebastian Hellmann