Download - Exposing the data from NARCIS with VIVO
Exposing the data from NARCIS with VIVOChristophe Guéret (@cgueret)
Meertens Institute, January 18
Data Archiving and Networked Services
DANS is een instituut van KNAW en NWO
Data does not travel well...
● Publications from Frank van Harmelen● Decreasing number from system to system
148 38 13
Web of Science has 43 publications
and Google 283 !
Why is information lost ?
● Incentives○ Keeping one data source up to date is costly○ Keeping several is even more so!
● Standards ○ Information that can not be expressed is lost
● Confusion○ Re-invent the well & (partially) duplicate information
What do we want ?
● The dream: single, user-curated, consistent and up to date source that knows everything about someone
● Many aiming at being the one
OK then, let's do that !
● Need three ingredients:○ Web 2.0 platform○ Incentive○ Network effect
● Bonuses○ Generates a lot of data○ All the data is entered and store with the desired
schema
But...
● One-size-fits-all solutions are though to design. Users may be left unhappy.
● Incentives are hard: why would one update Academia.edu rather than ResearchGate ?
● There will always be several publishers of data and messy/incomplete/... data inputs
So let's try something else...
● Use the Web !
● Set up an ecosystem of○ Authorities: assign identifiers to things○ Vocabularies: define standards to express the data○ Publishers: expose the data they have○ Consumers: generate value out of the data
● Apply Linked Open Data principles to interconnect all the data
A research information ecosystem
Prototype worked on at DANS
● Use data harvested for NARCIS as CSV
● Use VIVO harvester to convert the CSV files and load the data into the VIVO portal
● So far○ running VIVO instance for the VU○ no link to other data sets○ no data consumer
Demo !
http://demo.datanetworkservice.nl:8080/vivo/