the drawbridge to knowledge - linking scholarly publications and research information in primo

Post on 12-Sep-2014

1.470 Views

Category:

Education

1 Downloads

Preview:

Click to see full reader

DESCRIPTION

Presentation at IGeLU 2013 together with Dominique Ritze, Mannheim University.

TRANSCRIPT

Dominique Ritze - University of Mannheim Lukas Koster - University of Amsterdam

The drawbridge to knowledge

IGeLU 2013 Berlin

Linking scholarly publications and research information in Primo

Presenter
Presentation Notes
LK

Access to scholarly research can be magnificent

http://www.flickr.com/photos/ericarnau/6248082438

Presenter
Presentation Notes
LK

Now and then you have to wait a bit for access

http://www.flickr.com/photos/22912005@N06/5042782586

Presenter
Presentation Notes
DR

Sometimes there is just no way to get there

http://www.flickr.com/photos/moosterbroek/6111989517

Presenter
Presentation Notes
LK

Importance of connecting scholarly publications, research information and data

http://www.flickr.com/photos/johanwieland/4142657901

Presenter
Presentation Notes
DR

Traceability

Presenter
Presentation Notes
DR

Traceability

Presenter
Presentation Notes
DR Publications based on research might contain information about background, proceedings, methods, data, etc. -> traceability?

Reproducibility

Presenter
Presentation Notes
DR

Comparability

Presenter
Presentation Notes
DR

Historical background Academic libraries support teaching and research by providing access to 'scholarly information'

http://www.flickr.com/photos/uon/4667927963

http://www.flickr.com/photos/marsdd/2986989396/

Presenter
Presentation Notes
LK Traditionally, support for both domains coincided: providing access to scholarly information, in the form of publications (books, articles, etc.) Either: of interest for teaching and/or researchers Or: results of research

Digital era

1 - Back end/administrative

ILS Acquisitions Cataloguing Circulation

CRIS Projects Funding

2 - Front end/end user 3 - Research process

OPAC Discovery

Experiments Collaboration Writing Data

Presenter
Presentation Notes
LK Digital era (LK+DR) (explanation of systems current situation) First: systems for administrative (back end) tasks Libraries: cataloguing, acquisition, circulation Research: CRIS (projects, funding, etc.) Second: end user (front end) tools, web Libraries: discovery, delivery Third: Research process itself (data, processing, collaboration, publishing)

Current research information situation

Isolated silos

ILS Discovery CRIS RDM VRE

books journals

books articles

books articles project funding people organisations

data books articles project data people

Library systems focus Research information focus Library +

Presenter
Presentation Notes
LK Library systems still focused on workflow around traditional publications New research support tasks in separate systems, if at all. Isolated silos ILS+ (publications, journals, databases) (Ex Libris: Aleph, Voyager, Alma) Discovery (publications) (Ex Libris: Primo) CRIS (Current Research Information System) (Ex Libris: x) RDM (Research Data Management) (Ex Libris: Rosetta...) VRE (Virtual Research Environment) (Ex Libris: x)

Solutions

Integrated back office data and services infrastructure Long term?

?

books articles project funding data people organisations

Presenter
Presentation Notes
LK Back office infrastructure + services Long term, if at all.

Solutions

Front end services Short term

ILS Discovery CRIS RDM VRE

books journals

books articles

books articles project funding people organisations

data books articles project data people

Presenter
Presentation Notes
DR Short term: use existing systems+databases, integrate into existing or new services

Primo: discovery of ‘all’ ? scholarly information

Presenter
Presentation Notes
DR

Primo: discovery of ‘all’ ! scholarly information http://www.researchobject.org

Presenter
Presentation Notes
DR

Primo: current situation

Bibliographic metadata formats usually don’t have fields for research information In Primo: Add-ons needed, two approaches: • Harvesting extra author and publication identifiers • Text mining + semantic analysis

http://commons.wikimedia.org/wiki/File:Bundesarchiv_Bild_183-17031-0004,_Bergarbeiter_bohrend.jpg

http

://w

ww

.flic

kr.c

om/p

hoto

s/20

9602

56@

N04

/477

5687

776

Presenter
Presentation Notes
DR No research information metadata, both on datasource and discovery tool sides!

University of Amsterdam: PrimoResearchLinks

Primo Central

Primo Local

eJournal

NARCIS

CRIS Repository ALEPH

Background - metadata infrastructure

Website Personal pages

Presenter
Presentation Notes
LK NARCIS is Dutch national research portal, harvests all university institutional repositories.

University of Amsterdam: PrimoResearchLinks

Primo Local

NARCIS

CRIS Repository

Background - metadata infrastructure

Presenter
Presentation Notes
Best: link back from Primo Local Repository records to local CRIS. But CRIS not accessible for end users/etc. So will use links to Dutch National Research portal NARCIS

University of Amsterdam: PrimoResearchLinks

Primo Local

NARCIS

Repository

Background - metadata infrastructure

University of Amsterdam: PrimoResearchLinks http://dare.uva.nl

Presenter
Presentation Notes
Institutional Repository

University of Amsterdam: PrimoResearchLinks

Dublin Core

Record URL

PDF URL

Author names

Presenter
Presentation Notes
Institutional Repository

University of Amsterdam: PrimoResearchLinks

Publication ID

Internal Author ID

DAI Author ID

MODS/DIDL

Presenter
Presentation Notes
MODS/DIDL OAI harvest

University of Amsterdam: PrimoResearchLinks

PNX

NARCIS_link : http://www.narcis.nl/publication/RecordID/{{addata/lad02}} http://www.narcis.nl/publication/RecordID/oai:uva.nl:555

NARCIS_author : http://www.narcis.nl/person/info:eu-repo/dai/nl/{{addata/lad04}} http://www.narcis.nl/person/info:eu-repo/dai/nl/070563543

Mapping Table Delivery/Templates

Presenter
Presentation Notes
Primo configuration: PNX (via Normalisation Rules) + Mapping Table Advantages: when changing base-urls, display labels, only need to change mapping table, no Update process

University of Amsterdam: PrimoResearchLinks

Presenter
Presentation Notes
UvA Primo Test server with added links for institutional repository

University of Amsterdam: PrimoResearchLinks

http://www.narcis.nl/publication/RecordID/oai:uva.nl:555

Publication: Plain bibliographic information Identical to original repository Identical to Primo

Presenter
Presentation Notes
NARCIS Publication record, exactly the same info as Repository, Primo local. Not interesting

University of Amsterdam: PrimoResearchLinks

http://www.narcis.nl/person/info:eu-repo/dai/nl/070563543

Author: Lots of extra information Expertise Connections Projects Publications

Presenter
Presentation Notes
NARCIS Author record: extra information about expertise, projects, publications.

University of Amsterdam: PrimoResearchLinks

http://narcis.nl

Presenter
Presentation Notes
Pilot project Enhance Publications, small subset: linking research information together (RDF) + user interface

University of Amsterdam: PrimoResearchLinks

Link to Enhanced Publication RDF

Presenter
Presentation Notes
Very small number of Amsterdam repository records in Enhanced Publication pilot subset. When Enhanced Publication, then also RDF

University of Amsterdam: PrimoResearchLinks

http://www.narcis.nl/vpub/RecordID/ReM:oai:uva.nl:314296

Presenter
Presentation Notes
An Enhanced Publication

University of Amsterdam: PrimoResearchLinks <escape-agents:AcademicStaff rdf:about="http://www.narcis.nl/person/info:eu-repo/dai/nl/074673742"> <foaf:familyName>Stokhof</foaf:familyName> <escape-project:worksOn rdf:resource="http://www.narcis.nl/search/coll/person/dd_cat/D36000"/> <escape-project:worksOn rdf:resource="http://www.narcis.nl/search/coll/person/dd_cat/D32000"/> <foaf:img>http://www.narcis.nl/res/images/foto/PRS1237803.jpg</foaf:img> <dcterms:title>M.J.B. Stokhof</dcterms:title> <foaf:pastProject rdf:resource="http://www.narcis.nl/research/RecordID/OND1332663"/> <foaf:name>M.J.B.</foaf:name> <dai:daiId>info:eu-repo/dai/nl/074673742</dai:daiId> </escape-agents:AcademicStaff> <escape-project:Topic rdf:about="http://www.narcis.nl/search/coll/person/dd_cat/D36000"> <escape-project:isWorkedOnBy rdf:resource="http://www.narcis.nl/person/info:eu-repo/dai/nl/074673742"/> <dcterms:title>Language and literature studies</dcterms:title> </escape-project:Topic> <escape-project:Topic rdf:about="http://www.narcis.nl/search/coll/person/dd_cat/D32000"> <escape-project:isWorkedOnBy rdf:resource="http://www.narcis.nl/person/info:eu-repo/dai/nl/074673742"/> <dcterms:title>Philosophy</dcterms:title> </escape-project:Topic> <escape-agents:PhDCandidate rdf:about="http://www.narcis.nl/person/info:eu-repo/dai/nl/304351571"> <foaf:familyName>Bax</foaf:familyName> <foaf:img>http://www.narcis.nl/res/images/foto/PRS1317037.jpg</foaf:img> <dcterms:title>C. Bax</dcterms:title> <foaf:pastProject rdf:resource="http://www.narcis.nl/research/RecordID/OND1332663"/> <foaf:name>C.</foaf:name> <dai:daiId>info:eu-repo/dai/nl/304351571</dai:daiId> </escape-agents:PhDCandidate> <foaf:Project rdf:about="http://www.narcis.nl/research/RecordID/OND1332663"> <escape-project:endDate>09 / 2009</escape-project:endDate> <dcterms:title>Subjectivity after Wittgenstein. WittgensteinÂ’s embodied and embedded subject and the debate about the death of man</dcterms:title> <escape-project:startDate>01 / 2008</escape-project:startDate>

RDF

http://www.narcis.nl/unapi?id=ReM:oai:uva.nl:314296&format=rdf

Presenter
Presentation Notes
The RDF

University of Amsterdam: PrimoResearchLinks

Presenter
Presentation Notes
For now, only the link to Author in NARCIS will give extra information

University of Amsterdam: PrimoResearchLinks

Extra Tab Resolved RDF

Research information

Presenter
Presentation Notes
Better would be: Extra Research Information tab

University of Amsterdam: Vivo Pilot http://vivoweb.org/

Presenter
Presentation Notes
Vivo: Open Source Linked Data tool for connecting research information from different sources.

Mannheim University Library: InFoLiS

Primo Local

da|ra linked

integrated

Goal

Presenter
Presentation Notes
DR

Mannheim University Library: InFoLiS

As shown in the Eurobarometer1

1 And additional sources or the and

Table 1: population forecast for Germany (database: version 5)

Mannheim University Library: InFoLiS

Therefore, the data of from 1990 to 2003 are considered.

Figure 1: Random sample from 2004 to 2010 (source: 2005a)

References to research data

Mannheim University Library: InFoLiS

Extraction algorithm

Mannheim University Library: InFoLiS

primo enrichments

Mannheim University Library: InFoLiS

Mannheim University Library: InFoLiS

Link File

Mannheim University Library: InFoLiS

Publication ID

link to research data

Mannheim University Library: InFoLiS

linked research data

Mannheim University Library: InFoLiS

Mannheim University Library: InFoLiS

Mannheim University Library: InFoLiS

standardized interface, e.g. OAI-PMH

convert data from database dump

Mannheim University Library: InFoLiS

Experiences Mannheim

● Metadata of research data and publications rarely contain links

● Many different reference styles ● References to research data mostly incomplete ● Integration can be very hard, e.g. adapting the

metadata format and performing transformations ● First evaluation result?

Presenter
Presentation Notes
DR

Experiences Amsterdam

● Local Amsterdam repository and CRIS not optimal for reuse and linking (yet)

● No references to research datasets, projects in local, national and global systems (yet)

● Research project information not available ● Library systems in general not equipped for

○ non-bibliographic information ○ processing linked data, RDF

● Primo CAN be used for this, with lots of extra work

Presenter
Presentation Notes
LK

Pros and Cons (Mannheim)

Pro ● Links can be easily

integrated, e.g. by using Enrichments

● Integration of

research data to have only one system

Con ● Fulltexts are needed to identify references ● Integration very costly

without standardized interface

Presenter
Presentation Notes
DR

Pros and Cons (Amsterdam)

Pro ● Identifiers are ready to

use ● Links can be easily

integrated ● Extra information can be

embedded in extra tab in UI, but with workarounds

Con ● Identifiers, RDF not

always available in data sources

● PNX format not suited

for research information

● Integration very costly

without standardized interface

Presenter
Presentation Notes
LK

Recommendations Ex Libris

Primo • Include non-bibliographic metadata in PNX

o (links section, additional data section, new research section?)

• Harvest RDF • Resolve RDF/URIs

o During normalisation (in PNX) o On the fly (in search result)

• Easily create extra tab with PNX data • Publish Primo Central index as linked open data • Primo Central: harvest research information

Presenter
Presentation Notes
LK+DR

Recommendations Publishers, Aggregators

• Add additional references • Add additional metadata

o Publication ID o Author ID o Research project ID o Dataset ID

• Publish as Linked Open Data

Recommendations Researchers, Authors

• Proper citation of research data, e.g. by using unique identifiers like DOIs

• Publish and register research data as open as possible

● Many interesting ideas/projects:

○ OpenScience ○ DataCite ○ OpenAIRE ○ DCC ○ RD-A ○ W3C-ROSC

Like a bridge over troubled water

When times get rough When you’re down and out

I will ease your mind

top related