research graph: connecting identifiers across research data infrastructures

16
Research Graph: Connecting Identifiers across Research Data Infrastructures Dr. Amir Aryani Project Manager, Australian National Data Service (ANDS), http://ands.org.au at Australian National University (ANU), http://people.anu.edu.au/amir.aryani Co-chair of DDRI working group, Research Data Alliance, http://rd-alliance.org email: [email protected], twitter.com/amir_at_ands http://orcid.org/0000-0002-4259-9774 This presentation on "Creating a Distributed Graph using RD-Switchboard" by Dr. Amir Aryani is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (http://creativecommons.org/licenses/by-sa/4.0/).

Upload: amiraryani

Post on 05-Apr-2017

28 views

Category:

Science


1 download

TRANSCRIPT

Research Graph: Connecting Identifiers across Research

Data InfrastructuresDr. Amir Aryani

Project Manager, Australian National Data Service (ANDS), http://ands.org.auat Australian National University (ANU), http://people.anu.edu.au/amir.aryaniCo-chair of DDRI working group, Research Data Alliance, http://rd-alliance.org

email: [email protected], twitter.com/amir_at_andshttp://orcid.org/0000-0002-4259-9774

This presentation on "Creating a Distributed Graph using RD-Switchboard" by Dr. Amir Aryani is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License (http://creativecommons.org/licenses/by-sa/4.0/).

Agenda

● Challenge of cross-platform discovery● Research Data Alliance working groups● Example of Research Graph connections and Neo4j queries● Impact of identifiers on connections

Challenge of cross-platform discovery

Question: What other researchers or research activities are connected to this work?”

● What other related datasets has been published by the authors?● What are the related publications to this dataset?● What is the research portfolio of the authors related to this work?● What research projects are related to this dataset?

researchgraph.org/schema/

Research Data Alliance Working Groups

Data Description Registry Interoperability (DDRI) WG

Goal: Enabling Cross-platform Discovery between research data infrastructures

https://www.rd-alliance.org/groups/data-description-registry-interoperability.html

Building Research Graph using Switchboard

RDA/WDS Scholarly Link Exchange (Scholix) WG

https://www.rd-alliance.org/groups/rdawds-scholarly-link-exchange-scholix-wg

Role of identifiers

● (D:Datacite:Dataset)--(P:CrossRef:Publication) ○ (doi) -- (doi)

● (D:Datacite:Dataset)--(P:CERN:Publication) ○ (doi) -- (doi)

● (D:Datacite:Dataset)--(R:CERN:Researcher) -- (P:CERN:Publication) ○ (doi) -- (inspireHepID) -- (doi)

● (D:DRYAD:Dataset)--(R:ORCID:Researcher)--(P:CERN:Publication) ○ (doi) -- (orcid) -- (doi)

● (g:ANDS:Grant)--(R:ANDS:Dataset)--(R:ORCID:Researcher) ○ (purl) -- (doi) -- (orcid)

Using Neo4j and RD-Switchboard

● match (n:dataset) where n.doi='10.4225/35/563159f223739' return n● match (n:dataset)--(r:researcher) where n.doi='10.4225/35/563159f223739'

return r.first_name, r.last_name

● match (r:orcid:researcher) where r.first_name='David' and r.last_name='Ellsworth' return r.orcid

Find all the publications by 3 degrees of separation

● match (n:dataset)-[*1..3]-(p:publication) where n.doi='10.4225/35/563159f223739' return n.title limit 50

Impact of identifiers

Impact of identifiers: DOI

Case A: match (d:dataset)--(m) where not exists(d.doi) return count(distinct(d))Case B: match (d:dataset)--(m) where exists(d.doi) return count(distinct(d))

Impact of identifiers: ORCID

Case A: match (r:researcher)--(m) where not exists(r.orcid) return count(distinct(r))Case B: match (r:researcher)--(m) where exists(r.orcid) return count(distinct(r))

Summary● President Identifiers has a crucial role in enabling connections in Research

Graph and enabling functionality of the Switchboard platform.● Research Graph distributed graph currently more than ten million nodes from

CERN, Dryad, ORCID, figshare, ANDS, NII Japan, and a number of other international partners. This data can be used for research in collaboration networks.

● The Research Graph is an open collaborative project. If you are interested to work with the project partners, please

○ Join DDRI Working Group: https://www.rd-alliance.org/groups/data-description-registry-interoperability.html

○ and contact [email protected]

● More information:○ http://researchgraph.org/○ https://github.com/researchgraph○ https://twitter.com/researchgraph