
RDM@UNSW

RDM@UNSW Townhall

30 April 2020

Research Data Management

High Performance Computing (HPC)
• Free for researchers and HDR candidates
• As a service: NCI – Gadi (100 million compute hours)
• Katana – local HPC cluster (24 million compute hours)

Cloud Computing
• Cloud services: Amazon AWS, Microsoft Azure, NECTAR
• Seed money for exploring research in the cloud

Research Data

•Help with Data Management training, issues, information

•Assistance with data moves, storage, planning

[email protected]

Research Technology Services

Research technology training
• 40+ courses per year on campus and online
• Free to researchers and HDR candidates

Consulting
• Help with code and using HPC
• Data Classification, Management, and tools help
• Advising on, purchasing and configuring HPC equipment

Hacky Hour
• Casual meetup 3pm every Thursday in Penny Lane (currently on Teams)
• Bring your problems with code, HPC, data
• Presentations about research technologies

[email protected]

RDM Topics

• RDM online Training

• Data Classification & Tools

• Specialist Tools

• Data Publication

• Q&A

RDM@UNSW

Dr Adrian W. Chew

Data Management Training Consultant, Researcher Development

Adjunct Lecturer, School of Education, Arts & Social Sciences

Research Data Management

Strategy 2025

RDM Initiative

Division of Research

• Researcher Development (Training + Engagement)

• Research Technology Services (Data Team)

• PVC-RI

• IT

• Library

• Data Governance

• Research Integrity

PVC – Research Infrastructure (Initiative Owner)

People

Tools

Policy

Supporting/Related Documents

• Data Governance Policy
• Data Classification Standard
• Data Handling Guidelines
• UNSW Privacy Management Plan
• UNSW Research Code of Conduct
• Authorship and Resolving Disputes Between Authors Procedure
• Cold Storage Procedure
• Collibra Data Governance Centre: https://unsw.collibra.com
• Conflict of Interest Policy
• Handling Allegations of Research Misconduct Procedure
• Handling Research Material & Data Procedure
• Information Security Management System
• Insider Trading Policy
• Intellectual Property Policy
• IT Security Policy – Information Security Management System (ISMS)
• IT Security Standards
• Paid Outside Work by Academic Staff Policy
• Radiation Research Safety Procedure
• Recordkeeping Policy
• Research Export Controls Procedure
• Statement of Authorship and Location of Data Form
• UNSW Register of Delegations
• UNSW Risk Management Framework

Data Consults + Solutions

Classify all Research Data using UNSW Data Classification Standard

Use UNSW-Supported Data Platforms for Research Data

Have a ‘Living’ RDM Plan for each UNSW Research Project

Complete Core RDM online Training Modules

A Single Point of Contact

[email protected]

RDM ONLINE TRAINING

RDMoT

RDMoT

HDR Candidates

Professional Staff

Academic Staff (i.e., Chief/Co-Investigators)

Classify all Research Data using UNSW Data Classification Standard

Have a ‘Living’ RDM Plan for each UNSW Research Project

Use UNSW Supported Data Platforms for Research Data

1003

Findings & Results of HDR Intro RDMoT

Initial Release

• Module Satisfaction: 97%

• Module Engagement: 94%

HDR Intro RDMoT Survey Responses (n = 693)

HREC Approval No: HC180808

M = 5.15 (SD = 0.85)

M = 4.94 (SD = 0.94)

HDR Intro RDMoT

Endorsed by Dean of GRS, Professor Johnathan Morris & UHDRC

• All newly enrolled HDRs from 2019 (self-enrolment via Moodle).

• To be completed in the term they are enrolled in.

Research Data Management online Training (RDMoT)

Initial Release

“Heads of Schools (or Chief Investigators or Principal Investigators) are responsible for liaising with the Pro-Vice-Chancellor (Research Infrastructure) to ensure staff are trained on security of data and documentation”
(Research Data Governance & Materials Handling Policy, 2019, p. 3)

“Supervisors are expected to guide their HDR candidates to develop an RDMP to plan how data and materials generated from the research project will be collected, stored and used”
(HDR Supervision Guidelines, p. 2)

DATA TOOLS

Jake Surman – Data Team Lead

Classifying your data

Some data is more sensitive and requires more protection

UNSW has a data classification standard: gs.unsw.edu.au/policy/datastandard.html

For Sensitive or Highly Sensitive data extra care must be taken in choosing storage: gs.unsw.edu.au/policy/datahandlingguidelines.html

Choosing where to store your data

OneDrive

•Why:

• Safe for High Sensitivity data (talk to us!)

• Free

• Can share data to external and internal users

•What:

• Online file storage, copies to your devices

• 5+ terabytes of data per user

• Edit Office files online with other users

•How:

• http://myoffice.unsw.edu.au/

Microsoft Teams

•Why:

• Online collaboration tool for UNSW staff

• Free

• Can add external guest users

•What:

• Create your own team

• Comes with Chat, File storage, Video conferencing, and other functions

•How:

• http://teams.microsoft.com/

Research Active Storage (also known as I: Drive, J: Drive, R: Drive, Shared Drive)

Why:

• Work with your files directly on the server (with SMB/NFS)

• Free

• Sharing within UNSW

What:

• Data stored on a UNSW server

• Multiple copies kept and backed up

• Flexible permissions

How:

• https://www.it.unsw.edu.au/staff/file/index.html

Data Archive

Why:

• Long-term storage needed to comply with grants, policies, regulations

• Free, no quotas

• Storing important data that may be needed later

• Frees up space on other storage locations

What:

• Storage for any size and number of files, permanently

• 3 tape copies kept, one taken off site after a week

• Available at any time to download

How:

• https://dataarchive.unsw.edu.au
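
The four storage options above can be compared side by side with a small lookup table. The following is only a sketch: the feature labels are paraphrases of the slide bullets, not an official UNSW taxonomy, and the names `StorageOption`, `OPTIONS` and `options_with` are hypothetical. It deliberately does not encode which data classifications each platform is approved for; check the Data Handling Guidelines or ask the data team for that.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class StorageOption:
    name: str
    features: frozenset  # labels paraphrased from the slides (assumption)
    url: str             # "How:" link as given on the slide


# Listed in the same order as the slides.
OPTIONS = [
    StorageOption(
        "OneDrive",
        frozenset({"free", "external users", "internal sharing", "online editing"}),
        "http://myoffice.unsw.edu.au/",
    ),
    StorageOption(
        "Microsoft Teams",
        frozenset({"free", "external users", "chat", "file storage", "video conferencing"}),
        "http://teams.microsoft.com/",
    ),
    StorageOption(
        "Research Active Storage",
        frozenset({"free", "internal sharing", "SMB/NFS access", "backed up", "flexible permissions"}),
        "https://www.it.unsw.edu.au/staff/file/index.html",
    ),
    StorageOption(
        "Data Archive",
        frozenset({"free", "long-term storage", "no quotas", "tape copies"}),
        "https://dataarchive.unsw.edu.au",
    ),
]


def options_with(*wanted):
    """Return the names of options that list every wanted feature, in slide order."""
    need = set(wanted)
    return [option.name for option in OPTIONS if need <= option.features]


if __name__ == "__main__":
    # Which options mention sharing with people outside UNSW?
    print(options_with("external users"))
    # Which option is described as long-term storage?
    print(options_with("long-term storage"))
```

A missing label only means the slide did not mention that feature, not that the platform lacks it.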

Specialist tools

Dr David Jung

Research Data Support Officer, Research Technology Services

UNSW eNotebook

research.unsw.edu.au/enotebook

Searchability

Accessibility

Collaboration

Contextualisation

Data protection

REDCap & Qualtrics

research.unsw.edu.au/redcap

research.unsw.edu.au/qualtrics

Data publication

Christopher Franks

Data publication enthusiast, Research Technology Services

Research Data Management

Data publication – what is it?

• Publishing research data is making it permanently available as a research output and part of the scholarly record.

• Building on Kratz & Strasser (2014, doi:10.12688/f1000research.3979.3), published data is: available now and into the future, documented/described so others can use it, and citable

– ‘The underlying goals of data publication are to enable research to be reproduced and data to be reused.’

Why publish data?

• A journal may require it – Alsheikh-Ali et al. (2011, doi:10.1371/journal.pone.0024357) found that 88% of the 50 highest impact factor journals had some data availability requirement

• ARC and NHMRC encourage publication of research data

• Studies such as Colavizza et al. (2019, https://arxiv.org/abs/1907.02565v3) found a higher citation rate for articles linking to publicly available data

• It’s good for research: it allows independent verification/reproduction of results, and new research, including from expensive or unrepeatable data collection efforts

How can you do it?

• ResData provides functionality to add a metadata record for your data to Research Data Australia, plus optional DOI minting and data hosting

• Data repository for your research discipline

• Other ways – contact us to discuss

Who can help?

[email protected]

• Outreach librarian for your school/research centre

If you have any questions on research data, please contact us at our email address:

[email protected]

For more information visit our web site:

https://unsw.to/rdm

Research Technology Services:

https://ResTech.unsw.edu.au

Chat to us at Hacky Hour! Technology and data help from the ResTech team every Thursday 3-4pm. Previously at Penny Lane café, now on Teams in the “Hacky Hour” channel inside the Research Technology Training Team.

Questions?