creating data management plans for nsf proposals · 2013-01-07 · supplementary document labeled...
TRANSCRIPT
B r i a n W e s t r a A p r i l 2 0 1 2
Creating Data Management Plans for NSF Proposals
libweb.uoregon.edu/datamanagement/
What is a data management plan (DMP)?
Briefly describes how you will manage and share your research data in compliance with funder requirements
Part of the grant review process
Requirements may vary by directorate, and some proposals provide requirements/guidance as well
libweb.uoregon.edu/datamanagement/
NSF policy
“Investigators are expected to share with other researchers, at no more than incremental cost and within a reasonable time, the primary data, samples, physical collections and other supporting materials created or gathered in the course of work under NSF grants. Grantees are expected to encourage and facilitate such sharing. See Award & Administration Guide (AAG) Chapter VI.D.4.”
http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/aag_6.jsp#VID4
libweb.uoregon.edu/datamanagement/
NSF Data Sharing Policy
Supplementary document labeled “Data Management Plan” 2 page limit Fastlane will not permit submission without a Data Management
Plan (DMP) DMP’s are peer reviewed, under Intellectual Merit or Broader
Impacts or both, as appropriate for the scientific community of relevance
Some programs may have more specific guidance. If guidance specific to the program is not provided, then the requirements established in Grant Proposal Guide, Chapter II.C.2.j apply
http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp
libweb.uoregon.edu/datamanagement/
NSF Data Sharing Policy
“A valid Data Management Plan may include only the statement that no detailed plan is needed, as long as the statement is accompanied by a clear justification.” http://www.nsf.gov/pubs/policydocs/pappguide/nsf11001/gpg_2.jsp#dmp
libweb.uoregon.edu/datamanagement/
NIH Data Sharing Plan
In NIH's view, all data should be considered for data sharing. Data should be made as widely and freely available as possible while safeguarding the privacy of participants, and protecting confidential and proprietary data.
applications with direct costs of $500,000 or more are expected to address data sharing in their application
http://grants.nih.gov/grants/policy/data_sharing/data_sharing_guidance.htm http://grants.nih.gov/grants/policy/nihgps_2010/nihgps_ch8.htm#_Toc271264950
libweb.uoregon.edu/datamanagement/
NIH Data Sharing Plan
The plan should address:
(1) What data will be shared? (2) Who will have access to the data? (3) Where will the data to be shared be located? (4) When will the data be shared? (5) How will researchers locate and access the data? http://grants.nih.gov/grants/sharing_key_elements_data_sharing_plan.pdf
libweb.uoregon.edu/datamanagement/
Other Reasons to Plan
Save time Increase research impact Safeguard the investment
Key Questions to Consider
1. What type of data or files will be produced or used? 2. What standards will be used for documentation and description (metadata)? 3. How will the data be organized, stored and protected during the research project? 4. If you allow others to reuse your data, how will the data be accessed and shared? 5. How will the data be archived for preservation and long-term access?
1
What types of data will your research produce?
libweb.uoregon.edu/datamanagement/
What is “data”?
Recorded factual material commonly accepted in the scientific community as necessary to validate research findings. (NIH)
…primary data, samples, physical collections and other supporting materials created or gathered in the course of work (NSF)
What constitutes such data will be determined by the community of interest through the process of peer review and program management. This may include, but is not limited to: data, publications, samples, physical collections, software and models. (NSF)
libweb.uoregon.edu/datamanagement/
Considerations
libweb.uoregon.edu/datamanagement/
What is the data about X-Ray crystallography, for characterization of a chemical
compound
In what form and format is it stored Multiple files, but final standard form is a .cif file
Considerations
Raw Derivative
libweb.uoregon.edu/datamanagement/
workflow
Form of data (i.e., Excel vs. tab-delimited text) How unique/reproducible is it? Confidentiality/privacy Is proprietary software needed to view/access it? File sizes and growth rate Physical collections (herbaria, fossils)
Activity
What type of data does your research produce?
checklist #1
libweb.uoregon.edu/datamanagement/
NSF Data Sharing Policy
1. the types of data, samples, physical collections, software, curriculum materials, and other materials to be produced in the course of the project;
Suggested practice: Describe the materials to be produced in the course of the
proposed project. Indicate which will be shared and at what stage (raw,
processed, analyzed). Describe why the materials are of interest to a broader
community (impact).
libweb.uoregon.edu/datamanagement/
2
What standards will be used for documentation and description
(metadata)?
libweb.uoregon.edu/datamanagement/
Considerations
libweb.uoregon.edu/datamanagement/
What will help you find and understand the data during & after the project?
What information will help someone else find, understand/interpret, and use the data?
Are there metadata standards for this information?
Considerations
libweb.uoregon.edu/datamanagement/
No metadata = no data (no search engine results) Metadata = interoperability (machine to machine) Some data repositories will have their own metadata
standards and tools
Metadata
libweb.uoregon.edu/datamanagement/
History of the data Source Conditions of collection/generation Software requirements
Recorded in: Lab notebook Codebook Text file
Activity
What standards will you use for documentation and description
(metadata)?
checklist #2
libweb.uoregon.edu/datamanagement/
NSF Data Sharing Policy
2. the standards to be used for data and metadata format and content (where existing standards are absent or deemed inadequate, this should be documented along with any proposed solutions or remedies);
Suggested practice: Identify the formats of data files created over the course of the
project. Select file formats for sharing that maximize the potential for
reuse and longevity, and describe plans for conversion to those formats, if necessary.
Identify metadata standards to be used, who will create metadata and at what stage.
libweb.uoregon.edu/datamanagement/
3
How will the data be organized, stored, and protected
during the research project?
libweb.uoregon.edu/datamanagement/
Considerations
libweb.uoregon.edu/datamanagement/
Privacy/confidentiality File organization: file and directory naming
conventions Storage and backup media, backup schedules Access restrictions Tracking and recording changes to files
Activity
How will the data be organized, stored, and protected
during the research project?
Checklist #3
libweb.uoregon.edu/datamanagement/
NSF Data Sharing Policy
3. policies for access and sharing including provisions for appropriate protection of privacy, confidentiality, security, intellectual property, or other rights or requirements;
Suggested practice: NSF recognizes that legal and ethical requirements may
preclude sharing of some kinds of data. Trade secrets and commercial information are also not subject to the data sharing requirement.
Describe data management practices that will be used to maximize security and protect privacy, if applicable.
libweb.uoregon.edu/datamanagement/
4
If you allow others to reuse your data, how will the data be accessed and shared?
libweb.uoregon.edu/datamanagement/
Considerations
libweb.uoregon.edu/datamanagement/
NSF emphasizes sharing and open access to research data
Some data is not considered for sharing
libweb.uoregon.edu/datamanagement/
preliminary analyses, drafts of scientific papers, plans for future research, peer reviews, or communications with colleagues
Some data is not considered for sharing
libweb.uoregon.edu/datamanagement/
Research data also do not include: (A) Trade secrets, commercial information, materials necessary to be held confidential by a researcher until they are published, or similar information which is protected under law; and (B) Personnel and medical information and similar information the disclosure of which would constitute a clearly unwarranted invasion of personal privacy, such as information that could be used to identify a particular person in a research study.
Citing and Sharing Data
libweb.uoregon.edu/datamanagement/
Web site
Activity
If you allow others to reuse your data, how will the data be accessed and shared?
Checklist #4
libweb.uoregon.edu/datamanagement/
NSF Data Sharing Policy
4. policies and provisions for re-use, re-distribution, and the production of derivatives;
Suggested practice: Describe usage rights, licenses, or other policies related
to re-use and redistribution of data. Some data centers and repositories may have their own
policies; reconcile these with your preferences
libweb.uoregon.edu/datamanagement/
5
How will the data be archived for preservation and long-term access?
libweb.uoregon.edu/datamanagement/
Archives and Preservation
libweb.uoregon.edu/datamanagement/
Many repositories for sharing and access function as archives as well
Data sets may be registered and DOIs are produced Confirm the “lifetime” of data in these resources
NSF Data Sharing Policy
5. plans for archiving data, samples, and other research products, and for preservation of access to them.
Suggested practice: Describe means by which you will provide access to
data and applicable time frame. Describe means for preserving data, if different from
above.
libweb.uoregon.edu/datamanagement/
Budget
Costs associated with data management are allowable. Include on line G2 (publication, documentation,
dissemination). Justify in budget justification. http://www.nsf.gov/bfa/dias/policy/dmpfaqs.jsp
libweb.uoregon.edu/datamanagement/
Support
Web site: http://libweb.uoregon.edu/datamanagement/ Contact: [email protected] Brian Westra: [email protected] Materials adapted from the Univ. of Minnesota:
https://www.lib.umn.edu/datamanagement/workshops/dataplan
libweb.uoregon.edu/datamanagement/