Cyberinfrastructure for data curation, by Greg Janée (UC Santa Barbara; CDL)
Cyberinfrastructure for data curation
Greg Janée, UC Santa Barbara; CDL
• Data curation:
– management of data throughout its lifecycle,
– such that it can be kept usable in the future,
– affordably
• Three eras of science data
1. analog
2. digital
3. online
• New norms for science data
– discoverable by searching
– available: at any time, immediately, in the future
– usable
– persistently identified and citable
– supportive of replication (versioned)
– meta-information: reviews, uses, provenance
– linked to and within scholarly literature
• Who’s going to do all this?
• Data curation research at UCSB
– survey, faculty interviews, case studies
– great interest (1/3 response to survey)
– broad applicability (90% of departments)
– curation mandated (50%)
– researchers personally responsible for data (90%)
– http://tinyurl.com/ucsb-data-curation
• But...
– requested help with every aspect of curation
– researcher motivations not aligned with curation
• Prototypical researcher:
– focused on research area
– resourceful in utilizing new tools, techniques
– not knowledgeable of curatorial aspects of tools
– not expert in data management
– time- and resource-constrained
– views data management as important but secondary
• What about libraries?
– cultural heritage institutions
– curation expertise
– metadata, cataloging, search expertise
– missing: experience working with data earlier in the lifecycle
• Cyberinfrastructure for data curation
– Services
  • generic (figshare), discipline-specific (GenBank)
  • systemwide (CDL)
  • campus (institutional repositories)
– Libraries
  • awareness, identification of curation issues
  • navigation of service space
  • education
  • assistance with projects
  • relationships with researchers
– Researchers
  • motivations unchanged