metadata standards & technology development for the nsf survey of earned doctorates

20
Metadata Standards & Technology Development for the NSF Survey of Earned Doctorates Kimberly Noonan (NSF NCSES) Pascal Heus (MTNA) Tim Mulcahy (NORC) May 22, 2013

Upload: espen

Post on 14-Jan-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Metadata Standards & Technology Development for the NSF Survey of Earned Doctorates. Kimberly Noonan (NSF NCSES) Pascal Heus (MTNA) Tim Mulcahy (NORC) May 22, 2013. National Center for Science and Engineering Statistics (NCSES). A federal statistical agency within NSF - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

Metadata Standards & Technology Development

for the NSF Survey of Earned Doctorates

Kimberly Noonan (NSF NCSES)Pascal Heus (MTNA)Tim Mulcahy (NORC)

May 22, 2013

Page 2: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

2

National Center for Science and Engineering Statistics (NCSES)

• A federal statistical agency within NSF • Charged with the mission to provide a central clearinghouse for the

collection, interpretation, and analysis of data on scientific and engineering resources

• 12 periodic data collections covering science and engineering• Research and Development• Education• Workforce

• Over 7 contracts for external support

• Building a central data system to store, maintain, and disseminate survey data in a faster, more flexible way

Page 3: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

3

NCSES Data SystemNCSES Data User Types

Researcher

Trend Analyst

Policy Analyst

Specific Interest

Browser

NCSES StaffFAC

GSS

FFS

SED

FSS

R&D

NSCG

SDR

NSRCG

... Tool

s &

Pro

cess

es

Metadata

Microdata

Tool

s &

App

lica

tion

s

IntegratedData System

Other data*

* Other data used regularly in NCSES publications

Var

ying

leve

ls o

f ac

cess

and

fun

ctio

nali

ty

Ext

erna

l sup

port

Page 4: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

4

Specify data delivery requirements• Microdata • Metadata

• Ensure comprehensive documentation

• Standardize delivery formats

• Automate data processing

• Adopt metadata standards• Data Documentation Initiative (DDI)• Globally recommended practices• Industry standards and technologies

NCSES Data Delivery

Page 5: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

5

Objectives• Capture comprehensive survey metadata, preferably in DDI format• Automate generation of essential documentation, standard reports• Generate delivery package compatible with the NCSES data systems• Case study with Survey of Earned Doctorates (SED)

Project Team• NSF NCSES

• Building next generation data system and management framework• NORC SED and Data Enclave team

• Public interest research years of survey and data/metadata specific knowledge

• Metadata Technology North America (MTNA)• Domain and technology experts in statistical data management

NSF Metadata Project: SED Case Study

Page 6: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

6

• Began in 1958• Annual survey• All individuals receiving research doctoral degrees from

accredited U.S. Institutions• Results used to asses characteristics and trends in

doctorate education and degrees• Survey is conducted by NORC• NCSES disseminates data, reports and documentation• SED sponsors

• National Science Foundation• National Institutes of Health• US Department of Agriculture• Department of Education• National Endowment for the Humanities• National Aeronautics & Space Administration

NSF Survey of Earned Doctorates

Page 7: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

7

• Define SED metadata

• Assess current situation, file inventory, capture comprehensive survey metadata

• Develop SED metadata schema

• Develop metadata model, based on metadata standards

• Prepare metadata for SED 2011

• Develop software tool, extend existing MTNA open source application

• Automate SED metadata preparation for 2008, 2009, 2010

• Extend to additional survey cycles

• Recommend maintenance and future steps

• Findings

• Next steps

SED Metadata Project Plan

Page 8: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

8

SED NSF Manager

Sync with internal repository

Automate mapping into target repository and files packaging

Capture standard metadata around surveys

Metadata driven production of documentation

Page 9: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

9

• Catalog Explorer• Import/create survey

• Metadata Editors• Survey, Questionnaire, Classification, Variables, Data,

Documentation, Notes

• Report Center• Codebook• Comparison reports, e.g. codebook comparison• Custom reports

• Benefits• Metadata are standardized• Metadata are DDI compliant • Metadata are automatically captured

Features

Page 10: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

10

Catalog Explorer

Page 11: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

11

Metadata Editors

Page 12: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

12

Page 13: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

13

Survey Editor

Page 14: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

14

Questionnaire Editor

Page 15: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

15

Variable Editor

Page 16: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

16

Repository Editor

Page 17: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

17

Report Center

Page 18: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

18

Report Center

Codebook Comparison report

Page 19: Metadata Standards &   Technology Development  for the NSF  Survey of Earned Doctorates

19

Next Steps

• Transition to production & maintenance mode– Training, document SED 2012, tweak reports /

metadata / templates , minor fixes & enhancements

• Transition to metadata driven environment– Need further research/enhancements– Establish variable/classification/question/concept

banks• Integrate in NCSES Data Repository• Extend to other NCSES surveys (SDR)