icpsr-sro shared data model project

12
ICPSR-SRO Shared Data Model Project Mary Vardigan Director, DDI Alliance

Upload: torgny

Post on 14-Jan-2016

43 views

Category:

Documents


0 download

DESCRIPTION

ICPSR-SRO Shared Data Model Project. Mary Vardigan Director, DDI Alliance. Both are units of the Institute for Social Research, University of Michigan Inter-university Consortium for Political and Social Research (ICPSR) ICPSR is a large social science data archive - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: ICPSR-SRO Shared Data Model Project

ICPSR-SRO Shared Data Model Project

Mary VardiganDirector, DDI Alliance

Page 2: ICPSR-SRO Shared Data Model Project

The Partners

• Both are units of the Institute for Social Research, University of Michigan

• Inter-university Consortium for Political and Social Research (ICPSR)– ICPSR is a large social science data archive

• Survey Research Operations (SRO)– SRO is a data collection center

Page 3: ICPSR-SRO Shared Data Model Project

Past Collaborations

• Worked together on the National Survey of Family Growth, sponsored by NCHS, to create an interactive codebook

• Partnered again on the Collaborative Psychiatric Epidemiology Surveys, sponsored by NIMH– This involved a harmonization of three

datasets and interactive documentation featuring question comparison and five languages

Page 4: ICPSR-SRO Shared Data Model Project
Page 5: ICPSR-SRO Shared Data Model Project

Rationale for Collaboration

• Together, SRO and ICPSR cover the life cycle of research data

• We share a need for rich, high-quality metadata

• We want to comply with metadata standards – in particular, the Data Documentation Initiative (DDI)

• We need to pass data easily from SRO to ICPSR without information loss

Page 6: ICPSR-SRO Shared Data Model Project

New SRO-ICPSR Joint Project

• Shared data model and database design for survey metadata to enhance collaboration

• Challenges:– Different computing platforms– Different end products– Different staff orientations

Page 7: ICPSR-SRO Shared Data Model Project

Blaise Datamodel

(BMI)

SRO Blaise Parsing Tool

Blaise Database

(BDB)

SRORelational Database

(online/networked SQL Server)

SRORelational Database

(online/networked SQL Server)

Client Relational Database (offline SQL Server

Express)

Client Relational Database (offline SQL Server

Express)

<XML/WSDL>

DDI 2 or 3 File

<M

etad

ata

&

Dat

a><

Tra

nsf

orm

--at

ion

s><

Dat

a S

tora

ge>

<A

pp

lica

tio

n

Lo

gic

>

ICPSR Import Tool

User specifies files (location, file type, etc.) using an application

Other File Types (e.g. SAS, SPSS, etc)

ICPSR Relational Database

(online/networked Oracle)

ICPSR Relational Database

(online/networked Oracle)

SRO/ICPSR/Other web client

Tas

k B

Tas

k A

Tas

k B

an

d D

Tas

ks C

an

d D

Other Importing Tool

Client Relational Database (offline SQL Server

Express)

Client Relational Database (offline SQL Server

Express)

Export data

Export data

Display meta-data

Display meta-data

Stand-alone client application

Client application with sync data

Edit / Reviewmeta-data

Edit / Reviewmeta-data

Export ques-

tionnaire

Export ques-

tionnaire

Export code-book

Export code-book

Web server

ICPSR web client::• Variable Search

• Internal Variable Browser• NSFG Data Management

Page 8: ICPSR-SRO Shared Data Model Project

Products and Benefits

SRO• Tools to complement MQDS, which produces

XML documentation from Blaise instruments• Tool to permit external users to add

metadata for the National Survey of Family Growth

ICPSR• Variable-level database that permits users to

search across the ICPSR collection; compare variables; create new datasets and questionnaires

Page 9: ICPSR-SRO Shared Data Model Project

Other Benefits of the Project

• Should allow nearly seamless data sharing between SRO and ICPSR

• Covers survey data life cycle from data production to data publication

• Creates a competitive set of services that we can continue to market

• Ultimately brings more data to a wider audience

Page 10: ICPSR-SRO Shared Data Model Project

Project Phases

Phase 1• Design and development of the database

(April 30, 2008) • Modification of MQDS to export to and

read from the database (April 30, 2008)• Interface to allow remote user access for

NSFG (July 31, 2008)• ICPSR Social Science Variables Database

(Late 2008)

Page 11: ICPSR-SRO Shared Data Model Project

Preview of SSVD Search Results

Page 12: ICPSR-SRO Shared Data Model Project

Preview of Variable Display