data documentation and metadata for data archiving and sharing managing research data well workshop...

14
Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

Upload: amanda-baldwin

Post on 13-Jan-2016

229 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

Data documentation and metadatafor data archiving and sharing

Managing research data well workshop London, 30 June 2009

Manchester, 1 July 2009

Page 2: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

2

Why document data?

• enables you to understand/interpret data• needed to make data independently understandable• ensures informed and correct use, reduces chance of

incorrect use/misinterpretation• if using your data for the first time, what would you need to

know?

• UKDA uses data documentation to: – create user guide(s) for dataset– ensure accurate processing and archiving– supplement information for catalogue record

Page 3: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

3

What is data documentation?

1. Wider contextual information about project(Study-level metadata)

• background, history, aims, objectives

• academia: end-of-award reports

• Government/voluntary sector: published reports, e.g. Family Spending (EFS), Living in Britain (GHS)

• publications based on dataset

Page 4: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

4

2. Methodology and processes: technical reports (also Study-level metadata)

• sample construction

• collection process - fieldwork, interviewer instructions

• instruments - questionnaires, showcards, interview schedules

• data validation - cleaning, error-checking

• data characteristics - temporal/geographic coverage

• variables - labels, coding, classifications, missing values

• derived variables - compilation

• dataset structure - files, relationships, cases, variables

What is data documentation?

Page 5: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

5

2. Methodology and processes: technical reports (contd.)

• confidentiality measures: anonymisation carried out – aggregation, banding, coding and top-coding,

disclosure control?– editing of sensitive material in interview transcripts

• weighting: factors and variables, weighting process• any secondary data sources used?

What is data documentation?

Page 6: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

6

3. researcher may add metadata routinely to files (Data-level metadata)

• quantitative data: variable/value labels; worksheet information; table relationships and queries in relational database; GIS data layers/tables

• qualitative data/text documents: interview transcript speech demarcation; respondent details

• technical reports (back to Study-level metadata)

• Data Documentation Initiative (DDI) (Study or Data-level metadata)

• http://www.ddialliance.org/codebook/index.html• metadata tools: http://tools.ddialliance.org• German Institute for Educational Progress (IQB) – educational

data codebooks www.iza.org

What is data documentation?

Page 7: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

7

UKDA metadata

• UKDA collects and creates structured metadata for each archived dataset

• created during ingest data processing (Data-level metadata) – data dictionaries, format transfer, data listing, ingest processing details and

information gathered in ‘readme’ file for users

• Catalogue record and keyword index(mix of Study-/Data-level metadata - ‘Catalogue metadata’. Also contains ‘Administrative metadata’, such as access conditions, date of publication, etc.)– data deposit form – keyword index covers data elements and concepts– international standards: DDI, METS, ISAD(G), TEI– standardised elements + controlled vocabularies = consistent search and retrieval– sufficient information for users to decide if the data suitable– information on the provenance of a dataset– record of publications

Page 8: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

8

Providing good documentation

• quality of the information provided by the data creator determines ease of discovery and appropriate re-use– comprehensive and comprehensible documentation and

metadata– complete the deposit form as fully as possible

• contact the UKDA if not sure what to produce or provide:– see advice on our Managing and Sharing web pages:

http://www.data-archive.ac.uk/sharing/metadata.asp– contact [email protected]

Page 9: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

9

Recap – why document data?

• enables you to understand/interpret data• needed to make data independently understandable• ensures informed and correct use, reduces chance of

incorrect use/misinterpretation• if using your data for the first time, what would you

need to know?

Page 10: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

10

Examples

• English Longitudinal Study of Ageing (ELSA) – very large study

• Quantitative dataset – depends on size and scale– Health Survey for England (HSE)– BHPS provides link to documentation site– smaller scale study, less documentation

• Qualitative dataset – depends on size and scale– data listing, interview schedules, methodology

Page 11: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

11

ELSA documentation

Page 12: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

12

Quantitative study

• smaller-scale study - user guide may just contain survey questionnaire, methodology information

• example from HSE 2007 – documents separated, bigger study

Page 13: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

13

Qualitative study 1

• User guide contains variety of documents

Page 14: Data documentation and metadata for data archiving and sharing Managing research data well workshop London, 30 June 2009 Manchester, 1 July 2009

14

Qualitative study 2

• Data Listing