metadata to support the survey life cycle

22
Metadata to Support the Survey Life Cycle Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva, April 3-5, 2006

Upload: bedros

Post on 19-Jan-2016

50 views

Category:

Documents


0 download

DESCRIPTION

Metadata to Support the Survey Life Cycle. Alice Born, Statistics Canada Joint UNECE/Eurostat/OECD Work Session on Statistical Metadata (METIS) Geneva, April 3-5, 2006. Outline. Description of STC’s Integrated Metadatabase (IMDB) Common metatdata set for a survey life cycle - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Metadata to Support  the Survey Life Cycle

Metadata to Support the Survey Life Cycle

Alice Born, Statistics CanadaJoint UNECE/Eurostat/OECDWork Session on Statistical Metadata(METIS)Geneva, April 3-5, 2006

Page 2: Metadata to Support  the Survey Life Cycle

Outline

• Description of STC’s Integrated Metadatabase (IMDB)

• Common metatdata set for a survey life cycle

• Tools for entering metadata

• Time travel – versioning rules

• Complete model

Page 3: Metadata to Support  the Survey Life Cycle

Corporate metadata at Statistics Canada

• Integrated Metadatabase (IMDB)– Collection of information about each of

Statistics Canada’s 560+ current surveys– Aimed at helping users interpret statistical

data• Survey description• Survey instrument• Methodology• Data accuracy• Variables, classifications

Page 4: Metadata to Support  the Survey Life Cycle

What is the IMDB based on?

• ISO 11179 Specification and Standardization of Data Elements

• Corporate Metadata Repository (CMR) – USBC (D. Gillman)

• Extension of ANSI X3.285 for the management of statistical information (American National Standards Institute metamodel)

Page 5: Metadata to Support  the Survey Life Cycle

Surveys - definition

• Metadata in the IMDB is organized around the survey entity

• Refers to collection, compilation and publication of data measuring characteristics of a population

• Three types of surveys:• Direct • Administrative• Derived

Page 6: Metadata to Support  the Survey Life Cycle

Statistical Activities

• Group of surveys that share common feature, common explanatory text

• E.g., System of National Accounts:

The Canadian System of National Accounts (CSNA) provides a conceptually integrated framework of statistics and analysis for studying the state and behaviour of the Canadian economy. The accounts are centered on the measurement of activities associated with production of goods and services, the sales of goods and services in final markets, the supporting financial transactions and the resulting wealth positions.

Page 7: Metadata to Support  the Survey Life Cycle

Regions

Organization

Contact

Documentation

Identification

Time Frame

Keyword

Theme

Survey

Universe

Frame

Survey instance

Instrument

Question

Data file

Methodology

Instrument designSamplingData sourceError detectionImputationEstimationQuality evaluationDisclosure controlRevisions and seasonal adjustmentData accuracy

Data Element

Data Element Concept

Object Class

Property

Formula

Conceptual Domain

Value Domain

Stewardship

Identification

Classification

Statistical Activity

Page 8: Metadata to Support  the Survey Life Cycle

Common metadata set for survey life cycle

Statistical activity

Survey (direct, administrative, derived) Target population (population, statistical unit)

Survey instance (each survey process)

Collection instrument Methodology

Data accuracy

Documentation

Data file

(Data elements, value domains)

Page 9: Metadata to Support  the Survey Life Cycle

Common metadata set for survey life cycle

MethodologyInstrument designSampling Collection methodError detectionImputation EstimationQuality evaluationDisclosure controlRevisions and seasonal adjustment

Page 10: Metadata to Support  the Survey Life Cycle

Common metadata set for survey life cycle

Survey

Survey Instance- questionnaires- variables (DE)- methodology- data accuracy

Page 11: Metadata to Support  the Survey Life Cycle

Common metadata set for survey life cycle

Survey Instruments

Page 12: Metadata to Support  the Survey Life Cycle

Common metadata set for survey life cycle

Data elements

Page 13: Metadata to Support  the Survey Life Cycle

Common metadata set for survey life cycle

Methodology

Target population

Instrument design

Page 14: Metadata to Support  the Survey Life Cycle

Tools for loading metadata into IMDB

Page 15: Metadata to Support  the Survey Life Cycle

Statistical Activity - Identification Tab

Page 16: Metadata to Support  the Survey Life Cycle

Statistical Activity and Survey- DescriptionTab

Page 17: Metadata to Support  the Survey Life Cycle

Survey Instance (cycle) – Times Frames

Page 18: Metadata to Support  the Survey Life Cycle

Data sources – Description

Page 19: Metadata to Support  the Survey Life Cycle

Versioning (time-travel)

• Metadata change over time – each survey instance, survey or statistical activity

• Rules for revisions and versioning of administered items

• Three functions:– Create– Update– Version

Page 20: Metadata to Support  the Survey Life Cycle

Versioning (time-travel)

Survey:• Changes to mandate or subject of survey – new survey

(new IMDB record and new SDDS number)• Changes to characteristics of surveys – new version of

survey

Survey instance:• Each reference period – new version of the instance

– Now it coincides with release of data in the Daily– Demand for the new instance version to coincide with collection

start dates – Central link to versioning of other administered items

(instrument, methodology and data file)

Page 21: Metadata to Support  the Survey Life Cycle

Versioning (time-travel)

Target population:• Changes result in a new version of the survey

and target population

Statistical activity:• Changes to program mandate or structure

(addition or removal of surveys) results in new version of statistical activity

Page 22: Metadata to Support  the Survey Life Cycle

StatisticalActivity

Survey

Survey Instance

Data File

Methodology

Instrument

Applications/Software

Products(COR)

Data elements

Frame and Sample

Target population