Creating a National Data Architecture for Evidence-Based Policy. Andrea Fernandez, Deputy Director of Dissemination, [email protected]. March 10, 2020


Page 1: Creating a National Data Architecture for Evidence-Based Policyggim.un.org/meetings/2020/WG-GI-Mexico-City/documents/5.Andrea... · Database 05.4 Implementation Stages Standardization

Creating a National Data Architecture for Evidence-Based Policy

Andrea Fernandez
Deputy Director of Dissemination
[email protected]

March 10, 2020

Page 2

Table of Contents

01 Business Problem
02 Basic Descriptions
03 Statistical Data Architecture
04 Geospatial Data Architecture
05 A platform for Evidence-Based Decision Making

Page 3

01 Business Problem

❖ From data to knowledge
❖ From data producer to data user (policy maker)

Page 4

01 General Approach

Page 5

02 Basic descriptions

❖ National Institute of Statistics and Geography (INEGI)
❖ The National Statistical System
❖ Production of Statistical Information
❖ Production of Geographical Information
❖ Metadata
❖ Analysis taxonomy

Page 6

❖ 02.1 National Institute of Statistics and Geography

A Federal Government Organization
50+ offices nationwide
16,000+ employees
150+ production lines

Technically Autonomous Institution
● Headed by five board members (1 President, 4 Vice Presidents) appointed by the Senate.

4 Divisions that produce information:
● Household
● Business
● Government and Security
● Geographic Information

Two roles
1. Coordination of the National Statistical and Geographical System (information produced by government agencies that supports the design and evaluation of public policy).
2. Production of Official Statistics and Geographical Information

Page 7

❖ 02.2 The National Statistical System

National Statistics Coordinator: INEGI

Treasury: Statistical Division
Interior Ministry: Statistical Division
Education Ministry: Statistical Division
Other Ministries: Statistical Division

Mandate
To produce information that supports the design and evaluation of public policy

Status quo:
Multiple data producers with dispersed systems of records, various types of data, and conceptual frameworks.

Page 8

❖ 02.3 Production of Statistical Information

Data: Census, business registries, other data

Generic Statistical Business Process
Process Infrastructure

Information: Charts, tables, data marks

Page 9

❖ 02.4 Production of Geographical Information

Data: Hard copy, Digital form

Geographical Information Systems (Standardized following GSBPM)

Information: Hard copy, Digital form

Lines, Points, Areas
Complex
Maps, Attributes
Images, Lidar, GPS

Page 10

❖ 02.5 Metadata

Metadata describes data lineage and relevant information that provides context, contents, and meaning.

❖ Statistics ❖ GIS

ISO Attribute standards (metadata) to produce and disseminate information.

No standards for simultaneous production of statistical and geographical objects

19100

Page 11

03 Statistical Data Architecture

❖ Dominant Schema
❖ Standardized production line (life cycle).
❖ Statistical Data Domains
❖ Aggregate Data Architecture levels
❖ Statistical Metadata
❖ Opportunities

Page 12

❖ 03.1 Dominant Schema

Informant data → Specify needs → Design → Build → Collect → Process → Analyse → Disseminate → Evaluate → Product

(The slide shows this line four times, one per production line.)
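The eight phases named on the slide form an ordered life cycle. A minimal sketch of that ordering follows, assuming each production line is at exactly one phase at a time; the names `Phase`, `next_phase`, and `remaining` are illustrative and do not come from the presentation.

```python
from enum import IntEnum
from typing import Optional


class Phase(IntEnum):
    """The eight phases named on the slide, in production order."""
    SPECIFY_NEEDS = 1
    DESIGN = 2
    BUILD = 3
    COLLECT = 4
    PROCESS = 5
    ANALYSE = 6
    DISSEMINATE = 7
    EVALUATE = 8


def next_phase(phase: Phase) -> Optional[Phase]:
    """Return the phase that follows `phase`, or None after the last one."""
    if phase is Phase.EVALUATE:
        return None
    return Phase(phase + 1)


def remaining(phase: Phase) -> list[Phase]:
    """List the phases still ahead of a production line currently at `phase`."""
    return [p for p in Phase if p > phase]
```

Using an ordered enumeration keeps every production line on the same vocabulary, which is the point of a dominant schema; real production lines may loop back or run phases in parallel, which this sketch does not model.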

Page 13

❖ 03.2 Standardized Production Line

Data object

Documentation of: Specify needs, Design, Build, Collect, Process, Analyse, Disseminate, Evaluate

Collected data
Processed data
Dissemination principles
Aggregated data
Main Indicators
Other Indicators
Additional Content
Information Set
Product

Specify needs
Informant data

Page 14

❖ 03.4 Aggregate Data Architecture levels

Page 15

❖ 03.5 Statistical Metadata

Metadata Standards

Conceptual Metadata
● Purpose
● Coverage
● Analysis unit
● Concepts
● Universe
● Variables
● Classifications
● Categories

Data collection Metadata
● Methodology
● Sampling
● Collection strategy

Processing metadata
● Entry
● Coding
● Editing
● Derivation
● Weighting

Support: Data discovery, Data analysis, Data distribution, Data access, Data availability
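The three metadata groups listed above can be sketched as simple record types. This is only an illustration: the field names are taken from the slide, while the class names, the use of free-text values, and the `missing_fields` helper are assumptions made for the example.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass
class ConceptualMetadata:
    purpose: Optional[str] = None
    coverage: Optional[str] = None
    analysis_unit: Optional[str] = None
    concepts: Optional[str] = None
    universe: Optional[str] = None
    variables: Optional[str] = None
    classifications: Optional[str] = None
    categories: Optional[str] = None


@dataclass
class CollectionMetadata:
    methodology: Optional[str] = None
    sampling: Optional[str] = None
    collection_strategy: Optional[str] = None


@dataclass
class ProcessingMetadata:
    entry: Optional[str] = None
    coding: Optional[str] = None
    editing: Optional[str] = None
    derivation: Optional[str] = None
    weighting: Optional[str] = None


def missing_fields(block) -> list[str]:
    """Names of the fields in one metadata block that are still empty."""
    return [f.name for f in fields(block) if getattr(block, f.name) is None]


# Hypothetical values, for illustration only.
survey = CollectionMetadata(methodology="household survey", sampling="stratified")
print(missing_fields(survey))  # ['collection_strategy']
```

A completeness check of this kind is one way a standardized production line could flag undocumented deliverables before dissemination.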

Page 16

❖ 03.6 Opportunities with statistical information

• Despite the heterogeneity in the production of information, all collected statistical data are either georeferenced or geocoded.
• Other statistical information attributes (metadata) can be stored and managed in a geospatial infrastructure.
• Consolidation in a Geospatial Infrastructure enables the analysis of cross-discipline domains, enhancing traditional statistical analysis.

Page 17

04 Geographical Data Architecture

❖ Geographical Information Systems (GIS)
❖ Geocoding vs. Georeferencing
❖ Schemas for Geospatial Data
❖ Grid-based representation
❖ Administrative boundaries
❖ Geographical Metadata

Page 18

❖ 04.1 Geographic Information System (GIS)

A framework for gathering, managing, and analyzing data.

Imagery
Transportation
Addresses
Water features
Boundaries
Elevation

Page 19

❖ 04.2 Geocoding vs. georeferencing

Geocoding
Requires the latitude and longitude of every statistical object
Substantial historical data is not geocoded

Georeferencing
Fixed polygons
Widely available
Less flexible
Most historical data was georeferenced
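The contrast above can be sketched as a record that carries either an exact point or the code of a fixed polygon. This is a hypothetical data model for illustration; `StatisticalObject`, `lat_lon`, `area_code`, and `location_kind` are names invented for the example.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class StatisticalObject:
    """One statistical record with a hypothetical location model."""
    object_id: str
    lat_lon: Optional[Tuple[float, float]] = None  # geocoding: an exact point
    area_code: Optional[str] = None                # georeferencing: a fixed polygon


def location_kind(obj: StatisticalObject) -> str:
    """Classify a record by the most precise location it carries."""
    if obj.lat_lon is not None:
        return "geocoded"
    if obj.area_code is not None:
        return "georeferenced"
    return "unlocated"
```

Keeping both fields optional lets historical records that only carry an area code sit in the same store as newer, geocoded ones.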

Page 20

❖ 04.3 Schemas for Geospatial data

Grid-Based
Requires geocoding of every statistical observation
Some historical data was not geocoded

Administrative boundaries
Fixed polygons
Widely available
Less flexible

Page 21

❖ 04.4 Grid-based representation

• Spatial units with equal size and even distribution.
• It offers flexibility in size.
• It is not population-centric.
• It can be applied across boundaries.
• Suitable for overlaying and spatial analysis.
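Assigning geocoded observations to grid cells can be sketched as follows. The sketch assumes square cells defined in decimal degrees, which is a simplification: cells of equal angular size are not equal in area, so a production grid would more likely be defined in a projected coordinate system. The function names and the `size_deg` parameter are illustrative.

```python
import math
from collections import Counter
from typing import Iterable, Tuple

Cell = Tuple[int, int]


def cell_of(lat: float, lon: float, size_deg: float) -> Cell:
    """Index of the square cell (row, column) that contains a point."""
    return (math.floor(lat / size_deg), math.floor(lon / size_deg))


def count_per_cell(points: Iterable[Tuple[float, float]], size_deg: float) -> Counter:
    """Count geocoded observations falling in each grid cell."""
    return Counter(cell_of(lat, lon, size_deg) for lat, lon in points)
```

Because the cell index depends only on the coordinates and the chosen cell size, the same observations can be re-binned at a coarser or finer resolution, and the cells ignore administrative borders.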

Page 22

❖ 04.5 Administrative Boundaries

Country
State
Municipality
Region
Enumeration Area
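Georeferenced values can be summed upward through such a hierarchy. The sketch below assumes that the levels nest in the order listed on the slide and that each unit is identified by a path of codes, one per level; both are assumptions made for illustration, and the codes in the usage note are invented.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# Order of the levels as listed on the slide (nesting is assumed).
LEVELS = ("country", "state", "municipality", "region", "enumeration_area")

# A unit is identified by its full path of codes, one per level.
Path = Tuple[str, ...]


def roll_up(values: Iterable[Tuple[Path, float]], level: str) -> Dict[Path, float]:
    """Sum enumeration-area values up to the requested level."""
    depth = LEVELS.index(level) + 1
    totals: Dict[Path, float] = defaultdict(float)
    for path, value in values:
        totals[path[:depth]] += value
    return dict(totals)
```

For example, calling `roll_up(data, "state")` on enumeration-area records keyed like `("MX", "09", "015", "R1", "0001")` would return one total per `("MX", "09")`-style prefix. The fixed polygons make this aggregation simple, at the cost of the flexibility that a grid offers.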

Page 23

❖ 04.6 Geographical Metadata

• Identification
  • Creation date, data author, contact information, source agency, map projection and coordinate system, scale, error, explanation of symbology and attributes, data dictionary, data restrictions, licensing.
• Assessment
  • Use constraints, access constraints, data quality, availability
• Access
  • Online, order, contact

Page 24

05 Geostatistical Approach

❖ Evolution to support evidence-based analysis
❖ Geospatial Data Infrastructure
❖ Metadata Approach
❖ Implementation stages

Page 25

❖ 05.1 Evolution to support evidence-based analysis

National Statistics Data → Statistical Analysis
Geostatistical National Data Architecture → Geostatistical Analysis
National Geospatial Data → Geospatial Analysis

Page 26

❖ 05.2 Geostatistical Data Infrastructure

Statistical Data
Geospatial Data

National Data Architecture
Geospatial Database

Stat Data Layers
Geo Data Layers: Imagery, Transportation, Addresses, Water features, Boundaries, Elevation
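One way to read the layered picture is that statistical layers are attached to geographic features through a shared key. The following is a minimal sketch under that assumption, using plain dictionaries instead of a real geospatial database; `attach_statistics` and the `area_code` key are hypothetical names.

```python
from typing import Dict, List


def attach_statistics(
    geo_layer: List[dict],
    stat_layer: Dict[str, dict],
    key: str = "area_code",
) -> List[dict]:
    """Return copies of the geographic features with matching statistics merged in."""
    merged = []
    for feature in geo_layer:
        # Features without statistics are kept unchanged.
        stats = stat_layer.get(feature[key], {})
        merged.append({**feature, **stats})
    return merged
```

The input layers are left untouched, so the same boundary layer can be combined with any number of statistical layers.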

Page 27

❖ 05.3 Metadata Approach

Time, Place, Theme

Geospatial Metadata
Statistical Metadata
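If time, place, and theme are the dimensions shared by both kinds of metadata, a catalog keyed on those three values can return statistical and geospatial datasets together. The sketch below illustrates that idea only; `CatalogEntry`, `Catalog`, and the string-valued dimensions are assumptions, not part of the presentation.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class CatalogEntry:
    name: str
    kind: str   # "statistical" or "geospatial"
    theme: str
    place: str
    time: str


class Catalog:
    """Index of datasets keyed by (theme, place, time)."""

    def __init__(self) -> None:
        self._index: Dict[Tuple[str, str, str], List[CatalogEntry]] = defaultdict(list)

    def add(self, entry: CatalogEntry) -> None:
        """Register a dataset under its theme, place, and time."""
        self._index[(entry.theme, entry.place, entry.time)].append(entry)

    def find(self, theme: str, place: str, time: str) -> List[CatalogEntry]:
        """Return every dataset, of either kind, that shares the three dimensions."""
        return list(self._index.get((theme, place, time), []))
```

A lookup on one (theme, place, time) triple then surfaces both the statistical table and the matching geographic layer, which is what makes joint analysis possible.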

Page 28

❖ 05.4 Implementation Stages

0 Silos
1 Standardization of processes and deliverables
2 Integration into a Geospatial Infrastructure
3 Geospatial analysis
4 Geospatial Network

Page 29

❖ 05.4 Implementation Stages

0 Silos

Informant data → Product (shown four times, one per production line)

Ad Hoc consolidation

Geographic Information
Statistical Information
Data Warehouse
Geospatial Database

Page 30

❖ 05.4 Implementation Stages

1 Standardization of processes and deliverables

Geographic Information
Statistical Information

Both with standardized:
1) Output
2) Metadata
3) Paradata

Page 31

❖ 05.4 Implementation Stages

2 Integration into a Geospatial Infrastructure

Geographic Information
Statistical Information
Geospatial Infrastructure

Page 32

❖ 05.4 Implementation Stages

3 Geospatial analysis

Grid Base
Administrative Boundaries

Representation
Mapping
Analysis

Page 33

❖ 05.4 Implementation Stages

4 Geospatial Network

Implement the schema in every Federal Statistical Institution...

Page 34

❖ 05.4 Implementation Stages

4 Geospatial Network

A virtual web geospatial portal to access ALL available data

Page 35

Summary

Statistical production has been supporting traditional evidence-based policymaking.

Most of the statistical production has been taking place within independent silos.

Metadata standards can enable common interfaces, but they do not provide a framework to define shared storage and use.

GIS provide a pool to concentrate statistical and geographical data into a common Geospatial Infrastructure.

The existence of standardized statistical and geographical metadata allows the consolidation of data and enhances the capabilities for representation, mapping and geospatial analysis.
