Integrated Public Use Microdata Series International: census microdata for research and policy


Post on 27-Mar-2015


Page 1: Integrated Public Use Microdata Series International: census microdata for research and policy * * * Robert McCaa Albert Esteve Palós Minnesota Population

Integrated Public Use Microdata Series International: census microdata for research and policy

* * *

Robert McCaa, Albert Esteve Palós

Minnesota Population Center, Centre d’Estudis Demogràfics

“Only used statistics are useful statistics.”

Page 2:

1. IPUMS-International: goals and benefits

“…best practice for a data repository of international statistical data”
-- Dennis Trewin, chair, UNECE task force on Statistical Confidentiality & Microdata Access

Page 3:

IPUMS-International Goals

1. Preserve census microdata and documentation for all the countries in the world
2. Integrate microdata and metadata--a CD with source data and codebook is not sufficient
3. Disseminate--without cost--extracts of samples to bona fide researchers worldwide, regardless of country of birth, citizenship or residence.

» Sustained, major funding from 1999 through 2014 by:
  » National Science Foundation (USA)
  » National Institutes of Health (USA)
  » University of Minnesota

Page 4:

Preservation: 1973 census tapes of Sudan at risk!

Page 5:

Benefits of IPUMS-International

» Preservation – IPUMS provides material and technical resources
  » Recover historical census data and documentation
  » Archive data and documentation to the highest international standards
» Integration – IPUMS does the work
  » Draw high-precision samples to uniform specifications
  » Anonymize microdata to the highest international standards
  » Integrate samples according to national practices and international principles
» Dissemination – IPUMS manages the risk
  » License samples and documentation in a global initiative (US$5,000 per census of 1 million or more person records)
  » Disseminate microdata with minimal risk and maximum benefit, at no cost

Page 6:

IPUMS-International [world map, Mollweide projection]

Map legend (microdata): integrated into IPUMS; entrusted to IPUMS; none entrusted; none inventoried

dark green = integrated and disseminating (55 countries, 159 censuses, 325 million person records)
green = to be integrated (35 countries, 90 censuses, 150 million)

2011: Cambodia 2008, Egypt 2006, France 2006, Germany, Ireland, Nicaragua, Sierra Leone, etc.

Page 7:

www.iecm-project.org

Page 8:

PROJECT OVERVIEW | COORDINATION | HARMONIZATION | DISSEMINATION

www.iecm-project.org

Integrated European Census Microdata

Coordination Integration Dissemination

Meetings:

Barcelona 2005

Paris 2006

Lisbon 2007

Barcelona 2008

Integrated Documentation

Intra-European classifications

Mirror site

Additional documentation

Data Browser /Online Tabulator

Page 9:

2. Integrating Census Microdata and Metadata

See also: 2009: “Timely dissemination of integrated census microdata and metadata: The IPUMS-International approach.” ASSD V: “Information and communication technology in data dissemination: bridging closer producers and users during the 2010 round of Population and Housing Censuses” (19-21 November 2009, Dakar, Senegal)

Page 10:

Constructing the IPUMS-International integrated metadata and microdata system

» IPUMS-International NEVER disseminates source microdata!
» 5-step process of integration (2+ years to integrate metadata and microdata):
  1. Confirm the integrity and validity of source microdata and metadata
  2. Draw and anonymize high-precision samples
  3. Integrate microdata sample (next slide)
  4. Integrate metadata (following slide)
  5. Confirm the integrity and validity of the integrated microdata sample and metadata
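Step 2 of the list above can be sketched in a few lines. The slides do not say how samples are drawn or anonymized, so everything here is an assumption made for illustration: a systematic 1-in-k draw of whole households with a random start, followed by dropping directly identifying fields. The function names and the fields `name`, `address` and `household` are hypothetical.

```python
import random


def systematic_household_sample(household_ids, interval, seed=None):
    """Assumed method: keep every `interval`-th household after a random start."""
    if interval < 1:
        raise ValueError("interval must be a positive integer")
    rng = random.Random(seed)
    start = rng.randrange(interval)  # random start in [0, interval)
    return household_ids[start::interval]


def anonymize(person, drop=("name", "address")):
    """Assumed rule: remove directly identifying fields from a person record."""
    return {key: value for key, value in person.items() if key not in drop}


def persons_in_sample(persons, sampled_households):
    """Keep all members of each sampled household, anonymized."""
    chosen = set(sampled_households)
    return [anonymize(p) for p in persons if p["household"] in chosen]


# Hypothetical source records: 100 households with one person each.
persons = [{"household": h, "name": f"person-{h}", "age": 30} for h in range(100)]
sample_ids = systematic_household_sample(list(range(100)), interval=10, seed=1)
sample = persons_in_sample(persons, sample_ids)
```

Sampling households rather than individuals keeps co-resident persons together, which matters for any analysis of family or household structure.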

Page 11:

Step 3 of integration in the IPUMS system

• Composite coding scheme:
  1) preserve every significant detail and
  2) harmonize every code
• Example: marital status
  • …
  • 200 = married/in union
  • 210 = married, formal
  • 211 = married, civil
  • 212 = married, religious
  • …
  • 215 = traditional or customary
  • 217 = polygamous
  • …
  • 220 = married, consensual union
  • …
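The marital-status codes above suggest how a composite code can carry both goals at once. The sketch below uses only the codes shown on the slide. It assumes that the first digit gives the general category and the first two digits an intermediate one; that reading is inferred from the code pattern and is not stated in the slides. The function names are hypothetical.

```python
# Codes and labels copied from the marital-status example on the slide.
MARITAL_STATUS = {
    200: "married/in union",
    210: "married, formal",
    211: "married, civil",
    212: "married, religious",
    215: "traditional or customary",
    217: "polygamous",
    220: "married, consensual union",
}


def general_code(code):
    """Collapse a detailed code to its first digit (assumed general level)."""
    return code // 100 * 100


def intermediate_code(code):
    """Collapse a detailed code to its first two digits (assumed middle level)."""
    return code // 10 * 10


def describe(code):
    """Return the slide's label, or a placeholder for codes not shown there."""
    return MARITAL_STATUS.get(code, "not shown on the slide")


for detailed in (211, 212, 217, 220):
    print(detailed, describe(detailed), "->", describe(general_code(detailed)))
```

Under this assumption, a country whose census distinguishes civil from religious marriage keeps that detail in the third digit, while a cross-country comparison can fall back on the first digit alone.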

Page 12:

Step 4: integrate metadata

4. Integrate metadata (XML): Document every census, sample, variable and code:

• Source documents (pdf) in official language and English
• Dynamic metadata system: compare any combination of countries and samples:
  • wording of any census question and instructions to field workers
• Characteristics of each census and sample
• Describe each variable: “universe”, definition, comparability, etc.
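To make the idea of XML metadata for a variable concrete, here is a minimal sketch that reads one variable description with Python's standard library. The element and attribute names (`variable`, `label`, `universe`, `code`, `value`) and the universe text are hypothetical; the slides say only that the metadata are stored as XML and cover universe, definition, comparability and codes. The code labels are taken from the marital-status example.

```python
import xml.etree.ElementTree as ET

# Hypothetical metadata document; the schema is an assumption for illustration.
VARIABLE_XML = """
<variable name="marital_status">
  <label>Marital status</label>
  <universe>placeholder universe text</universe>
  <code value="200">married/in union</code>
  <code value="211">married, civil</code>
  <code value="220">married, consensual union</code>
</variable>
"""


def load_variable(xml_text):
    """Parse one variable description into a plain dictionary."""
    root = ET.fromstring(xml_text.strip())
    return {
        "name": root.get("name"),
        "label": root.findtext("label"),
        "universe": root.findtext("universe"),
        "codes": {int(c.get("value")): c.text for c in root.findall("code")},
    }


variable = load_variable(VARIABLE_XML)
print(variable["label"], sorted(variable["codes"]))
```

Keeping each variable's documentation in a structured form like this is what allows a system to compare question wording or universes across any combination of samples.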

Page 13:

3. IPUMS-International: Dissemination

See also: 2010: “Disseminating internationally integrated census microdata for the 2010 round and beyond: the Integrated Public Use Microdata Series-International Experience.” ECE/CES/GE.41/2010/19.

Page 14:

2. Using https://www.ipums.org/international:

1. Log on with password
2a. Study documentation
2b. Design extract
3. Receive email; log on with password
4. Download extract (SSL encrypted)
5. Unzip data
(also SAS, STATA)
6. Analyze
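Steps 5 and 6 above can be sketched as follows. The slides do not describe the extract file format, so this assumes a gzip-compressed, fixed-width text file; the column layout, the variable names and the sample values are all hypothetical. In practice the layout would come from the documentation delivered with the extract.

```python
import gzip
import io

# Hypothetical layout: (name, start, end), 0-based positions, end exclusive.
LAYOUT = [("country", 0, 3), ("year", 3, 7), ("marst", 7, 10)]


def parse_line(line, layout=LAYOUT):
    """Slice one fixed-width record into named integer fields."""
    return {name: int(line[start:end]) for name, start, end in layout}


def read_extract(fileobj):
    """Decompress a gzip stream and parse every non-empty line."""
    with gzip.open(fileobj, mode="rt", encoding="ascii") as handle:
        return [parse_line(line.rstrip("\n")) for line in handle if line.strip()]


# Build a tiny in-memory "extract" so the sketch runs without a download.
raw = "4842000211\n4842000220\n"
buffer = io.BytesIO(gzip.compress(raw.encode("ascii")))
records = read_extract(buffer)
print(records)
```

With a real download, `buffer` would be replaced by the opened extract file, and the parsed records would feed whatever statistical package the researcher prefers.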

Page 15:

4. IPUMS-International: Usage statistics

See card hand-out for list of current samples and usage statistics

Page 16:

Who Uses the Microdata (1,264 undertakings, 2007)

» Affiliation
  » University professors and students: 91%
  » Others: 9%
    » International agencies (World Bank, DFID, etc.): n=31
    » International research institutes: n=26
    » United Nations (ILO, WHO, etc.): n=21
    » National Statistical Officials: n=18
    » National government officials: n=18
    » Employees of Non-Governmental Organizations: n=3
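The percentages and counts on this slide can be cross-checked with a little arithmetic. The sketch below uses only the figures given above; it assumes that the six listed groups make up all of the "Others" category and that everyone else is a university professor or student.

```python
TOTAL_UNDERTAKINGS = 1264

# Counts copied from the slide.
OTHERS = {
    "International agencies": 31,
    "International research institutes": 26,
    "United Nations": 21,
    "National Statistical Officials": 18,
    "National government officials": 18,
    "Non-Governmental Organizations": 3,
}

others_total = sum(OTHERS.values())
university_total = TOTAL_UNDERTAKINGS - others_total  # assumed remainder

others_share = round(100 * others_total / TOTAL_UNDERTAKINGS)
university_share = round(100 * university_total / TOTAL_UNDERTAKINGS)

print(others_total, others_share, university_share)
```

The six groups sum to 117 undertakings, which rounds to the 9% shown for "Others" and leaves the 91% reported for universities.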

Page 17:

Who Uses the Microdata (1,264 undertakings, 2007)

» Disciplines
  » Economics: 44%
  » Demography: 13%
  » Sociology: 12%
  » Public policy: 5%
  » History: 4%
  » Others: 22% (32 disciplines)

Page 18:

Research Topics: extraordinarily diverse

» Economists:
  » Comparative study of labor force participation
  » Demand and supply of public services (water, electricity, sewage, etc.)
  » Economic impact of family planning and fertility decline
  » Discrimination in credit markets
  » Econometric analysis of labor force and income
  » Effect of long-term youth unemployment
  » Effects of volume of human capital on returns to education
  » Human capital and aging
  » Impact of trade policies on growth, development, immigration, labor markets, and inequality
  » Etc.

Page 19:

For uses, see http://bibliography.ipums.org

Page 20:

Better: scholar.google.com
Search for IPUMS & key-word: subject, name of country, etc.

Page 21:

Conclusion: Invitation to continued cooperation

» In 1999, our dream: integrate samples of 21 countries in 10 years
» Thanks to generous cooperation of 55 National Statistical Offices
» Undreamed-of technological innovations
» By 2009, integrated samples for 44 countries
» Number of users and usage far exceeded expectations
» For the 2010 decade, our dream:
  » Double (2x) the number of integrated samples
  » Triple (3x) the number of users
  » Quadruple (4x) research output from census microdata