integrated public use microdata series international: census microdata for research and policy * * *...
TRANSCRIPT
Integrated Public Use Microdata Series International: census microdata for research and policy
* * *
Robert McCaa    Albert Esteve Palós
Minnesota Population Center    Centre d’Estudis Demogràfics
“Only used statistics are useful statistics.”
1. IPUMS international: goals and benefits
“…best practice for a data repository of international statistical data”
--Dennis Trewin, chair UNECE task force on Statistical Confidentiality & Microdata Access
IPUMS-International Goals
1. Preserve census microdata and documentation for all the countries in the world
2. Integrate microdata and metadata--a CD with source data and codebook is not sufficient
3. Disseminate--without cost--extracts of samples to bona-fide researchers worldwide, regardless of country of birth, citizenship or residence.
» Sustained, major funding from 1999 through 2014 by:
  » National Science Foundation (USA)
  » National Institutes of Health (USA)
  » University of Minnesota
Preservation: 1973 census tapes of Sudan at risk!
Benefits of IPUMS-International
» Preservation – IPUMS provides material and technical resources
  » Recover historical census data and documentation
  » Archive data and documentation to the highest international standards
» Integration – IPUMS does the work
  » Draw high-precision samples to uniform specifications
  » Anonymize microdata to highest international standards
  » Integrate samples according to national practices and international principles
» Dissemination – IPUMS manages the risk
  » License samples and documentation in a global initiative (US$5,000 per census of 1 million or more person records)
  » Disseminate microdata with minimal risk and maximum benefit, at no cost
[Map, Mollweide projection: status of census microdata by country. Legend: integrated into IPUMS; entrusted to IPUMS; none entrusted; none inventoried.]
IPUMS-International
dark green = integrated and disseminating (55 countries, 159 censuses, 325 million person records)
green = to be integrated (35 countries, 90 censuses, 150 million)
IPUMS-International
2011: Cambodia 2008, Egypt 2006, France 2006, Germany, Ireland, Nicaragua, Sierra Leone, etc.
www.iecm-project.org
Integrated European Census Microdata
Coordination Integration Dissemination
Meetings:
Barcelona 2005
Paris 2006
Lisbon 2007
Barcelona 2008
Integrated Documentation
Intra-European classifications
Mirror site
Additional documentation
Data Browser / Online Tabulator
2. Integrating Census Microdata and Metadata
See also: 2009: “Timely dissemination of integrated census microdata and metadata: The IPUMS-International approach.” ASSD V: “Information and communication technology in data dissemination: bridging closer producers and users during the 2010 round of Population and Housing Censuses” (19-21 November 2009, Dakar, Senegal)
Constructing the IPUMS-International integrated metadata and microdata system
» IPUMS-International NEVER disseminates source microdata!
» 5 step process of integration--2+ years to integrate metadata and microdata:
1. Confirm the integrity and validity of source microdata and metadata
2. Draw and anonymize high precision samples
3. Integrate microdata sample (next slide)
4. Integrate metadata (following slide)
5. Confirm the integrity and validity of the integrated microdata sample and metadata
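Step 2 (draw and anonymize) can be pictured with a minimal sketch. The slides do not state the sampling design or the anonymization rules, so the systematic 1-in-k household draw, the field names, and the list of dropped identifiers below are all assumptions made for illustration only.

```python
import random

# Hypothetical direct identifiers; the real anonymization rules are not given in the slides.
DIRECT_IDENTIFIERS = {"name", "address"}

def draw_systematic_sample(households, k, seed=0):
    """Keep every k-th household after a random start (an assumed design)."""
    start = random.Random(seed).randrange(k)
    return households[start::k]

def anonymize(household):
    """Return a copy of the household with direct identifiers removed from each person."""
    return {
        "serial": household["serial"],
        "persons": [
            {key: value for key, value in person.items() if key not in DIRECT_IDENTIFIERS}
            for person in household["persons"]
        ],
    }

# Made-up source records standing in for a full census file.
households = [
    {"serial": i, "persons": [{"name": f"person-{i}", "address": "x", "age": 20 + i % 50}]}
    for i in range(1000)
]
sample = [anonymize(h) for h in draw_systematic_sample(households, k=10)]
```

Sampling whole households rather than individual persons keeps household members together, so family relationships stay analyzable in the sample.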
Step 3 of integration in the IPUMS system
• Composite coding scheme:
1) preserve every significant detail and
2) harmonize every code
• Example: marital status
• …
• 200 = married/in union
• 210 = married, formal
• 211 = married, civil
• 212 = married, religious
• …
• 215 = traditional or customary
• 217 = polygamous
• …
• 220 = married, consensual union
• …
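The composite coding idea can be sketched as follows. The harmonized codes and labels are the ones listed on the slide; the national source codes, the sample names, and the rule that the leading digit carries the category shared by all samples are assumptions for illustration (the example codes suggest that rule, but the slide does not state it).

```python
# Harmonized codes and labels as listed on the slide (elided codes are left out).
MARITAL_STATUS = {
    200: "married/in union",
    210: "married, formal",
    211: "married, civil",
    212: "married, religious",
    215: "traditional or customary",
    217: "polygamous",
    220: "married, consensual union",
}

# Hypothetical national source codes mapped to harmonized detailed codes.
SOURCE_TO_HARMONIZED = {
    "country_a": {1: 211, 2: 212, 3: 220},  # census that distinguishes types of union
    "country_b": {1: 200},                  # census that records only "married"
}

def harmonize(sample, source_code):
    """Translate a national source code into the harmonized detailed code."""
    return SOURCE_TO_HARMONIZED[sample][source_code]

def general_code(detailed_code):
    """Collapse a detailed code to its leading digit, e.g. 211 -> 200 (assumed rule)."""
    return detailed_code // 100 * 100

detailed = harmonize("country_a", 2)
label = MARITAL_STATUS[general_code(detailed)]
```

Under this assumption, a census with rich detail keeps it in the trailing digits, while a census with little detail still lands in the same general category, so both can be compared at the level of the leading digit.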
Step 4: integrate metadata
4. Integrate metadata (XML): Document every census, sample, variable and code:
• Source documents (pdf) in official language and English
• Dynamic metadata system--compare any combination of countries and samples:
  • wording of any census question and instructions to field workers
• Characteristics of each census and sample
• Describe each variable: “universe”, definition, comparability, etc.
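Comparing one metadata field across samples might look roughly like the sketch below. The slides say only that the metadata are stored as XML; the element names, attribute names, sample ids, and texts here are invented for illustration and are not the IPUMS schema.

```python
import xml.etree.ElementTree as ET

# Hypothetical metadata document; every name and text in it is made up.
METADATA_XML = """
<variable name="marital_status">
  <sample id="sample_a">
    <universe>All persons</universe>
    <question>What is your marital status?</question>
  </sample>
  <sample id="sample_b">
    <universe>Persons age 12+</universe>
    <question>Are you currently married or in a union?</question>
  </sample>
</variable>
"""

def compare(xml_text, field, sample_ids):
    """Return {sample id: text of `field`} for the requested samples."""
    root = ET.fromstring(xml_text)
    wanted = set(sample_ids)
    return {
        sample.get("id"): sample.findtext(field)
        for sample in root.iter("sample")
        if sample.get("id") in wanted
    }

universes = compare(METADATA_XML, "universe", ["sample_a", "sample_b"])
questions = compare(METADATA_XML, "question", ["sample_b"])
```

Storing one record per variable and sample is what lets a user line up question wording or universe statements for any combination of samples.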
3. IPUMS-International: Dissemination
See also: 2010: “Disseminating internationally integrated census microdata for the 2010 round and beyond: the Integrated Public Use Microdata Series-International Experience.” ECE/CES/GE.41/2010/19.
2. Using https://www.ipums.org/international:
1. Logon w/ password
2a. Study documentation
2b. Design extract
3. Receive email; logon with p/word
4. Download extract (SSL encrypted)
5. UnZip data (also SAS, STATA)
6. Analyze
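Steps 5 and 6 can be sketched in a few lines. The slides do not specify the file format, so the gzip compression, the fixed-width layout, and the column names below are assumptions; a tiny stand-in file is written first so the sketch runs without a real download.

```python
import gzip
import os
import tempfile
from collections import Counter

# Hypothetical column layout (name, start, end); a real extract comes with its own codebook.
LAYOUT = [("year", 0, 4), ("age", 4, 7), ("marst", 7, 10)]

def read_extract(path):
    """Decompress a gzip text file and slice each fixed-width line into a dict of ints."""
    with gzip.open(path, "rt", encoding="ascii") as handle:
        for line in handle:
            yield {name: int(line[start:end]) for name, start, end in LAYOUT}

# Stand-in for the downloaded extract (made-up records).
path = os.path.join(tempfile.mkdtemp(), "extract.dat.gz")
with gzip.open(path, "wt", encoding="ascii") as handle:
    handle.write("2006034211\n2006051212\n2006029220\n")

records = list(read_extract(path))
counts = Counter(record["marst"] for record in records)
```

Reading the file as a generator keeps memory use flat, which matters when an extract holds millions of person records.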
4. IPUMS-International: Usage statistics
See card hand-out for list of current samples and usage statistics
Who Uses the Microdata (1,264 undertakings, 2007)
» Affiliation
» University professors and students: 91%
» Others: 9%
  » International agencies (World Bank, DFID, etc.): n=31
  » International research institutes: n=26
  » United Nations (ILO, WHO, etc.): n=21
  » National Statistical Officials: n=18
  » National government officials: n=18
  » Employees of Non-Governmental Organizations: n=3
Who Uses the Microdata (1,264 undertakings, 2007)
» Disciplines
» Economics: 44%
» Demography: 13%
» Sociology: 12%
» Public policy: 5%
» History: 4%
» Others: 22% (32 disciplines)
Research Topics: extraordinarily diverse
» Economists:
  » Comparative study of labor force participation
  » Demand and supply of public services (water, electricity, sewage, etc.)
  » Economic impact of family planning and fertility decline
  » Discrimination in credit markets
  » Econometric analysis of labor force and income
  » Effect of long-term youth unemployment
  » Effects of volume of human capital on returns to education
  » Human capital and aging
  » Impact of trade policies on growth, development, immigration, labor markets, and inequality
  » Etc.
For uses, see http://bibliography.ipums.org
Better: search scholar.google.com for “IPUMS” plus a key-word (subject, name of country, etc.)
Conclusion: Invitation to continued cooperation
» In 1999, our dream: integrate samples of 21 countries in 10 years
» Thanks to generous cooperation of 55 National Statistical Offices
» Undreamed-of technological innovations
» By 2009, integrated samples for 44 countries
» Number of users and usage far exceeded expectations
» For the 2010 decade, our dream:
  » Double (2x) the number of integrated samples
  » Triple (3x) the number of users
  » Quadruple (4x) research output from census microdata