the challenge:

43
Religion and Economic Change over a Century: Linking Diverse Historical Data New Technologies and Interdisciplinary Research on Religion Harvard, 2010 Robert D. Woodberry Juan Carlos Esparza University of Texas at Austin Sociology Department and Population Research Center

Upload: azize

Post on 04-Feb-2016

34 views

Category:

Documents


0 download

DESCRIPTION

Religion and Economic Change over a Century: Linking Diverse Historical Data New Technologies and Interdisciplinary Research on Religion Harvard, 2010 Robert D. Woodberry Juan Carlos Esparza University of Texas at Austin Sociology Department and Population Research Center. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The challenge:

Religion and Economic Change over a Century:

Linking Diverse Historical Data

New Technologies andInterdisciplinary Research on Religion

Harvard, 2010

Robert D. Woodberry Juan Carlos EsparzaUniversity of Texas at Austin

Sociology Department and Population Research Center

Page 2: The challenge:

The challenge:Roots of current differences may go back decades, even centuries – How test?

Religious recordsvaluable information

seldom used

Linking diverse sources over time

Page 3: The challenge:

The data:Source: Data: Characteristics:

Electronic datasets

Recent censuses, surveys and geo-climatic data

Polygons & Grids of Cells

Historical data

Historic censuses and colonial records Polygons

Protestant Data

Missionaries, education, etc. Points (mission stations)

English, Danish, Norwegian, French, German, and Spanish

Catholic Data

Missionaries, education, etc. Polygons (ecclesiastical jurisdictions)

English, Chinese, Italian, French, German, Latin, Spanish, Polish, and Portuguese.

Page 4: The challenge:

Problems:Gathering complete data

Digitizing data & maps

Normalizing and linking data from different sources

Dealing with missing data

Creating database for geo-spatial statistical modeling

Page 5: The challenge:

Complete dataLocating and evaluating “the universe” of sources

Temporal coverage

Spatial coverage

Data Quality

Variables included

Page 6: The challenge:

Complete dataComplete data often only available in archives: e.g., “Vatican Secret Archives,” & “Archives of Propaganda Fide”

Negotiating access

Locating, copying and digitizing sources

Page 7: The challenge:

Spatial LinkingIssues:

1) Data given for different spatial units

2) Spatial units change over time

3) Accuracy of base map

Page 8: The challenge:

Spatial Linking

1) Data given for different spatial units

Protestant: points

Catholic: polygons

Censuses, surveys, geo-climatic data:

different polygons and grids of cells

Page 9: The challenge:
Page 10: The challenge:
Page 11: The challenge:
Page 12: The challenge:

Spatial Linking

2) Spatial units change over time

Cities’ & towns’ names change

Catholic ecclesiastical jurisdictions evolve

National, provincial, and other state boundaries change

Page 13: The challenge:
Page 14: The challenge:
Page 15: The challenge:

Spatial LinkingWhy Important?

Connecting data to proper geographic referente.g., EJs & provinces in 1913

Linking data over time

For statistical analysis

For imputation

(How does data in 1892 relate to data in 1934 and 2009)

Page 16: The challenge:

Spatial Linking

3) Historic maps inaccurate (limited usefulness)

Points:Why matters:

1) change over time

2) link to proper polygon

3) link to proper geo-climatic conditions

Find place in modern gazetteer

Link locations between sources known alternative names

consistent institutions

Page 17: The challenge:
Page 18: The challenge:

Spatial Linking

Historic maps inaccurate (limited usefulness)

Territories: map spaghetti

Why matters:

1) Arbitrarily linking borders

2) Imputing data to artificial slivers

3) How link data when no maps

Page 19: The challenge:
Page 20: The challenge:

Spatial Linking

Improving accuracy:

Start with accurate modern maps

Reconstruct border change from legal documents

Reconstruct border overlap from legal documents

(e.g., Catholics and state jurisdictions borders)

Bring modern borders back through time

Page 21: The challenge:

Linking (cont.)

Accurate base maps:

Current world maps insufficient accuracy(e.g., mission stations in ocean or wrong country)

Improve coastlines, islands, borders, and maritime boundaries

Remove sliversAllows automatic linking of point and polygon data

Page 22: The challenge:

Maritime Boundaries

Page 23: The challenge:

Reconstructing historic borders:

Papal decrees document changes in EJs & identify corresponding government borders

Page 24: The challenge:
Page 25: The challenge:

Linking (cont.)Reconstructing historic borders:

Check accuracy with country & empire records

Smallest unit in legal sources determines size of MCGUs and precision of data linking

When possible use modern borders, when not digitize border from relatively accurate historical maps

Page 26: The challenge:

Linking (cont.)Determine Maximum Consistent Geographic Unit

(MCGU) before creating digital maps

MCGUs foundation for all linking and imputation

Only one base map (easy to update)

All other geographic units are unions of MCGUs

Page 27: The challenge:

Linking (cont.)

Maximum Consistent Geographic Unit (MCGU)

All point and cell data link to MCGUs

Protestant data

Geo-climatic data

Missionary mortality data

Also allow contextual analysis

(spatial autocorrelations, etc.)

Minimizes over-aggregation of data

Page 28: The challenge:
Page 29: The challenge:

Linking (cont.)

Linking geo-climatic data (endogeneity)

Aggregate as grid of cells: Grid of boxes covering world

Assign unique IDs and vectorize raster data

Normalize so boxes perfectly overlap and IDs match between layers

(very hard and time consuming)

Aggregate for MCGUs

Page 30: The challenge:
Page 31: The challenge:
Page 32: The challenge:
Page 33: The challenge:

Linking (cont.)

Linking mortality data (endogeneity)

Data on over 100,000 missionary lives

Calculate comparative mortality estimates by linking lives to

1) points (mission stations)

2) polygons (Countries, EJs & MCGUs)

Can generalized to other areas based on geo-climatic conditions, etc.

Page 34: The challenge:

Name Sex Born Sailed Loc_01 Begin End Loc_02 Begin2End2

Cover, James Fleet 1 1762 1796 Tahiti 1797 1798 Port Jackson 1798 1800

Eyre,John 1 1768 1796 Tahiti 1797 1808 Huahine 1808 1809

Jefferson, John 1 1760 1796 Tahiti 1797 1807

Lewis, Thomas 1 1765 1796 Tahiti 1797 1799

Bicknell, Henry 1 1766 1796 Tahiti 1797 1808 Port Jackson

Bowell, Daniel 1 1774 1796 Tongataboo 1797 1799

Broomhall, Benjamin 1 1776 1796 Tahiti 1797 1801

Buchanan, John 1 1765 1796 Tongataboo 1797 1800 Port Jackson 1800 1800

Cooper, James 1 1768 1796 Tongataboo 1797 1800 Port Jackson 1800 1801

Cock, John 1 1773 1796 Tahiti 1797 1798 Port Jackson 1798

Page 35: The challenge:

Missing DataProblems:

Changing categories between sources/years

Inconsistent categories within same source

Missing places in source

Inconsistent years between sources

Page 36: The challenge:

Missing Data (cont.)Strategies:

Finding missing data:

Letters of bishops to Pope

Triangulating between sources- To identify missing institutions &

organizations

- To identify estimates from inconsistencies

- To fill in missing data

Page 37: The challenge:

Missing Data (cont.)Strategies:

Imputing missing data (multiple imputation):

Using: 1) trend over time in MCGUs

- e.g., using linked MCGUs in 1913 & 1932

to estimate 1923

2) pattern with neighbor

Can compare results with and without imputed data

Page 38: The challenge:

An example: Mexico

Reconstruct all locality changes back to 1815

Reconstruct all EJ changes from 1850

Link historical censuses & modern surveys

Re-aggregate data according to any geographic unit (MCGU or larger)

Page 39: The challenge:
Page 40: The challenge:

Mexico (cont.)

Once completed:

All census, Catholic, and Protestant data linked for about 120 years

Multiple current surveys linked so can analyze modern consequences

Longitudinal database of MCGUs

Page 41: The challenge:

Mexico (cont.)Interrupted Time Series:

impact of introducing Protestant missions on Catholic church behavior

impact of Catholic and Protestant interventions on the change in literacy between censuses

Cumulative Influence:

Endogeneity: test correlates of when and where Protestants and Catholics invest in particular areas.

Page 42: The challenge:
Page 43: The challenge:

Thank You!• .