advanced lazarsfeldian methodology conference

22
Advanced Lazarsfeldian Methodology Conference From Lazarsfeldian Contextual analysis to Multilevel models (Strategies for analysis of individual and/or aggregate data) Petr Soukup

Upload: fredricka-elijah

Post on 30-Dec-2015

22 views

Category:

Documents


1 download

DESCRIPTION

Advanced Lazarsfeldian Methodology Conference. From Lazarsfeldian Contextual analysis to Multilevel models (Strategies for analysis of individual and/or aggregate data) Petr Soukup. Basic „ideology“. Gauss (1805) – „Regression analysis, OLS“ Homans (1950) - Human group - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Advanced Lazarsfeldian Methodology Conference

Advanced Lazarsfeldian Methodology Conference

From Lazarsfeldian Contextual analysis to Multilevel models

(Strategies for analysis of individual and/or aggregate data)

Petr Soukup

Page 2: Advanced Lazarsfeldian Methodology Conference

Basic „ideology“

Gauss (1805) – „Regression analysis, OLS“Homans (1950) - Human groupRobinson (1950) – „Ecological fallacy“ Lazarsfeld, Menzel (1961) - On the relation

between individual and collective properties Iversen (1991) – Contextual analysisAnd many others…about multilevel

modeling

Page 3: Advanced Lazarsfeldian Methodology Conference

Three possible strategieshow to analyse data

Page 4: Advanced Lazarsfeldian Methodology Conference

Three possible strategies to analyze individual and/or aggregate data

Analyze only individual data (classical regression or correlation analysis)

Analyze only aggregate data (but Robinson’s problem and EI solution)

Analyze individual and aggregate data at once (contextual and multilevel analysis)

Page 5: Advanced Lazarsfeldian Methodology Conference

Individual level data only

Page 6: Advanced Lazarsfeldian Methodology Conference

1st possible strategy (only individual data)

We omit information about aggregate levels (groups etc.), we loose some explained variance

We use classical regression or correlation analysis and many other methods

We make some statistical mistakes by ignoring of some dependencies of observations (or by pretending of the independence)

Results are usually quite good but we are not able differentiate between aggregate levels

If we have individual and aggregate data this is only 2nd best strategy

Page 7: Advanced Lazarsfeldian Methodology Conference

Results for 1st strategy

y = 5000 + 950x

5000

10000

15000

20000

25000

30000

0 5 10 15 20 25

years of education

inco

me

The result is one regression line (one equation). This equation is the same for all individuals („average“ line).

Page 8: Advanced Lazarsfeldian Methodology Conference

Inference from aggregate data

Page 9: Advanced Lazarsfeldian Methodology Conference

2nd possible strategy (only aggregate data)

We omit/do not have information about individual level

We want to infer about individual behaviour

We can make crucial mistakes (Robinson’s problem) so called ecological fallacy (demonstration in the inference about relation of education and salary)

Page 10: Advanced Lazarsfeldian Methodology Conference

2nd possible strategy (only aggregate data)

Ecological fallacy solution = ECOLOGICAL INFERENCE

50-ties method of bounds (Duncan, Davis, 1953), ecological regression (Goodman, 1953, 1959)

90-ties King: A solution to the ecological inference problem (1997)

General solution can not be found (we always loose information by aggregation), current solutions are only specific ones

Page 11: Advanced Lazarsfeldian Methodology Conference

Aggregate(groups) + individual data

Page 12: Advanced Lazarsfeldian Methodology Conference

Picture 4 (Different intercepts and slopes)

5000

10000

15000

20000

25000

30000

0 5 10 15 20 25

years of education

inco

me

male

femalemale+female

Page 13: Advanced Lazarsfeldian Methodology Conference

Lazarsfeldian approach

[Lazarsfeld, Menzel 1961] – typology of variables 1) global, 2) relational, 3) contextual –individual level 4) analytical and 5) structural – aggregate level Examples of these types Warning: This is „reduced“ version of original typology

4) can be derived from 1) by aggregation 5) can be derived from 2) by aggregation 3) can be derived from 1) or 2) measured on aggregate

level by disaggregation

Page 14: Advanced Lazarsfeldian Methodology Conference

Lazarsfeldian approach

This process (aggregation and disaggregation) can have of course more than two levels up to infinity (in practical analyses two or three levels)

The name Contextual analysis – we use information about aggregate data if we analyse individual data

We use currently multilevel analysis based on lazasfeldian contextual ideas

Page 15: Advanced Lazarsfeldian Methodology Conference

Contextual/multilevel analysis

Page 16: Advanced Lazarsfeldian Methodology Conference

Problem with group/context

Group boundaries –sometimes can be fuzzy, it is difficult to decide whether somebody is member of group or not

Mobility between groups – people tend to change group membership (change of school, neighborhood, church etc.) – „new“ members are not influenced by the group at the same level as the old ones

Multiple membership (overlapping) – People are usually members of more than one group, we should work with more contexts, (Whoch context(s) is (are) the most important? (possible solution see slide Other problems that can be solved via ML models)

Page 17: Advanced Lazarsfeldian Methodology Conference

Two types of contextual analysis

Interaction variables for individuals and groups (Method 1), or

A two-step estimated model based on variables measured at the first level for individual contexts, and then by using these estimates at the second level in the role of dependent variables (Method 2).

Page 18: Advanced Lazarsfeldian Methodology Conference

Multilevel analysis

Inclusion of random error at the second (group) level

Estimates by iterative methods

More precise estimates (lower standard errors)

Page 19: Advanced Lazarsfeldian Methodology Conference

Multilevel approach as more general

Growth models We measure characteristic of individuals many times. We can treat measurements in

current time as first level (similar to pupils at schools) and individuals as second levels (similar to school). „Average“ growth curve is one result of analysis but second result can be description (or explanation) of differences of individual growth curves.

Metaanalysis 1st level data from individual studies 2nd level individual studiesGoal: 1. to find common result of all covered studies and 2. to find reason of differences between studies

Cross classified modelsIndividual can be included in more than one group, these group are not hierarchically nested.

and their influences are mixed (crossed)

Page 20: Advanced Lazarsfeldian Methodology Conference

Conclusion?

Page 21: Advanced Lazarsfeldian Methodology Conference

Real research - examples

HSB – USAPISA, TIMMS or ICCS internationally

Page 22: Advanced Lazarsfeldian Methodology Conference

Model for multilevel analysis

i- index for individual, j-index for group

Lowest (individual) level:

(1)Yij= b0j+ b1jXij+ eij,X-individual variable

and at the Second (group) level: NEW (2) b0j= g00+ g01Zj+ u0j Random parts at the second level (3) b1j= g10+ g11Zj+ u1j because we do not have informationZ-group variable about all groups!!!Combining individual and group level:

(1)+(2)+(3): Yij= g00 + g10Xij+ g01Zj+ g11ZjXij -fixed part

+ u1jXij+ u0j+ eij -random part

(random coefficient model see Hox, 2002)