biases and errors in epidemiology anchita khatri

111
Biases and errors in Epidemiology Anchita Khatri

Upload: anne-hulm

Post on 02-Apr-2015

218 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Biases and errors in Epidemiology Anchita Khatri

Biases and errors in Epidemiology

Anchita Khatri

Page 2: Biases and errors in Epidemiology Anchita Khatri

Definitions ERROR: 1. A false or mistaken result obtained in a study

or experiment2. Random error is the portion of variation in

measurement that has no apparent connection to any other measurement or variable, generally regarded as due to chance

3. Systematic error which often has a recognizable source, e.g., a faulty measuring instrument, or pattern, e.g., it is consistently wrong in a particular direction

(Last)

Page 3: Biases and errors in Epidemiology Anchita Khatri

Relationship b/w Bias and Chance

Chance

Bias

Diastolic Blood Pressure (mm Hg)80 90

True BP(intra-arterial cannula)

BP measurement(sphygmomanometer)

No.

of

obse

rvat

ions

Page 4: Biases and errors in Epidemiology Anchita Khatri

Validity

• Validity: The degree to which a measurement measures what it purports to measure (Last)

Degree to which the data measure what they were intended to measure – that is, the results of a measurement correspond to the true state of the phenomenon being measured (Fletcher)

• also known as ‘Accuracy’

Page 5: Biases and errors in Epidemiology Anchita Khatri

Reliability • The degree of stability expected when a

measurement is repeated under identical conditions; degree to which the results obtained from a measurement procedure can be replicated

(Last)

• Extent to which repeated measurements of a stable phenomenon – by different people and instruments, at different times and places – get similar results (Fletcher)

• Also known as ‘Reproduciblity’ and ‘Precision’

Page 6: Biases and errors in Epidemiology Anchita Khatri

Validity and Reliability

VALIDITY

RELIABILITY

High Low

High

Low

Page 7: Biases and errors in Epidemiology Anchita Khatri

Bias• Deviation of results or inferences from the truth,

or processes leading to such deviation. Any trend in the collection, analysis, interpretation, publication, or review of data that can lead to conclusions that are systematically different from the truth. (Last)

• A process at any stage of inference tending to produce results that depart systematically from true values (Fletcher)

Page 8: Biases and errors in Epidemiology Anchita Khatri

Types of biases

1. Selection bias

2. Measurement / (mis)classification bias

3. Confounding bias

Page 9: Biases and errors in Epidemiology Anchita Khatri

Selection bias

• Errors due to systematic differences in characteristics between those who are selected for study and those who are not.

(Last; Beaglehole)

• When comparisons are made between groups of patients that differ in ways other than the main factors under study, that affect the outcome under study. (Fletcher)

Page 10: Biases and errors in Epidemiology Anchita Khatri

Examples of Selection bias

• Subjects: hospital cases under the care of a physician

• Excluded:

1. Die before admission – acute/severe disease.

2. Not sick enough to require hospital care

3. Do not have access due to cost, distance etc.

• Result: conclusions cannot be generalized

• Also known as ‘Ascertainment Bias’(Last)

Page 11: Biases and errors in Epidemiology Anchita Khatri

Ascertainment Bias

• Systematic failure to represent equally all classes of cases or persons supposed to be represented in a sample. This bias may arise because of the nature of the sources from which the persons come, e.g., a specialized clinic; from a diagnostic process influenced by culture, custom, or idiosyncracy. (Last)

Page 12: Biases and errors in Epidemiology Anchita Khatri

Selection bias with ‘volunteers’

• Also known as ‘response bias’

• Systematic error due to differences in characteristics b/w those who choose or volunteer to take part in a study and those who do not

Page 13: Biases and errors in Epidemiology Anchita Khatri

Examples …response bias

• Volunteer either because they are unwell, or worried about an exposure

• Respondents to ‘effects of smoking’ usually not as heavy smokers as non-respondents.

• In a cohort study of newborn children, the proportion successfully followed up for 12 months varied according to the income level of the parents

Page 14: Biases and errors in Epidemiology Anchita Khatri

Examples…. (Assembly bias)• Study: ? association b/w reserpine and breast

cancer in women• Design: Case Control• Cases: Women with breast cancer Controls: Women without breast cancer who were not suffering from any cardio-vascular disease (frequently associated with HT)• Result: Controls likely to be on reserpine

systematically excluded association between reserpine and breast cancer observed

Page 15: Biases and errors in Epidemiology Anchita Khatri

Examples…. (Assembly bias)

• Study: effectiveness of OCP1 vs. OCP2

• Subjects:

on OCP1 – women who had given birth at least once ( able to conceive)

on OCP2 – women had never become pregnant

• Result: if OCP2 found to be better, inference correct??

Page 16: Biases and errors in Epidemiology Anchita Khatri

Susceptibility Bias

• Groups being compared are not equally susceptible to the outcome of interest, for reasons other than the factors under study

• Comparable to ‘Assembly Bias’• In prognosis studies; cohorts may differ in

one or more ways – extent of disease, presence of other diseases, the point of time in the course of disease, prior treatment etc.

Page 17: Biases and errors in Epidemiology Anchita Khatri

Examples…..(Susceptibility Bias)

• Background: for colorectal cancer,

- CEA levels correlated with extent of disease (Duke’s classification)

- Duke’s classification and CEA levels strongly predicted diseases relapse

• Question: Does CEA level predict relapse independent of of Duke’s classification, or was susceptibility to relapse explained by Duke’s classification alone?

Page 18: Biases and errors in Epidemiology Anchita Khatri

Example… CEA levels (contd.)

• Answer: association of pre-op levels of CEA to disease relapse was observed for each category of Duke’s classification

stratification

Page 19: Biases and errors in Epidemiology Anchita Khatri

Disease-free survival according to CEA levels in colorectal cancer pts.with similar

pathological staging (Duke’s B)

3 96 1512 2118 24

80

60

100

0

CEA Level (ng)<2.5

2.5 – 10.0

>10.0

Months

% d

isea

se f

ree

Page 20: Biases and errors in Epidemiology Anchita Khatri

Selection bias with ‘Survival Cohorts’

• Patients are included in study because they are available, and currently have the disease

• For lethal diseases patients in survival cohort are the ones who are fortunate to have survived, and so are available for observation

• For remitting diseases patients are those who are unfortunate enough to have persistent disease

• Also known as ‘Available patient cohorts’

Page 21: Biases and errors in Epidemiology Anchita Khatri

Example… bias with ‘survival cohort’

Assemble Cohort

(N=150)

Measure outcomeImproved: 75Not improved: 75

Assemble patients

Begin Follow-up

(N=50)

Measure outcomeImproved: 40Not improved: 10

Not observed(N=100)

Dropouts Improved: 35Not improved: 65

TRUE COHORT

SURVIVAL COHORT

Observedimprovement

Trueimprovement

50% 50%

80% 50%

Page 22: Biases and errors in Epidemiology Anchita Khatri

Selection bias due to ‘Loss to Follow-up’

• Also known as ‘Migration Bias’

• In nearly all large studies some members of the original cohort drop out of the study

• If drop-outs occur randomly, such that characteristics of lost subjects in one group are on an average similar to those who remain in the group, no bias is introduced

• But ordinarily the characteristics of the lost subjects are not the same

Page 23: Biases and errors in Epidemiology Anchita Khatri

Example of ‘lost to follow-up’

+nt -nt Total

+nt 50 100 150

-nt

10000 20000 30000

EXPOSUREirradiation

+nt -nt Total

+nt 60

-nt

4000 8000 12000

30 30

EXPOSUREirradiation

DIS

EA

SE

cata

ract

RR= 50/10000 100/20000 = 1

RR= 30/4000 30/8000 = 2

Page 24: Biases and errors in Epidemiology Anchita Khatri

Migration bias• A form of Selection Bias• Can occur when patients in one group leave

their original group, dropping out of the study altogether or moving to one of the other groups under study (Fletcher)

• If occur on a large scale, can affect validity of conclusions.

• Bias due to crossover more often a problem in risk studies, than in prognosis studies, because risk studies go on for many years

Page 25: Biases and errors in Epidemiology Anchita Khatri

Example of migration

• Question: relationship between lifestyle and mortality

• Subjects: 10,269 Harvard College alumni

- classified according to physical activity, smoking, weight, BP

- In 1966 and 1977

• Mortality rates observed from 1977 to 1985

Page 26: Biases and errors in Epidemiology Anchita Khatri

Example of migration (contd.)

• Problem: original classification of ‘lifestyle’ might change (migration b/w groups)

• Solution: defined four categories

- Men who maintained high risk lifestyles

- Men who crossed over from low to high risk

- Men who crossed over from high to low risk

- Men who maintained low risk lifestyles

Page 27: Biases and errors in Epidemiology Anchita Khatri

Example of migration (contd.)

• Result: after controlling for other risk factors

- those who maintained or adopted high risk characteristics had highest mortality

- Those who changed from high to low had lesser mortality than above

- Those who never had any high risk behavior had least mortality

Page 28: Biases and errors in Epidemiology Anchita Khatri

Healthy worker effect

• A phenomenon observed initially in studies of occupational diseases: workers usually exhibit lower overall death rates than the general population, because the severely ill and chronically disabled are ordinarily excluded from employment. Death rates in the general population may be inappropriate for comparison if this effect is not taken into account.

(Last)

Page 29: Biases and errors in Epidemiology Anchita Khatri

Example…. ‘healthy worker effect’

• Question: association b/w formaldehyde exposure and eye irritation

• Subjects: factory workers exposed to formaldehyde

• Bias: those who suffer most from eye irritation are likely to leave the job at their own request or on medical advice

• Result: remaining workers are less affected; association effect is diluted

Page 30: Biases and errors in Epidemiology Anchita Khatri

Measurement bias• Systematic error arising from inaccurate

measurements (or classification) of subjects or study variables. (Last)

• Occurs when individual measurements or classifications of disease or exposure are inaccurate (i.e. they do not measure correctly what they are supposed to measure)

(Beaglehole)

• If patients in one group stand a better chance of having their outcomes detected than those in another group. (Fletcher)

Page 31: Biases and errors in Epidemiology Anchita Khatri

Measurement / (Mis) classification

• Exposure misclassification occurs when exposed subjects are incorrectly classified as unexposed, or vice versa

• Disease misclassification occurs when diseased subjects are incorrectly classified as non-diseased, or vice versa

(Norell)

Page 32: Biases and errors in Epidemiology Anchita Khatri

Causes of misclassification

1. Measurement gap: gap between the measured and the true value of a variable

- Observer / interviewer bias

- Recall bias

- Reporting bias

2. Gap b/w the theoretical and empirical definition of exposure / disease

Page 33: Biases and errors in Epidemiology Anchita Khatri

Sources of misclassification

Measurement results

Empirical definition

Theoretical definition

Measurement errors

Gap b/w theoretical & empirical definitions

Page 34: Biases and errors in Epidemiology Anchita Khatri

Example… ‘gap b/w definitions’

Theoretical definition• Exposure: passive

smoking – inhalation of tobacco smoke from other people’s smoking

• Disease: Myocardial infarction – necrosis of the heart muscle tissue

Empirical definition• Exposure: passive

smoking – time spent with smokers (having smokers as room-mates)

• Disease: Myocardial infarction – certain diagnostic criteria (chest pain, enzyme levels, signs on ECG)

Page 35: Biases and errors in Epidemiology Anchita Khatri

Exposure misclassification – Non-differential

• Misclassification does not differ between cases and non-cases

• Generally leads to dilution of effect, i.e. bias towards RR=1 (no association)

Page 36: Biases and errors in Epidemiology Anchita Khatri

Example…Non-differential Exposure Misclassification

+nt -nt Total

+nt 40 80 120

-nt

10000 40000 50000

+nt -nt Total

+nt 60 60 120

-nt

20000 30000 50000

EXPOSUREX-ray exposure

EXPOSUREX-ray exposure

DIS

EA

SE

Bre

ast

Can

cer

RR= 40/10000 80/40000 = 2

RR= 60/20000 60/30000 = 1.5

Page 37: Biases and errors in Epidemiology Anchita Khatri

Exposure misclassification - Differential

• Misclassification differs between cases and non-cases

• Introduces a bias towards

RR= 0 (negative / protective association), or

RR= α (infinity)(strong positive association)

Page 38: Biases and errors in Epidemiology Anchita Khatri

Example…Differential Exposure Misclassification

+nt -nt Total

+nt 40 80 120

-nt 9960 39920 49880

10000 40000 50000

+nt -nt Total

+nt 40 80 120

-nt 19940 29940 49880

19980 30020 50000

EXPOSUREX-ray exposure

EXPOSUREX-ray exposure

DIS

EA

SE

Bre

ast

Can

cer

RR= 40/10000 80/40000 = 2

RR= 40/19980 80/30020 = 0.75

Page 39: Biases and errors in Epidemiology Anchita Khatri

Implications of Differential exposure misclassification

• An improvement in accuracy of exposure information (i.e. no misclassification among those who had breast cancer), actually reduced accuracy of results

• Non-differential misclassification is ‘better’ than differential misclassification

• So, epidemiologists are more concerned with comparability of information than with improving accuracy of information

Page 40: Biases and errors in Epidemiology Anchita Khatri

Causes of Differential Exposure Misclassification

• Recall Bias:Systematic error due to differences in accuracy or completeness of recall to memory of past events or experience.

For e.g. patients suffering from MI are more likely to recall and report ‘lack of exercise’ in the past than controls

Page 41: Biases and errors in Epidemiology Anchita Khatri

Causes of Differential Exposure Misclassification

• Measurement bias:

e.g. analysis of Hb by different methods (cyanmethemoglobin and Sahli's) in cases and controls.

e.g.biochemical analysis of the two groups from two different laboratories, which give consistently different results

Page 42: Biases and errors in Epidemiology Anchita Khatri

Causes of Differential Exposure Misclassification

• Interviewer / observer bias: systematic error due to observer variation (failure of the observer to measure or identify a phenomenon correctly)

e.g. in patients of thrombo-embolism, look for h/o OCP use more aggressively

Page 43: Biases and errors in Epidemiology Anchita Khatri

Measurement bias in treatment effects

• Hawthorne effect: effect (usually positive / beneficial) of being under study upon the persons being studied; their knowledge of being studied influences their behavior

• Placebo effect: (usually, but not necessarily beneficial) expectation that regimen will have effect, i.e. the effect is due to the power of suggestion.

Page 44: Biases and errors in Epidemiology Anchita Khatri

Total effects of treatment are the sum of spontaneous improvement, non-specific responses,

and the effects of specific treatments

Specific to

treatment

Placebo

Hawthorne

Natural

History

EFFECTS

IMP

RO

VE

ME

NT

Page 45: Biases and errors in Epidemiology Anchita Khatri

Confounding 1. A situation in which the effects of two

processes are not separated. The distortion of the apparent effect of an exposure on risk brought about by the association with other factors that can influence the outcome

2. A relationship b/w the effects of two or more causal factors as observed in a set of data such that it is not logically possible to separate the contribution that any single causal factor has made to an effect

(Last)

Page 46: Biases and errors in Epidemiology Anchita Khatri

Confounding

When another exposure exists in the study population (besides the one being studied) and is associated both with disease and the exposure being studied. If this extraneous factor – itself a determinant of or risk factor for health outcome is unequally distributed b/w the exposure subgroups, it can lead to confounding

(Beaglehole)

Page 47: Biases and errors in Epidemiology Anchita Khatri

Confounder … must be

1. Risk factor among the unexposed (itself a determinant of disease)

2. Associated with the exposure under study

3. Unequally distributed among the exposed and the unexposed groups

Page 48: Biases and errors in Epidemiology Anchita Khatri

Examples … confounding

SMOKING LUNG CANCER

AGE(If the average ages of the smoking and non-smoking groups are very different)

(As age advanceschances of lungcancer increase)

Page 49: Biases and errors in Epidemiology Anchita Khatri

Examples … confounding

COFFEE DRINKING HEART DISEASE

SMOKING

(Coffee drinkers are more likely to smoke)

(Smoking increasesthe risk of heart ds)

Page 50: Biases and errors in Epidemiology Anchita Khatri

Examples … confounding

ALCOHOLINTAKE

MYOCARDIALINFARCTION

SEX

(Men are more at risk for MI)

(Men are more likelyto consume alcoholthan women)

Page 51: Biases and errors in Epidemiology Anchita Khatri

Examples … confounding

+nt -nt

+nt 140 100

-nt

Total 30000 30000

+nt -nt

male female male female

+nt 120 20 60 40

-nt

Total 20000 10000 10000 20000

Exposure-alcohol

Exposure-alcohol

Dis

ease

M

I

Dis

ease

M

I

RR = 140/30000 100/30000 = 1.4

RR = 120/20000(M) 60/10000 = 1RR = 20/10000(F) 40/20000 = 1

Page 52: Biases and errors in Epidemiology Anchita Khatri

Example … multiple biases• Study: ?? Association b/w regular exercise and

risk of CHD• Methodology: employees of a plant offered an

exercise program; some volunteered, others did not

coronary events detected by regular voluntary check-ups, including a careful history, ECG, checking routine heath records

• Result: the group that exercised had lower CHD rates

Page 53: Biases and errors in Epidemiology Anchita Khatri

Biases operating

• Selection: volunteers might have had initial lower risk (e.g. lower lipids etc.)

• Measurement: exercise group had a better chance of having a coronary event detected since more likely to be examined more frequently

• Confounding: if exercise group smoked cigarettes less, a known risk factor for CHD

Page 54: Biases and errors in Epidemiology Anchita Khatri

Dealing with Selection Bias

Ideally,

To judge the effect of an exposure / factor on the risk / prognosis of disease, we should compare groups with and without that factor, everything else being equal

But in real life ‘everything else’ is usually not equal

Page 55: Biases and errors in Epidemiology Anchita Khatri

Methods for controlling Selection Bias

During Study Design1. Randomization2. Restriction3. MatchingDuring analysis1. Stratification2. Adjustmenta) Simple / standardizationb) Multiple / multivariate adjustmentc) Best case / worst case analysis

Page 56: Biases and errors in Epidemiology Anchita Khatri

Restriction

• Subjects chosen for study are restricted to only those possessing a narrow range of characteristics, to equalize important extraneous factors

• Limitation: generalisability is compromised; by excluding potential subjects, cohorts / groups selected may be unusual and not representative of most patients or people with condition

Page 57: Biases and errors in Epidemiology Anchita Khatri

Example… restriction

• Study: effect of age on prognosis of MI

• Restriction: Male / White / Uncomplicated anterior wall MI

• Important extraneous factors controlled for: sex / race / severity of disease

• Limitation: results not generalizable to females, people of non-white community, those with complicated MI

Page 58: Biases and errors in Epidemiology Anchita Khatri

Example… restriction

• OCP example

restrict study to women having at least one child

• Colorectal cancer example

restrict patients to a particular staging of Duke’s classification

Page 59: Biases and errors in Epidemiology Anchita Khatri

Matching - definition

• The process of making a study group and a comparison group comparable with respect to extraneous factors (Last)

• For each patient in one group there are one or more patients in the comparison group with same characteristics, except for the factor of interest (Fletcher)

Page 60: Biases and errors in Epidemiology Anchita Khatri

Types of Matching• Caliper matching: process of matching

comparison group to study group within a specific distance for a continuous variable (e.g., matching age to within 2 years)

• Frequency matching: frequency distributions of the matched variable(s) be similar in study and comparison groups

• Category matching: matching the groups in broad classes such as relatively wide age ranges or occupational groups

Page 61: Biases and errors in Epidemiology Anchita Khatri

Types of Matching … (contd.)

• Individual matching: identifying individual subjects for comparison, each resembling a study subject on the matched variable(s)

• Pair matching: individual matching in which the study and comparison subjects are paired

(Last)

Page 62: Biases and errors in Epidemiology Anchita Khatri

• Matching is often done for age, sex, race, place of residence, severity of disease, rate of progression of disease, previous treatment received etc.

• Limitations:- controls for bias for only those factors involved

in the match- Usually not possible to match for more than a

few factors because of the practical difficulties of finding patients that meet all matching criteria

- If categories for matching are relatively crude, there may be room for substantial differences b/w matched groups

Page 63: Biases and errors in Epidemiology Anchita Khatri

Example… Matching• Study: ? Association of Sickle cell trait (HbAS)

with defects in physical growth and cognitive development

• Other potential biasing factors: race, sex, birth date, birth weight, gestational age, 5-min Apgar score, socio economic status

• Solution: matching – for each child with HbAS selected a child with HbAA who was similar with respect to the seven other factors (50+50=100)

• Result: no difference in growth and development

Page 64: Biases and errors in Epidemiology Anchita Khatri

Overmatching A situation that may arise when groups are being

matched. Several varieties:1. The matching procedure partially or

completely obscures evidence of a true causal association b/w the independent and dependant variables. Overmatching may occur if the matching variable is involved in, or is closely connected with, the mechanism whereby the independent variable affects the dependant variable. The matching variable may be an intermediate cause in the causal chain or it may be strongly affected by, or a consequence of, such an intermediate cause

Page 65: Biases and errors in Epidemiology Anchita Khatri

2. The matching procedure uses one or more unnecessary matching variables, e.g., variables that have no causal effect or influence on the dependant variable, and hence cannot confound the relationship b/w the independent and dependant variables.

3. The matching process is unduly elaborate, involving the use of numerous matching variables and / or insisting on a very close similarity with respect to specific matching variables. This leads to difficulty in finding suitable controls (Last)

Page 66: Biases and errors in Epidemiology Anchita Khatri

Stratification • The process of or the result of separating a

sample into several sub-samples according to specified criteria such as age groups, socio-economic status etc. (Last)

• The effect of confounding variables may be controlled by stratifying the analysis of results

• After data are collected, they can be analyzed and results presented according to subgroups of patients, or strata, of similar characteristics (Fletcher)

Page 67: Biases and errors in Epidemiology Anchita Khatri

Example…Stratification (Fletcher)

Pts Deaths %

Total 1200 48 4

Pre-opRisk

High

Medium

Low

500

400

300

30 6

16 4

02 .67

Pts Deaths %

Total 2400 64 2.6

Pre-opRisk

High

Medium

Low

400 24 6

800 32 4

1200 8 .67

HOSPITAL ‘A’

HOSPITAL ‘B’

Page 68: Biases and errors in Epidemiology Anchita Khatri

Example…Stratification

Pinellas county Dade countyRelat.

Rate

Dead Total Rate Dead Total Rate

Overall 5726 374,665 15.3 8332 935,047 8.9 1.7

> 55 yrs

Birth –54 yrs

Age –WiseStratification

737 229,198 3.2 2463 748,035 3.3 1.0

4989 145,147 34.4 5898 187,985 31.2 1.1

Page 69: Biases and errors in Epidemiology Anchita Khatri

Standardization

A set of techniques used to remove as far as possible the effects of differences in age or other confounding variables when comparing two or more populations

The method uses weighted averaging of rates specific for age, sex, or some other potentially confounding variable(s), according to some specified distribution of these variables

(Last)

Page 70: Biases and errors in Epidemiology Anchita Khatri

Standard populationA population in which the age and sex

composition is known precisely, as a result of a census or by an arbitrary means – e.g. an imaginary population, the “standard million” in which the age and sex composition is arbitrary. A standard population is used as comparison group in the actuarial procedure of standardization of mortality rates. (e.g. Segi world population, European standard population) (Last)

Page 71: Biases and errors in Epidemiology Anchita Khatri

Types of standardization

Direct: the specific rates in a study population are averaged using as weights the distribution of a specified standard population.

The standardized rate so obtained represents what the rate would have been in the study population if that population had the same distribution as the standard population w.r.t. the variables for which the adjustment or standardization was carried out.

Page 72: Biases and errors in Epidemiology Anchita Khatri

Indirect: used to compare the study populations for which the specific rates are either statistically unstable or unknown. The specific rates are averaged using as weights the distribution of the study population. The ratio of the crude rate for the study population to the weighted average so obtained is known as standardized mortality (or morbidity) ratio, or SMR. (Last)

[represents what the rate would have been in the study population if that population had the same specific rates as the standard population]

Page 73: Biases and errors in Epidemiology Anchita Khatri

Standardized mortality ratio (SMR)

Ratio of

The no. of deaths observed in the

study group or population

X 100

No. of deaths expected if the study population had the same specific rates as the standard population

Page 74: Biases and errors in Epidemiology Anchita Khatri

Example … direct standardization

Age Pop Deaths Rate

0 4000 60 15.0

1-4 4500 20 4.4

5-14 4000 12 3.0

15-19 5000 15 3.0

20-24 4000 16 4.0

25-34 8000 25 3.1

34-44 9000 48 5.3

45-54 8000 100 12.5

55-64 7000 150 21.4

Total 53,500 446 8.3

Std.Pop Exp deaths

2400 36

9600 42.24

19000 57

9000 27

8000 32

14000 43.4

12000 63.6

11000 137.5

8000 171.2

93000 609.94(6.56)

Page 75: Biases and errors in Epidemiology Anchita Khatri

Example … direct standardization

Preop Pts Deaths %High 500 30 6

Medium 400 16 4Low 300 2 .67

Total 1200 48 4Preop Pts Rate Exp.deathsHigh 400 6 24

Medium 400 4 16Low 400 .67 2.68

Total 1200 42.68 (3.6%)

HOSPITAL ‘A’

HOSPITAL ‘Std’

Page 76: Biases and errors in Epidemiology Anchita Khatri
Page 77: Biases and errors in Epidemiology Anchita Khatri

Stratification vs. Standardization• Standardization removes the effect

• Stratification controls for the effect of factor, but the effect can still be seen

• For e.g. in the ‘hospital example’, with standardization we found that patients had similar prognosis in both hospitals; with stratification also learnt mortality rates among different risk strata

• Similar to difference b/w age-standardized mortality rate and age specific mortality rates

Page 78: Biases and errors in Epidemiology Anchita Khatri

Multivariate adjustment• Simultaneously controlling the effects of

many variables to determine the independent effects of one

• Can select from a large no. of variables a smaller subset that independently and significantly contributes to the overall variation in outcome, and can arrange variables in order of the strength of their contribution

• Only feasible way to deal with many variables at one time during the analysis phase

Page 79: Biases and errors in Epidemiology Anchita Khatri

Examples… Multivariate adjustment

• CHD is the joint result of lipid abnormalities, HT, smoking, family history, DM, exercise, personality type.

• Start with 2x2 tables using one variable at a time

• Contingency tables, i.e. stratified analyses, examining the effect of one variable changed in the presence/absence of one or more variables

Page 80: Biases and errors in Epidemiology Anchita Khatri

Example…Multivariate adjustment

• Multi variable modeling i.e developing a mathematical expression of the effects of many variables taken together

• Basic structure of a multivariate model: Outcome variable = constant + (β1 x variable1)

+ (β2 x variable2) + ……….• β1, β2, … are coefficients determined from the

data; variable1, variable2, …. are the predictor variables that might be related to outcome

Page 81: Biases and errors in Epidemiology Anchita Khatri

Sensitivity analysis

• When data on important prognostic factors is not available, it is possible to estimate the potential effects on the study by assuming various degrees of mal-distribution of the factors b/w the groups being compared and seeing how that would affect the results

• Best case / worst case analysis is a special type of sensitivity analysis – assuming the best and worst type of mal-distribution

Page 82: Biases and errors in Epidemiology Anchita Khatri

Example… best/worst case analysis

• Study: effect of gastro-gastrostomy on morbid obesity

• Subjects: cohort of 123 morbidly obese patients who underwent gastro-gastrostomy, 19 to 47 months after surgery

• Success : losing >30% excess weight• Follow-up: 103 (84%) patients 20 patients lost to follow up

Page 83: Biases and errors in Epidemiology Anchita Khatri

Example…. (contd.)• Success rate: 60/103 (58%)• Best case: all 20 lost to follow up had

“success” Best success rate: (60+20)/123 (65%)• Worst case: all 20 lost to follow up had

“failures” Worst success rate: 60/123 (49%)• Result: true success rate b/w 49% and 65%;

probably closer to 58% ! (because pts. lost to follow up unlikely to be all successes or all failures

Page 84: Biases and errors in Epidemiology Anchita Khatri

Randomization

• The only way to equalize all extraneous factors, or ‘everything else’ is to assign patients to groups randomly so that each has an equal chance of falling into the exposed or unexposed group

• Equalizes even those factors which we might not know about!

• But it is not possible always

Page 85: Biases and errors in Epidemiology Anchita Khatri

Overall strategy

• Except for randomization, all ways of dealing with extraneous differences b/w groups. Are effective against only those factors that are singled out for consideration

• Ordinarily one uses several methods layered one upon another

Page 86: Biases and errors in Epidemiology Anchita Khatri

Example…• Study: effect of presence of VPCs on survival of

patients after acute MI• Strategies:- Restriction: not too young / old; no unusual

causes (e.g.mycotic aneurysm) for infarction- Matching: for age (as important prognostic

factor, but not the factor under study)- Stratification: examine results for different

strata of clinical severity- Multivariate analysis: adjust crude rates for the

effects of all other variables except VPC, taken together.

Page 87: Biases and errors in Epidemiology Anchita Khatri

Dealing with measurement bias

1. Blinding- Subject- Observer / interviewer- Analyser 2. Strict definition / standard definition for

exposure / disease / outcome3. Equal efforts to discover events equally in

all the groups

Page 88: Biases and errors in Epidemiology Anchita Khatri

Controlling confounding

• Similar to controlling for selection bias

• Use randomization, restriction, matching, stratification, standardization, multivariate analysis etc.

Page 89: Biases and errors in Epidemiology Anchita Khatri

Lead time bias• Lead time is the period of time b/w the

detection of a medical condition by screening and when it ordinarily would be diagnosed because a pt. experiences symptoms and seeks medical care

• As a result of screening, on an average, pt will survive longer from the time of diagnosis than who are diagnosed otherwise, even if T/t is not effective.

• Not more ‘survival time’, but more ‘disease time’

Page 90: Biases and errors in Epidemiology Anchita Khatri

How lead time affects survival time

Diag

Diag

Diag

Unscreened

Screened – Early T/t not effective

Screened –Early T/t is effective

Onset of Ds Death Survival after diagnosis

Page 91: Biases and errors in Epidemiology Anchita Khatri

Controlling lead time bias

• Compare screened group of people, and control group, and compare age specific mortality rates, rather than survival times from time of diagnoses

• E.g. early diagnosis and T/t for colorectal cancer is effective because mortality rates of screened people are lower than those of a comparable group of unscreened people

Page 92: Biases and errors in Epidemiology Anchita Khatri

Length time bias• Can affect studies of screening• B’cos the proportion of slow growing tumors

diagnosed during screening programs is greater than those diagnosed during usual medical care

• B’cos slow growing tumors are present for a longer period before they cause symptoms; fast growing tumors are likely to cause symptoms leading to interval diagnosis

• Screening tends to find tumors with inherently better prognoses

Page 93: Biases and errors in Epidemiology Anchita Khatri

Compliance bias• Compliant patients tend to have better

prognoses regardless of the screening

• If a study compares disease outcomes among volunteers for a screening program with outcomes in a group of people who did not volunteer, better results for the volunteers might not be due to T/t but due to factors related to compliance

• Compliance bias and length-time bias can both be avoided by relying on RCTs

Page 94: Biases and errors in Epidemiology Anchita Khatri

Types of studies & related biasesPrevalence study •Uncertainty about temporal sequences

•Bias studying ‘old’/prevalent cases

Case control •Selection bias in selecting cases/controls•Measurement bias

Cohort study •Susceptibility bias•Survival cohort vs. true cohort•Migration bias

Randomized control trials

•Consider natural h/o disease, Hawthorne effect, placebo effect etc.•Compliance problems•Effect of co-interventions

Page 95: Biases and errors in Epidemiology Anchita Khatri

Random error• Divergence on the basis of chance alone of

an observation on a sample from the population from the true population values

• ‘random’ because on an average it is as likely to result in observed values being on one side of the true value as on the other side

• Inherent in all observations• Can be minimized, but never avoided

altogether

Page 96: Biases and errors in Epidemiology Anchita Khatri

Sources of random error

1. Individual biological variation

2. Measurement error

3. Sampling error ( the part of the total estimation of error of a parameter caused by the random nature of the sample)

Page 97: Biases and errors in Epidemiology Anchita Khatri

Sampling variation Because research must ordinarily be

conducted on a sample of patients and not on all the patients with the condition under study

always a possibility that the particular sample of patients in a study, even though selected in an unbiased way, might not be similar to population of patients as a whole

Page 98: Biases and errors in Epidemiology Anchita Khatri

Sampling variation - definition

Since inclusion of individuals in a sample is determined by chance, the results of analysis on two or more samples will differ purely by chance.

(Last)

Page 99: Biases and errors in Epidemiology Anchita Khatri

Assessing the role of chance

1. Hypothesis testing

2. Estimation

Page 100: Biases and errors in Epidemiology Anchita Khatri

Hypothesis testing

Start off with the Null Hypothesis (H0) the statistical hypothesis that one variable

has no association with another variable or set of variables, or that two or more population distributions do not differ from one another.

in simpler terms, the null hypothesis states that the results observed in a study, experiment or test are no different from what might have occurred as a result of operation of chance alone

(Last)

Page 101: Biases and errors in Epidemiology Anchita Khatri

Statistical tests – errors (Fletcher)

TRUE DIFFERENCE

PRESENT

(H0) false

ABSENT

(H0) true

CONCLUSION

OF STATISTICAL

TEST

SIGNIFICANT

(H0) Rejected

NOT

SIGNIFICANT

(H0) Accepted

Type I

( α ) error

Type II

( β ) error

Power

Page 102: Biases and errors in Epidemiology Anchita Khatri

Statistical tests - errors• Type I (α) error: error of rejecting a true

null hypothesis , I.e. declaring a difference exists when it does not

• Type II (β) error: error of failing to reject a false null hypothesis , I.e. declaring that a difference does not exist when in fact it does

• Power of a study: ability of a study to demonstrate an association if one exists

Power = 1- β

Page 103: Biases and errors in Epidemiology Anchita Khatri

p - value• Probability of an α error.• Quantitative estimate of probability that

observed difference in b/w the groups in the study could have happened by chance alone, assuming that there is no real difference b/w the groups OR

• If there were no difference b/w the groups, and the trial was repeated many times, what proportion of the trials would lead to conclusions that there is the same or a bigger difference b/w the groups than the results found in the study

Page 104: Biases and errors in Epidemiology Anchita Khatri

p – value – Remember!!

• Usually P < 0.05 is considered statistically significant (i.e. probability of 1 in 20 that observed difference is due to chance)

• 0.05 is an arbitrary cut-off; can change according to requirements

• Statistically significant result might not be clinically significant and vice-versa

Page 105: Biases and errors in Epidemiology Anchita Khatri

Statistical significance vs. clinical significance

Large RCT called GUSTO (41,021 pts of ac MI)

• Study: Streptokinase vs. tPA

• Result: death rate at 30 days

- streptokinase (7.2%) (p < 0.001)

- tPA (6.3%)

• But, need to treat 100 patients with tPA instead of streptokinase to prevent 1 death!

• tPA costly - $ 250 thousand to save one death

??? Clinically significant

Page 106: Biases and errors in Epidemiology Anchita Khatri

Estimation • Effect size observed in a particular study is

called ‘Point estimate’• True effect is unlikely to be exactly that

observed in study because of random variation

• Confidence interval (CI): usually 95% (Last) computed interval with a given

probability e.g. 95%, that the true value such as a mean, proportion, or rate is contained within the interval

Page 107: Biases and errors in Epidemiology Anchita Khatri

Confidence intervals(Fletcher) If the study is unbiased, there is a 95%

chance that that the interval includes the true effect size. The true value is likely to be close to the point estimate, less likely to be near the outer limits of that interval, and could (5 times out of 100) fall outside these limits altogether,

CI allows the reader to see the range of plausible values and so to decide whether the effect size they regard as clinically meaningful is consistent with or ruled out by the data

Page 108: Biases and errors in Epidemiology Anchita Khatri

Multiple comparison problem

• If a no. of comparisons are made, (e.g. in a large study, the effect of treatment assessed separately for each subgroup, and for each outcome), 1 in 20 of these comparisons is likely to be statistically significant at the 0.05 level

Page 109: Biases and errors in Epidemiology Anchita Khatri

“If you dredge the data sufficiently deeply, and sufficiently often, you will find something odd. Many of these bizarre findings will be due to chance…….discoveries that were not initially postulated among the major objectives of the trial should be treated with extreme caution.”

Page 110: Biases and errors in Epidemiology Anchita Khatri

Dealing with random error• Increasing the sample size: sample size

depends upon

- level of statistical significance (α error)

- Acceptable chance of missing a real effect (β error)

- Magnitude of effect under investigation

- Amount of disease in population

- Relative sizes of groups being compared

• Sample size is usually a compromise b/w ideal and logistic and financial considerations

Page 111: Biases and errors in Epidemiology Anchita Khatri

References

1. Fletcher RH et al.Clinical Epidemiology : The Essentials – 3rd ed.

2. Beaglehole R et al. Basic Epidemiology, WHO

3. Last JM. Dictionary in Epidemiology – 3rd ed.

4. Maxcy-Rosenau-Last. Public Health & Preventive Medicine – 14th ed.

5. Norell SE. Workbook of Epidemiology

6. Park K. Park’s textbook of preventive and social medicine – 16th ed.