basic statistical concepts part ii psych 231: research methods in psychology

31
Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Post on 22-Dec-2015

229 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Basic Statistical ConceptsPart II

Psych 231: Research Methods in Psychology

Page 2: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Statistics

Why do we use them? Descriptive statistics

• Used to describe, simplify, & organize data sets• Describing distributions of scores

Inferential statistics• Used to test claims about the population, based on data

gathered from samples• Takes sampling error into account, are the results above

and beyond what you’d expect by random chance

Page 3: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Distribution

Properties of a distribution Shape

• Symmetric v. asymmetric (skew) • Unimodal v. multimodal

Center• Where most of the data in the distribution are

• Mean, Median, Mode

Spread (variability)• How similar/dissimilar are the scores in the distribution?

• Standard deviation (variance), Range

Page 4: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

The Mean

The most commonly used measure of center The arithmetic average

Computing the mean

μ =∑X

N

– The formula for the population mean is (a parameter):

– The formula for the sample mean is (a statistic):

X =∑ X

n

Add up all of the X’s

Divide by the total number in the population

Divide by the total number in the sample

Page 5: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Spread (Variability)

How similar are the scores? Range: the maximum value - minimum value

• Only takes two scores from the distribution into account

• Influenced by extreme values (outliers) Standard deviation (SD): (essentially) the average amount that

the scores in the distribution deviate from the mean• Takes all of the scores into account

• Also influenced by extreme values (but not as much as the range)

Variance: standard deviation squared

Page 6: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Variability

Low variabilityThe scores are fairly similar

High variabilityThe scores are fairly dissimilar

mean mean

Page 7: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Standard deviation

The standard deviation is the most popular and most important measure of variability. The standard deviation measures how far off all of the

individuals in the distribution are from a standard, where that standard is the mean of the distribution.

• Essentially, the average of the deviations.

μ

Page 8: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

An Example: Computing the Mean

Our population2, 4, 6, 8

μ =∑X

N=

2 + 4 + 6 + 8

4=

20

4= 5.0

1 2 3 4 5 6 7 8 9 10

μ

Page 9: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

An Example: Computing Standard Deviation (population)

Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution.

μ =∑X

N=

2 + 4 + 6 + 8

4=

20

4= 5.0

2 - 5 = -3

1 2 3 4 5 6 7 8 9 10

μX - μ = deviation scores

-3

Our population2, 4, 6, 8

Page 10: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

1 2 3 4 5 6 7 8 9 10

μ =∑X

N=

2 + 4 + 6 + 8

4=

20

4= 5.0

2 - 5 = -34 - 5 = -1

μX - μ = deviation scores

-1

Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution.

Our population2, 4, 6, 8

An Example: Computing Standard Deviation (population)

Page 11: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

1 2 3 4 5 6 7 8 9 10

μ =∑X

N=

2 + 4 + 6 + 8

4=

20

4= 5.0

2 - 5 = -34 - 5 = -1

6 - 5 = +1μ

X - μ = deviation scores

1

Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution.

Our population2, 4, 6, 8

An Example: Computing Standard Deviation (population)

Page 12: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

1 2 3 4 5 6 7 8 9 10

μ =∑X

N=

2 + 4 + 6 + 8

4=

20

4= 5.0

2 - 5 = -34 - 5 = -1

6 - 5 = +18 - 5 = +3

μX - μ = deviation scores

3

Notice that if you add up all of the deviations they must equal 0.

Step 1: To get a measure of the deviation we need to subtract the population mean from every individual in our distribution.

Our population2, 4, 6, 8

An Example: Computing Standard Deviation (population)

Page 13: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Step 2: So what we have to do is get rid of the negative signs. We do this by squaring the deviations and then taking the square root of the sum of the squared deviations (SS).

SS = (X - μ)2

2 - 5 = -34 - 5 = -1

6 - 5 = +18 - 5 = +3

X - μ = deviation scores

= (-3)2+ (-1)2+ (+1)2+ (+3)2

= 9 + 1 + 1 + 9 = 20

An Example: Computing Standard Deviation (population)

Page 14: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Step 3: ComputeVariance (which is simply the average of the squared deviations (SS)) So to get the mean, we need to divide by the number of

individuals in the population.

variance = 2 = SS/N

An Example: Computing Standard Deviation (population)

Page 15: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Step 4: Compute Standard Deviation To get this we need to take the square root of the population

variance.

2 =X − μ( )

2∑N

standard deviation = =

An Example: Computing Standard Deviation (population)

Page 16: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

To review: Step 1: Compute deviation scores Step 2: Compute the SS Step 3: Determine the variance

• Take the average of the squared deviations• Divide the SS by the N

Step 4: Determine the standard deviation• Take the square root of the variance

An Example: Computing Standard Deviation (population)

Page 17: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

To review: Step 1: Compute deviation scores Step 2: Compute the SS Step 3: Determine the variance

• Take the average of the squared deviations• Divide the SS by the N-1

Step 4: Determine the standard deviation• Take the square root of the variance

An Example: Computing Standard Deviation (sample)

This is done because samples are biased to be less variable than the population. This “correction factor” will increase the sample’s SD (making it a better estimate of the population’s SD)

Page 18: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Relationships between variables

Example: Suppose that you notice that the more you study for an exam, the better your score typically is. This suggests that there is a relationship

between study time and test performance. We call this relationship a correlation.

Page 19: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Relationships between variables

Properties of a correlation Form (linear or non-linear) Direction (positive or negative) Strength (none, weak, strong, perfect)

To examine this relationship you should: Make a scatterplot Compute the Correlation Coefficient

Page 20: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Scatterplot

Plots one variable against the other Useful for “seeing” the relationship

Form, Direction, and Strength

Each point corresponds to a different individual Imagine a line through the data points

Page 21: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Scatterplot

Hours study

X

Exam perf.

Y

6 6

1 2

5 6

3 4

3 2

Y

X

1

2

3

4

5

6

1 2 3 4 5 6

Page 22: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Correlation Coefficient

A numerical description of the relationship between two variables

For relationship between two continuous variables we use Pearson’s r

It basically tells us how much our two variables vary together As X goes up, what does Y typically do

• X, Y• X, Y• X, Y

Page 23: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Form

Non-linearLinear

Page 24: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Direction

NegativePositive

• As X goes up, Y goes up

• X & Y vary in the same direction

• Positive Pearson’s r

• As X goes up, Y goes down

• X & Y vary in opposite directions

• Negative Pearson’s r

Y

X

Y

X

Page 25: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Strength

Zero means “no relationship”. The farther the r is from zero, the stronger the

relationship

The strength of the relationship Spread around the line (note the axis scales)

Page 26: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Strength

r = 1.0“perfect positive corr.”

r = -1.0“perfect negative corr.”

r = 0.0“no relationship”

-1.0 0.0 +1.0

The farther from zero, the stronger the relationship

Page 27: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Strength

-1.0 0.0 +1.0

-.8 .5

Which relationship is stronger?

Rel A, -0.8 is stronger than +0.5

r = -0.8

Rel A

r = 0.5

Rel B

Page 28: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Y

X

1

2

3

4

5

6

1 2 3 4 5 6

Regression

Compute the equation for the line that best fits the data points

Y = (X)(slope) + (intercept)

2.0

Change in Y

Change in X= slope

0.5

Page 29: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Y

X

1

2

3

4

5

6

1 2 3 4 5 6

Regression

Can make specific predictions about Y based on X

Y = (X)(.5) + (2.0)X = 5

Y = ? Y = (5)(.5) + (2.0)

Y = 2.5 + 2 = 4.54.5

Page 30: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Regression

Also need a measure of error

Y = X(.5) + (2.0) + error Y = X(.5) + (2.0) + error

Y

X

1

2

3

4

5

6

1 2 3 4 5 6

Y

X

1

2

3

4

5

6

1 2 3 4 5 6

• Same line, but different relationships (strength difference)

Page 31: Basic Statistical Concepts Part II Psych 231: Research Methods in Psychology

Cautions with correlation & regression

Don’t make causal claims Don’t extrapolate Extreme scores (outliers) can strongly

influence the calculated relationship