statistics for psychology patrick murphy department of statistics room l548

43
Statistics for Statistics for Psychology Psychology Patrick Murphy Patrick Murphy Department of Department of Statistics Statistics Room L548 Room L548 5th Floor Library 5th Floor Library Building Building [email protected] [email protected]

Upload: gail-fowler

Post on 14-Mar-2016

52 views

Category:

Documents


0 download

DESCRIPTION

Statistics for Psychology Patrick Murphy Department of Statistics Room L548 5th Floor Library Building [email protected]. 12 Lectures 2.00 pm Tuesdays Theatre L Textbook Seeing Through Statistics by Jessica Utts Duxbury Press. CLASS WEBPAGE - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Statistics for PsychologyStatistics for Psychology

Patrick MurphyPatrick MurphyDepartment of StatisticsDepartment of Statistics

Room L548 Room L548 5th Floor Library Building5th Floor Library Building

[email protected]@UCD.IE

Page 2: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

12 Lectures12 Lectures

2.00 pm Tuesdays 2.00 pm Tuesdays Theatre LTheatre L

TextbookTextbookSeeing Through StatisticsSeeing Through Statistics

by by Jessica UttsJessica Utts

Duxbury PressDuxbury Press

Page 3: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

CLASS WEBPAGECLASS WEBPAGE

1. 1. Go to the Statistics Go to the Statistics Department WebsiteDepartment Website

WWW.UCD.IE/~Statdept/WWW.UCD.IE/~Statdept/

2.2. Then click on Then click on ClassPages ClassPages in the left in the left frameframe

3. Finally click on 3. Finally click on Statistics for PsychologyStatistics for Psychology

Page 4: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

What do you know about statistics?

Page 5: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

It’s boring…

Page 6: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Frogs and Princesses

Page 7: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

There are three kinds of lies:

LiesLies Damned LiesDamned Lies and and StatisticsStatistics

- Benjamin Disraeli- Benjamin Disraeli

Page 8: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

A single death is a tragedy, a million deaths is a statistic.

Joseph Stalin (1879-1953)

Page 9: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

The weaker the data available upon which to base one's conclusion, thegreater the precision which should be quoted in order to give the dataauthenticity.

Norman R. Augustine

Page 10: Statistics for Psychology Patrick Murphy Department of Statistics Room L548
Page 11: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Simpsons episode: Homer is questioned about his Homer is questioned about his

newly formed vigilante groupnewly formed vigilante groupNewscaster:Newscaster: Since your group Since your group started up, petty crime is down started up, petty crime is down 20%, but other crimes are up. 20%, but other crimes are up. Such as heavy sack beating Such as heavy sack beating which is up 800%. So you’re which is up 800%. So you’re actually increasing crime.actually increasing crime.Homer:Homer: You can make up You can make up statistics to prove anything.statistics to prove anything.43% of people know that.43% of people know that.

Page 12: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Misuse of Statistics

The Great Meryl Streep Apple The Great Meryl Streep Apple Juice Cancer Scare Juice Cancer Scare

Asbestos is really bad for you Asbestos is really bad for you so we need to eradicate it from so we need to eradicate it from our buildingsour buildings

Page 13: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Aeroplanes

1/1,000,000 chance of a bomb 1/1,000,000 chance of a bomb on a planeon a plane

Aeroplane EnginesAeroplane Engines

Page 14: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

The foundation of Probability The foundation of Probability theory lies in problems associated theory lies in problems associated with gambling and games of chancewith gambling and games of chance

The Romans used played a game The Romans used played a game with ASTRAGALI - Heel bones of with ASTRAGALI - Heel bones of animalsanimals

What about Probability?

Page 15: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

DICE as we know them were DICE as we know them were invented around 300 BCinvented around 300 BC

DICE

Page 16: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

“I lied, cheated and stole to become a millionaire. Now anybody at all can win the lottery and become a millionaire”

Page 17: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

LOTTO 6/42 What are the chance of What are the chance of

winning with one selection winning with one selection of 6 numbers?of 6 numbers?

Matches Matches Odds Odds 66 1 in 5,245,7861 in 5,245,786

5 5 1 in 24,2861 in 24,286

4 4 1 in 555 1 in 555

Page 18: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

LOTTO 6/42 The average time to win each of the prizes is The average time to win each of the prizes is

given by:given by:

Match 3 with BonusMatch 3 with Bonus 2 Years, 6 Weeks2 Years, 6 Weeks

Match 4Match 4 2 Years, 8 Months 2 Years, 8 Months

Match 5Match 5 116 Years, 9 Months 116 Years, 9 Months

Match 5 with BonusMatch 5 with Bonus 4323 Years, 5 Months4323 Years, 5 Months

Share in JackpotShare in Jackpot 25,220 Years 25,220 Years

Page 19: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Why do people still play the lottery?

If you’re not in you can’t win!If you’re not in you can’t win! You never know your luck You never know your luck

until you try!until you try! My chances of winning a My chances of winning a

million are better than my million are better than my chances of earning a million.chances of earning a million.

The lottery is a tax on the The lottery is a tax on the statistically challenged.statistically challenged.

Page 20: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Lincoln & Kennedy

Abraham Lincoln was elected to Congress in 1846.

John F Kennedy was elected to Congress in 1946.

Abraham Lincoln was elected President in 1860.

John F. Kennedy was elected President in 1960.

The names Lincoln and Kennedy each contain seven letters.

Both were particularly concerned with civil rights.

Page 21: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Lincoln & Kennedy Both wives lost a child while

living in the White House. Both Presidents were shot on a

Friday. Both Presidents were shot in the

head. Lincoln's secretary was named

Kennedy. Kennedy's secretary was named

Lincoln. Both were assassinated by

Southerners.

Page 22: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Lincoln & Kennedy Both were succeeded by

Southerners named Johnson. Andrew Johnson, who succeeded

Lincoln, was born in 1808. Lyndon Johnson, who succeeded

Kennedy, was born in 1908. John Wilkes Booth, who

assassinated Lincoln, was born in 1839.

Lee Harvey Oswald, who assassinated Kennedy, was born in 1939.

Page 23: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Lincoln & Kennedy Both assassins were known by their

three names. Both names are composed of

fifteen letters. Lincoln was shot at the theatre

named 'Ford.' Kennedy was shot in a car called

'Lincoln.' Booth ran from the theatre and was

caught in a warehouse. Oswald ran from a warehouse and

was caught in a theatre. Booth and Oswald were

assassinated before their trials.

Page 24: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Lincoln & Kennedy And here's the clincher. A week before Lincoln was

shot, he was in Monroe, Maryland.

A week before Kennedy was shot, he was in Marilyn Monroe.

Oh…and on the day he died Lincoln pardoned a man named…

Patrick Murphy

Page 25: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Election: Which parties have

most power?

Party A - 45%Party A - 45% Party B - 44%Party B - 44% Party C - 7%Party C - 7% Party D - 4%Party D - 4%

Page 26: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

We’re ready to play some games…

Page 27: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

An Example Experiment:Experiment: Roll Two Dice Roll Two Dice Possible Outcomes:Possible Outcomes: Any Any

number from 1 to 6 can appear number from 1 to 6 can appear on each die.on each die.

There are 36 possible outcomes There are 36 possible outcomes Each Outcome in the Sample Each Outcome in the Sample

Space is equally probable.Space is equally probable. So the probability of each So the probability of each

outcome is 1/36outcome is 1/36 What is the probability of the What is the probability of the

Event - “get combined total of 7 Event - “get combined total of 7 on the dice”on the dice”

Page 28: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

(1,6)

(2,5)

(3,4)

(4,3)

(5,2)

(6,1)

(1,1) (1,2) (1,3) (1,4) (1,5) (1,6)

(2,1) (2,2) (2,3) (2,4) (2,5) (2,6)

(3,1) (3,2) (3,3) (3,4) (3,5) (3,6)

(4,1) (4,2) (4,3) (4,4) (4,5) (4,6)

(5,1) (5,2) (5,3) (5,4) (5,5) (5,6)

(6,1) (6,2) (6,3) (6,4) (6,5) (6,6)

Page 29: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

A more interesting example Game Show Game Show ““Who wants to win a Ferrari?”Who wants to win a Ferrari?” 3 doors3 doors 1 Car & 2 Goats1 Car & 2 Goats You pick a door - e.g. #1You pick a door - e.g. #1 Host knows what’s behind all Host knows what’s behind all

the doors and he opens another the doors and he opens another door, say #3, and shows you a door, say #3, and shows you a goatgoat

He then asks if you want to He then asks if you want to stick with your original choice stick with your original choice #1, or change to door #2?#1, or change to door #2?

Page 30: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Ask Marilyn. Marilyn vos SavantMarilyn vos Savant Guinness Book of Records -Guinness Book of Records -

Highest IQHighest IQ ““Yes you should switch. The first Yes you should switch. The first

door has a 1/3 chance of winning door has a 1/3 chance of winning while the second has a 2/3 chance while the second has a 2/3 chance of winning.”of winning.”

Ph.D.s - Now two doors, 1 goat & Ph.D.s - Now two doors, 1 goat & 1 car so chances of winning are 1 car so chances of winning are 1/2 for door #1 and 1/2 for door 1/2 for door #1 and 1/2 for door #2.#2.

““You are the goat”You are the goat” - Western - Western State University.State University.

Page 31: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Who’s right? At the start, the sample space is:At the start, the sample space is:

{{CCGG, GG, GGCG, CG, GGGC}GC} Pick a door e.g. #1Pick a door e.g. #1 1 in 3 chance of winning1 in 3 chance of winning

Host shows you a goat so nowHost shows you a goat so now

{{CCGGGG, , GGCCGG, , GGGGC}C} So Marilyn was right, you So Marilyn was right, you

should switch.should switch.

Page 32: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

StatisticsStatistics is the science of data. is the science of data. This involves collecting, analysing This involves collecting, analysing and interpreting information.and interpreting information.Descriptive StatisticsDescriptive Statistics uses uses graphical and numerical techniques graphical and numerical techniques to summarise and display the to summarise and display the information contained in a dataset.information contained in a dataset.Inferential StatisticsInferential Statistics uses sample uses sample data to make decisions or data to make decisions or predictions about a larger predictions about a larger population of data population of data

Chapter 1The Beginning

Page 33: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

More DefinitionsMore Definitions

Population:Population: The entire collection of The entire collection of individuals or objects about which individuals or objects about which information is desired.information is desired.Sample:Sample: A part (subset) of the A part (subset) of the populationpopulation selected in some prescribed selected in some prescribed manner.manner.Variable:Variable: A characteristic or property A characteristic or property of an individual unit in the population.of an individual unit in the population.Representative Sample:Representative Sample: A selection A selection of data chosen from the target of data chosen from the target population population which exhibits which exhibits characteristics typical of the characteristics typical of the populationpopulation..Representative samples should give Representative samples should give unbiased estimatesunbiased estimates

Page 34: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

More DefinitionsMore Definitions

The most common way to select a The most common way to select a Representative Sample is to choose a Representative Sample is to choose a Random Sample.Random Sample.A A Random SampleRandom Sample is a sample is a sample selected so that each different possible selected so that each different possible sample of the desired size has an equal sample of the desired size has an equal chance of being the one chosen.chance of being the one chosen.This implies that each member of the This implies that each member of the original population has an equal chance original population has an equal chance of being selected in any random of being selected in any random sample.sample.

Page 35: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Descriptive vs Inferential StatisticsDescriptive statistics is only interested Descriptive statistics is only interested in describing a dataset, whereas in describing a dataset, whereas Inferential Statistics seeks to make a Inferential Statistics seeks to make a decision based on the data.decision based on the data.

Page 36: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

An Example of Descriptive Statistics- UCD Faculties

Faculties Faculty # Students # Degrees # PG Degrees Arts 4,438 1,153 342 Commerce 2,129 424 395 Law 463 120 43 Science 1,868 327 106 Engineering 1,142 229 88 Medicine 1,185 218 63 Architecture 289 79 22 Agriculture 950 130 0

Page 37: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Faculty

010002000300040005000

# Degrees

0500

10001500

Arts Commerce

Law Science

Engineering

Medicine

Architectur

e

Agriculture

X

Page 38: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

Degrees/Student

00.050.1

0.150.2

0.250.3

Arts Commerce

Law Science

Engineering

Medicine

Architectur

e

Agriculture

By using Descriptive Statistics to display the data in this manner we can now analyse the data more easily to find trends or patterns which were not immediately obvious in the original dataset.

Page 39: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

The Basics of Inferential Statistics - An ExampleA Newspaper wants to know whether A Newspaper wants to know whether people are happy with the performance people are happy with the performance of the Government. They hire a of the Government. They hire a company to conduct an opinion poll. company to conduct an opinion poll. The pollsters select 1000 people and The pollsters select 1000 people and ask them the question: “Are you happy ask them the question: “Are you happy with the performance of the with the performance of the Government?”Government?”The Newspaper prints a headline like The Newspaper prints a headline like the following:the following:““70% want the Government to go”70% want the Government to go”oror““Government achieves record Government achieves record popularity among voters” popularity among voters”

Page 40: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

How can the newspaper publish things How can the newspaper publish things like this?like this?

They have only got the opinions of less They have only got the opinions of less than 1000 people ( remember the than 1000 people ( remember the “don’t knows”).“don’t knows”).

1000/2.3 Million = 0.00043 or 0.043%1000/2.3 Million = 0.00043 or 0.043%

Before the end of this course we will Before the end of this course we will find out in great detail whether we find out in great detail whether we should believe these polls.should believe these polls.

Page 41: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

For the moment lets examine the For the moment lets examine the procedure carried out in this example.procedure carried out in this example.

The newspaper is interested in a certain The newspaper is interested in a certain population. What is this Population?population. What is this Population?

The newspaper wants to measure some The newspaper wants to measure some variable for each unit of the population. variable for each unit of the population. What variable do they want to What variable do they want to measure?measure?

The opinion pollsters decide to select a The opinion pollsters decide to select a sample from the population. What is sample from the population. What is the sample? the sample?

And what is so special about the And what is so special about the sample chosen?sample chosen?

Is the result reliable?Is the result reliable?

Page 42: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

How to collect data.

Before we can begin making inferences about Before we can begin making inferences about the data we need to collect the data itself. the data we need to collect the data itself. Usually one gets data in one of 4 different Usually one gets data in one of 4 different ways.ways.

Data from a published sourceData from a published sourceThe data has already been collected and the The data has already been collected and the results published, all we need do is draw results published, all we need do is draw conclusions from the data. This is where conclusions from the data. This is where politicians and economists get most of their politicians and economists get most of their data. A boring way to get data!!!data. A boring way to get data!!!

Data from a designed experimentData from a designed experimentHere you design and conduct an experiment Here you design and conduct an experiment to measure some characteristic of a to measure some characteristic of a population. You have strict control over how population. You have strict control over how the experiment is carried out. This is the way the experiment is carried out. This is the way scientists collect their data and it is the scientists collect their data and it is the method which should provide the most method which should provide the most accurate results.accurate results.

Page 43: Statistics for Psychology Patrick Murphy Department of Statistics Room L548

How to collect data continued... Data from a surveyData from a survey

Here you select a representative sample of Here you select a representative sample of people from the population you are interested people from the population you are interested in. You ask each person some questions and in. You ask each person some questions and record their answers. This method is used by record their answers. This method is used by polling companies, government statisticians polling companies, government statisticians etc. It has certain obvious drawbacks relating etc. It has certain obvious drawbacks relating to the truthfulness of responses.to the truthfulness of responses.

Data collected observationallyData collected observationallyHere one observes the sample in its normal Here one observes the sample in its normal environment and records the variables of environment and records the variables of interest. Used by biologists and psychologists.interest. Used by biologists and psychologists.