chapter 1 everything you ever wanted to know about statistcs

Download CHAPTER 1 EVERYTHING YOU EVER WANTED TO KNOW ABOUT STATISTCS

If you can't read please download the document

Upload: eustacia-davis

Post on 18-Jan-2018

244 views

Category:

Documents


0 download

DESCRIPTION

Figure 1.1

TRANSCRIPT

CHAPTER 1 EVERYTHING YOU EVER WANTED TO KNOW ABOUT STATISTCS Building Statistical Models Discovering some thing about a phenomena We explain a Phenomena by collecting data from real world and then draw conclusion about what i being studied Analogy: Building a Bridge across a river Figure 1.1 Model is an accurate represenation of the real world. Social Scientists build real world models to predict how these processes operate under certain conditions Simple Statistics Models: The Mean, sums of squares & Std Deviation Mean: Represents a summary of data. 5 Stat lectures and measured the number of friends they had i.e 1,2,3,3 and 4 Mean is 2.6 (Fig 1.2) Total error = sum of deviances Sum of Squared Errors (SS) measures accuracy of model Variance: Measures how well the model fits the data = SS / N-1 Std Deviation: Measures how well the mean represents the data. Large and small standard deviation Model Outcome= Model + Error Figure 1.2 Error in the model Model over estimates Red line is the model Figure 1.3 Mean is a good fit of data Mean is a poor fit of data SS: Sum of squared Errors, Variance and Standard Deviation Frequency Distributions Figure 1.4 In an ideal world data is distributed symmetrically around the mean: Normal distribution Properties of frequency distributions Skewness Figure 1.5 Positively skewed (left)Negatively skewed (right) In a normal distribution the value is zero Kutosis: Distributions vary in their pointyness Figure 1.7 LeptokurticPlatykurtic In a normal distribution the value is zero 1.5.3 Standard Normal Distribution Beachy Head example Figure 1.8 Z scores X / s Z = X X / s Important Z Scores Z = i.e 95% of z scores lie between this range Z = i.e 99% of z scores lie between Z = i.e 99.9% of z scores lie between Is my sample representative of population Figure 1.9 Sampling distribution tells us the behaviour of samples Linear models Figure 1.11 Confidence Intervals Lower boundary of confidence intervals = X (1.96 x SE ) Upper boundary of confidence intervals = X (1.96 x SE ) How can we tell if our model represents the real world Generate a Hypothesis Collect useful data Fit a statistical model to the data Assess this model to see whether it supports initial predictions Prediction made by researcher Experimental Hypothesis (Alternative Hypo) Status Quo is Null Hypothesis Example: Hamburgers make you fat Fischer Statistically Significant : When we are 95% certain that a result is genuine (i.e not a chance finding) should we accept it as being true. Or I f there is only a 5 % probability of some thing occuring by then we accept that it is a true finding. We say it is a statistically significant finding. Test Statistics Two Types of Variance Systematic Variation Due to some genuie effect i.e that can be explained by the model that we have fitted to the data. Unsystemativ Variation This variation is not due to the effect in which we are interested. i.e that can not be explained by the model that we have fitted to the data. Whteher a model is a reasonable representation of what is happening in the population Calculate Test Statistics ( A test stat is simply a stat that has known properties i.e we know how frequently different values of this stat occur) By knowing this we can calculate the probability of obtaining a particular value. The test statistics are t, F & chi square. They represent the same thing test statistics = Variance explained by the model / Variance not explained by the model One and two tailed tests Figure 1.13 Type I & Type II Errors H 0 true (Null is True) H 0 false Null is false) Fail to Reject H 0 Correct decision 1- Type II error P(Acceptinng the Null Hypothesis when it is fls) Reject H 0 Type I error P(Rejecting the null hypothesis, when it is true) Correct Decision 1- Power True state of nature:What is in reality What we did beleive from the experiement Effect Sizes R =.10 Small effect i.e Explains 1% of the total variance R =.30 Small effect i.e Explains 9% of the total variance R =.50 Small effect i.e Explains 25% of the total variance Statistical Power The ability of a test to detect an effect of that size is known as statistical power. 1- Power