stat 310 - probability and statisticsdobelman/courses/310.overview.0.pdf1. probability and...

Post on 20-Aug-2020

15 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

ECON 307/STAT 310Probability and Statistics

or

Introduction to Mathematical Statistics

Fall 2017

George R. Brown School of Engineering - STATISTICS

Everyone’s Favorite Subject

2

Elementary Statistics

4

Logistics and Expectations

• Fast Paced– Not an abbreviated exercise– Concise as possible– Required base knowledge for most of stat

courses to come• What to expect

– Get and read the book(s)!– Lectures– Class participation– Homework– Exams

5

Presenter
Presentation Notes
Win to “kick you own ass”

Understanding• Dr. Dobelman’s website:

– How to find– http://dobelman.rice.edu

• Course Syllabus• Canvas• Pace yourself

6

Mathematical Statistics

7

( ) is continuousE( ) ( )

isdiscretei i

xf x dx xX XdF x

x p x

= = Σ

∫∫

Statistics• What is “Statistics”

– vs. what “are” statistics• [sample] average• [sample] proportion

• What is N?• Why talk about probability and statistics?

8

Statistics• Branch of mathematics (BOM) which studies

methods for the calculation of probabilities• BOM - collection and interpretation of

quantitative data and the use of probability theory to estimate population parameters

• OED– The branch of political science concerned with

the collection, classification, and discussion of (esp. numerical) facts bearing on the condition of a state or community.

9

– The branch of science or mathematics concerned with the analysis and interpretation of numerical data and appropriate ways of gathering such data.

– Also, the systematic collection and arrangement of numerical facts or data of any kind

10

Don’t Like Statistics• Why don’t people like statistics?

– Some people “hate” statistics– Too hard to understand or learn– Doesn’t make sense/not intuitive– Too many formulas– Do not understand the applicability– Easy to misuse or “lie”

11

ECON 307/STAT 310

• Population versus samples• Descriptive Statistics

– Summarizing, describing, data reduction• Probability (populations)

– How the population behaves – Probabilities of obtaining outcomes

• Statistics and Inference (samples)– Data used to make statements or decisions

about the universe from which the data are obtained

12

Population v. Sample

F(x)

µ

X

13

X

Presenter
Presentation Notes
Ps. 1 1] Blessed is the man that walketh not in the counsel of the ungodly, nor standeth in the way of sinners, nor sitteth in the seat of the scornful.[2] But his delight is in the law of the LORD; and in his law doth he meditate day and night.[3] And he shall be like a tree planted by the rivers of water, that bringeth forth his fruit in his season; his leaf also shall not wither; and whatsoever he doeth shall prosper.[4] The ungodly are not so: but are like the chaff which the wind driveth away.[5] Therefore the ungodly shall not stand in the judgment, nor sinners in the congregation of the righteous.[6] For the LORD knoweth the way of the righteous: but the way of the ungodly shall perish. Ro 4. [3] For what saith the scripture? Abraham believed God, and it was counted unto him for righteousness.

Descriptive• Stem and Leaf• Histogram• Univariate summary

statistics

14

114,023 EarthquakeMag.Mean 1.98Median 1.90Mode 1.90Standard Deviation 0.8803Variance 0.7749Kurtosis 0.4995Skewness 0.4353Standard Error 0.0026Range 7.3Minimum -0.6Maximum 6.797% Conf. Level 0.0057

2 Days, Japan, 2/10/01

• Probability– Mathematical foundation/basis for results in

statistics• Statistics

– Branch of scientific method that deals with [numerical] properties of populations that occur in nature (or our imaginations), of natural phenomena

– Natural phenomena includes all the happenings of the external world, human or not

– Estimators of parameters15

. .sup ( ) ( ) 0a snF x F x− →

ˆ( ) ; ( ) ?u P X u f aα ααβ> = =

Laplacian Determinism

• Laplacian Dæmon – 19th century ideal– Perfect knowledge of the past and the

system → perfect prediction

• Poincare complication (per Mirowski, 1990)

– Imperfect knowledge of past (minor errors) → wildly discrepant future predictions

– 60 years before Mandelbrot 16

Presenter
Presentation Notes
Simeon Denis Poisson Ph.D. École Polytechnique 1800 France Advisor 1: Joseph Louis Lagrange; Advisor 2: Pierre-Simon Laplace Mirowski, Philip. From Mandelbrot to Chaos in Economic Theory, Southern Economic Journal, Vol. 57, No. 2 (Oct., 1990), pp. 289-307

Keep in mind• Understand variability• Violin story• Fundamental theorem of probability

• Fundamental theorem of statistics

17

. .sup ( ) ( ) 0

a s

nF x F x− →

( )d

n X

X

Xn zµσ−

→Φ

. .a s

nX µ→

Black Magic Stats Courses/Books• Table of Contents

– 1. Histograms, time series charts– 2. Organizing Data– 3. Averages and Variation– 4. Correlation and Regression– 5. Elementary Probability Theory– 6. The Binomial Probability Distribution and Related Topics– 7. Normal Curves and Sampling Distributions – 8.1 Estimating µ When σ is Known– 8.2 Estimating µ When σ is Unknown– 8.3 Estimating p in the Binomial Distribution– 9.2 Testing the Mean of µ– 9.3 Testing a Proportion p– 10. Inferences About Differences– 11.1 Chi-Square: Tests of Independence– 11.2 Chi-Square: Goodness of Fit– 11.3 Testing a Single Variance or Standard Deviation– Part II: Inferences Relating to Linear Regression– 11.4 Inferences for Correlation and Regression

• Supposed to remember - Basic notation and definitions

How to Handle This?

MathStat Contents1. Probability and Distributions2. Multivariate Distributions3. Some Special Distributions4. Some Elementary Statistical Inferences5. Consistency and Limiting Distributions6. Maximum Likelihood Methods7. Sufficiency8. Optimal Tests of Hypotheses9. Inferences about Normal Models10. Nonparametric and Robust Statistics11. Bayesian Statistics

20

Appendix A. Mathematical CommentsAppendix B. R-FunctionsAppendix C. Tables of DistributionsAppendix D. List of Common DistributionsAppendix E. ReferencesAppendix F. Answers to Selected ExercisesIndex

21

1. Probability and Distributions.

22

( ) ( )( ), , , ,XP FωΩ →

P( ) ?X A∈ =

( ) ( )E( ( )) ( ) P( ) ( ) ( )ω ω ωΩ

= =∫ ∫

g X g X d g X dF x

1. Probability and Distributions.

• Set theory• The probability set function• Conditional probability and

Independence• Random variables

– Discrete random variables– Discrete random variables -

Transformations

23

1. Probability and Distributions.

– Continuous random variables– Continuous random variables -

Transformations• Expectation of a random variable• Some special expectations• Important Inequalities

24

2. Multivariate Distributions.

25

( ) ( )( )( ) 2

2 ,, , , ,XY

X YP Fωω

Ω →

, ?X Yf =

( )( ) ( )( )2

E E( ) E( ) E( ) E( ) XYX X Y Y X X Y Y dFΣ = − − = − −∫∫

( , ) ( , )U V g X Y=

1

2

( , )( , )

u g x yv g x y==

, ?U Vf =

2. Multivariate Distributions

• Distribution of two random variables• Distribution of two random variables -

Expectation• Transformations - Bivariate random

variables• Conditional distributions and

expectations• The Correlation Coefficient

26

2. Multivariate Distributions

• Independent random variables• Extension to several random variables• Covariance matrices• Transformations for several random

variables• Linear combinations of random

variables

27

3. Some Special Distributions

28

P( ) ( )X xX x f x p= = =

P( ) ( ) ( )x x

XX x F x dF f x dx−∞ −∞

≤ = = =∫ ∫

3. Some Special Distributions

• Binomial and related distributions• The Poisson distribution• The gamma, chi squared and beta

distributions• The Normal distribution

29

3. Some Special Distributions

• The multivariate normal distribution• The t-distribution• The F-distribution• Student's theorem• Mixture distributions

30

4. Elementary Statistical Inferences

31

1ˆ( ) ( , , )nW X W X X X θ= ≡ =

( )P ( ) ( ) 1 95%L X U Xθ α≤ ≤ = − =

0

1

: 100: 100

HH

µµ=≠

1 , , nX X

iid ( )with ( | )i X XX F x f x θ

4. Elementary Statistical Inferences

• Sampling and statistics• Histogram estimates of pdmf's• Confidence intervals• Confidence intervals for differences in

means• Confidence intervals for differences in

proportions• Confidence intervals for parameters of

discrete distributions

32

4. Elementary Statistical Inferences

• Order statistics• Quantiles and confidence intervals for

quantiles• Introduction to hypothesis testing• Additional comments about statistical

tests• Chi-Squared tests• Monte Carlo

33

4. Elementary Statistical Inferences

• Bootstrap procedures• Percentile bootstrap confidence intervals• Bootstrap testing procedures

34

5. Consistency/Limiting Distributions

35

. .

p

a sn

pLn n n

cn

d

X X

X X X X X XX X

→→ ⇒ → ⇒ →→

( ) dn

X

n X µφ

σ

−→

5. Consistency/Limiting Distributions

• Convergence in probability• Convergence in distribution• Convergence in distribution - Bounded in

probability• Convergence in distribution - Delta method• Convergence in distribution - MGF

technique• Central limit theorem• Multivariate extensions

36

37

6. Maximum Likelihood Methods

38

1ˆ ( , , )nW X Xθ =

( | ) ( | )XL x f xθ θ=

2E( ( ))Var( )( )n

gWIθθ

6. Maximum Likelihood Methods

• Maximum Likelihood estimation• Cramer-Rao lower bound and efficiency• Maximum likelihood tests• Multi parameter case: Estimates• Multi parameter case: Testing• the EM algorithm

39

7. Sufficiency

400 20 40 60 80 100

01

23

45

6

Sum of Xi = 213

Index

Fish

Cau

ght p

er H

our

?λ =

7. Sufficiency• Measures of quality of estimators• A sufficient statistic for a parameter• Properties of a sufficient statistic• Completeness and uniqueness• The Exponential class of distributions

41

( ) ( ) ( ) ( )( | ) w T x H x CXf x e θ θθ ′ + +=

7. Sufficiency• Functions of a parameter• The case of several parameters• Minimal sufficiency and auxiliary

statistics• Sufficiency, completeness and

independence

42

8. Hypothesis Testing

• Hypothesis

• Test statistic T=T(X1,X2,…,Xn)• Critical region (to reject H0)

• Significance level (α)

• Power of the test (β)43

0 0 0 0

1 1 1 0

: :: :

H HH H

θ θ θ θθ θ θ θ= == ≠

0 0

1 0

::

H f fH f f

=≠

0 : Reject R x H=

8. Optimal Tests of Hypotheses

44

0( ) P(reject )Hβ θ θ= ∀

1 0( ) ( ) vs.β θ β θ θ θ′ ′≥ ∀ ∈Θ ∈Θ

0

sup ( )( )

sup ( )

L xx

L xθ

θ

∈Θ

∈Θ

Λ =

0:Reject ifR x H cλ= ≤

8. Optimal Tests of Hypotheses

45

8. Optimal Tests of Hypotheses

• Most powerful tests• Uniformly most powerful tests• Likelihood ratio tests• Sequential probability ratio test• Minimax procedure• Classification procedure

46

9. Inferences about Normal Models

47

( )( ) ( )11

21/2/2

1( )2

TX X

X pf x eµ µ

π

−−− Σ −

1TX X−Σ

1( ) ( )TX Xµ µ−− Σ −

Y Xβ ε= +

( ) 1ˆ T TX X X Yβ−

=

9. Inferences about Normal Models

• Quadratic forms• One-way ANOVA• Noncentral chi square and F-

distributions• Multiple comparisons• The analysis of variance• A regression problem• A test of independence

48

9. Inferences about Normal Models

• The distribution of certain quadratic forms

• The independence of certain quadratic forms

49

10. Nonparametric and Robust Statistics

50

10. Nonparametric and Robust Statistics

• Location models• Sample median and the sign test

– ARE– estimating equations base on the sign test– CI for median

• Signed-rank Wilcoxon– ARE– estimating equations based on Signed-

rank Wilcoxon– CI for median

51

10. Nonparametric and Robust Statistics

• Mann-Whitney-Wilcoxon procedure– ARE– estimating equations based on Mann-

Whitney-Wilcoxon– CI for shift parameter ∆

• General rank scores– Efficacy– estimating equations based on general

scores– optimization: best estimates

52

10. Nonparametric and Robust Statistics

• Adaptive procedures• Simple linear model• Measures of association

– Kendall's tau τ– Spearman's rho ρ

• Robust concepts– location model– linear model

53

11. Bayesian Statistics

54

11. Bayesian Statistics

• Subjective probability• Bayesian procedures:

– Prior and posterior distributions– Bayesian point estimates– Bayesian interval estimation– Bayesian testing procedures– Bayesian sequential procedures

• More Bayesian terminology and ideas• Gibbs sampler• Modern empirical Bayes

55

Appendices

• Mathematical comments– Regularity conditions– Sequences

• R-Functions• Tables of Distributions• List of Common Distributions• References• Answers to Selected Exercises

56

top related