statistics 500=psychology 611=biostatistics 550 ... statistics 500 bulk pack - 1 - statistics...

Download Statistics 500=Psychology 611=Biostatistics 550 ... Statistics 500 Bulk Pack - 1 - Statistics 500=Psychology

If you can't read please download the document

Post on 01-Jun-2020




0 download

Embed Size (px)


  • Statistics 500 Bulk Pack - 1 -

    Statistics 500=Psychology 611=Biostatistics 550

    Introduction to Regression and Anova Paul R. Rosenbaum

    Professor, Statistics Department, Wharton School Description Statistics 500/Psychology 611 is a second course in statistics for PhD students in the social, biological and business sciences. It covers multiple linear regression and analysis of variance. Students should have taken an undergraduate course in statistics prior to Statistics 500. Topics 1-Review of basic statistics. 2-Simple regression. 3-Multiple regression. 4-General linear hypothesis. 5-Woes of Regression Coefficients. 6-Transformations. 7-Polynomials. 8-Coded variables. 9-Diagnostics. 10-Variable selection. 11-One-way anova. 12-Two-way and factorial anova. How do I get R for free? Final exam date: Holidays, breaks, last class: My web page: Email: Phone: 215-898-3120 Office: 473 Huntsman Hall (in the tower, 4th floor) Office Hours: Tuesday 1:30-2:30 and by appointment. The bulk pack and course data in R are on my web page.

  • Statistics 500 Bulk Pack - 2 -


    Review of Basic Statistics Descriptive statistics, graphs, probability, confidence intervals, hypothesis tests.

    Simple Regression Simple regression uses a line with one predictor to predict one outcome.

    Multiple Regression Multiple regression uses several predictors in a linear way to predict one


    General Linear Hypothesis The general linear hypothesis asks whether several variables may be dropped

    from a multiple regression.

    Woes of Regression Coefficients Discussion of the difficulties of interpreting regression coefficients and what you

    can do.

    Transformations A simple way to fit curves or nonlinear models: transform the variables.

    Polynomials Another way to fit curves: include quadratics and interactions.

    Coded Variables Using nominal data (NY vs Philly vs LA) as predictors in regression.

    Diagnostics How to find problems in your regression model: residual, leverage and influence.

    Variable Selection Picking which predictors to use when many variables are available.

    One-Way Anova Simplest analysis of variance: Do several groups differ, and if so, how?

    Two-Way Anova Study two sources of variation at the same time.

    Factorial Anova Study two or more treatments at once, including their interactions.

  • Statistics 500 Bulk Pack - 3 -

    Common Questions

    Statistics Department Courses (times, rooms)

    Final Exams (dates, rules)

    Computing and related help at Wharton

    Statistical Computing in the Psychology Department

    When does the the course start? When does it end? Holidays?

    Does anybody have any record of this?

    Huntsman Hall

    Suggested reading

    Box, G. E. P. (1966) Use and Abuse of Regression, Technometrics, 8, 625-629. or ncedSearch%3Fq0%3Dbox%26f0%3Dau%26c0%3DAND%26q1%3Dabuse%26f1%3Dti%26c1%3DAND%26q2%3D%26 f2%3Dall%26c2%3DAND%26q3%3D%26f3%3Dall%26wc%3Don%26sd%3D%26ed%3D%26la%3D%26jo%3D%26dc.S tatistics%3DStatistics%26Search%3DSearch&item=1&ttl=1&returnArticleService=showArticle

    Helpful articles from JSTOR 1. The Analysis of Repeated Measures: A Practical Review with Examples B. S. Everitt The Statistician, Vol. 44, No. 1. (1995), pp. 113-135. 2. The hat matrix in regression and anova. D. Hoaglin and R. Welsh, American Statistician, Vol 32, (1978), pp. 17-22. 3. The Use of Nonparametric Methods in the Statistical Analysis of the Two- Period Change-Over Design Gary G. Koch Biometrics, Vol. 28, No. 2. (Jun., 1972), pp. 577-584.

  • Statistics 500 Bulk Pack - 4 -

    Some Web Addresses

    Web page for Sheather’s text Amazon for Sheather’s text (required) Statistics/dp/0387096078/ref=tmm_hrd_title_0/186-7302133- 0606755?ie=UTF8&qid=1315493088&sr=1-1 Alternative text used several years ago (optional alternative, not suggested) Methods/dp/0495384968/ref=sr_1_1?s=books&ie=UTF8&qid=1315493363&sr=1-1 Good supplement about R (optional, suggested) Based/dp/0521762936/ref=sr_1_1?s=books&ie=UTF8&qid=1315493138&sr=1-1 Review basic statistics, learn basic R (optional, use if you need it) Computing/dp/0387790535/ref=sr_1_1?s=books&ie=UTF8&qid=1315493184&sr=1-1 Excellent text, alternative to Sheather, more difficult than Sheather Statistics/dp/0471170828/ref=sr_1_1?s=books&ie=UTF8&qid=1315493220&sr=1-1 Good text, alternative/supplement to Sheather, easier than Sheather Statistics/dp/0471746967/ref=tmm_hrd_title_0?ie=UTF8&qid=1315493316&sr=1-1 Free R manuals at R home page. Start with “An Introduction to R” --> Manuals --> An Introduction to R --> Search --> Paradis --> R for Beginners My web page (bulk pack, course data)


View more >