Statistics for Librarians, Session 1: What is Statistics & Why is it Important?


Upload: university-of-north-texas

Post on 07-May-2015


DESCRIPTION

First of 4 sessions introducing statistics to librarians and library staff.

TRANSCRIPT

Page 1: Statistics for Librarians, Session 1: What is statistics & Why is it important?

WHAT IS STATISTICS?

Why is it important?

Page 2: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Goals of Series

Comfort

Fears

Page 3: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Series Objectives

Foundations

Descriptive Statistics

Inferential Statistics

Reading & Interpreting Statistics

Comfort Level

Page 4: Statistics for Librarians, Session 1: What is statistics & Why is it important?

What is Statistics?

• Study of Data
• Collecting
• Organizing
• Summarizing
• Analyzing
• Presenting
• Storing & Sharing

Why is it Important?

• Make sense of the data

• Explain what happens and (possibly) why

• Make sound decisions

• To know how close we are to the truth.

Page 5: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Results

Bias?

Sampling Error?

Invalid Measures?

Random Error?

Other Factors?

Purpose of Statistics

Page 6: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Thinking about Data in your Research Project

Page 7: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Start with your Research Question

How do users differ when (searching, finding, selecting) (articles, books, Web sites)?

What are the effects of ___________ on ____________?

Which is better at improving _________?

How are people (finding, selecting, using) _______?

What are factors associated with ___________?

Page 8: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Example of Research Question

PACS
• Low LibQUAL+ Ratings

Collections
• Is it our collections?

Do we have what they use?
• Based on citations

Page 9: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Variables

Independent

Subjects

Factors

Effects of…

Dependent

Objects

Outcomes

Effects on…

Page 10: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Example of Variables

Faculty
• Department
• Years at UNT

Published
• # published by type

Cited
• # cited by type
• UNT accessible

IV

DV

Page 11: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Scales of Data (NOIR)

Nominal
• Counts by category
• Binary (Yes/No)
• No meaning between the categories (Blue is not better than Red)

Ordinal
• Ranks
• Scales
• Space between ranks is subjective

Interval
• Integers
• No baseline
• Space between values is equal and objective, but discrete

Ratio
• Interval data with a baseline
• Space between is continuous

Page 12: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Likert-Type Scale?

Arbitrary

Few Levels

Individual Questions

Ordinal?

Symmetrical

Many Levels

Composite Score

Interval?

Page 13: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Example of Variable Types

Faculty
• Department
• Years at UNT

Published
• # published by type

Cited
• # cited by type
• UNT accessible

N

N

NN

I

Page 14: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Compared to What?

Book Circulations

180,354

Page 15: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Compared by…

Time Periods

Other Libraries

National Surveys

Patron Types

Material Types

Page 16: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Research Question

Data Type

Comparison Group

Statistical Methods Used

Page 17: Statistics for Librarians, Session 1: What is statistics & Why is it important?

VALIDITY OF MEASURES

Page 18: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Are you actually measuring what you are trying to measure?

Page 19: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Selecting Measures

Measures
• Counts
• Survey responses
• Grades/Scores
• Ranks
• Scales (e.g. Likert)
• Age, Length of Time
• Frequency

Units of Analysis
• People
• Books
• Articles
• Uses
• Levels of Analysis
• What is the object (DV)?
• What is the subject (IV)?

Page 20: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Use a tool with established validity

Approaches and Study Skills Inventory for Students (ASSIST)

User Engagement Scale (UES)

Page 21: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Establish Validity of Measures

Reliability
• Consistency

Content Validity
• Corresponds with expectations
• Common understandings

Construct Validity
• Corresponds with other variables based on theory

Criterion Validity
• Corresponds with other measures

Page 23: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Results

Bias?

Invalid Measures?

Sampling Error?

Random Error?

Other Factors?

Page 24: Statistics for Librarians, Session 1: What is statistics & Why is it important?

ROLE OF SAMPLING

Page 25: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Census
• All members of population
• Hard to measure
• The Truth

Sample
• A selection of the population
• Easier to measure
• An estimate of the truth
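The difference between "the truth" from a census and "an estimate of the truth" from a sample can be sketched as below. This is an illustration only: the population of per-patron checkout counts is simulated, and the names `population`, `truth`, `estimate`, and `sampling_error` are hypothetical.

```python
import random
import statistics

# Hypothetical population: checkouts per patron for 5,000 patrons (simulated).
rng = random.Random(0)
population = [rng.randint(0, 40) for _ in range(5000)]

# Census: measure every member of the population.
truth = statistics.mean(population)

# Sample: measure a random selection of 100 patrons.
sample = rng.sample(population, 100)
estimate = statistics.mean(sample)

# Sampling error: how far the sample estimate falls from the census value.
sampling_error = estimate - truth
print(f"census mean = {truth:.2f}, sample mean = {estimate:.2f}, "
      f"sampling error = {sampling_error:+.2f}")
```

Rerunning with a different seed gives a different sample and a different error, which is the uncertainty that inferential statistics tries to account for.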

Page 26: Statistics for Librarians, Session 1: What is statistics & Why is it important?

When to Use Which: Research Question?

Census

• Book usage at UNT Libraries

• Effects of IL instruction on English 1100 students

Sample

• Book usage at all libraries

• Effects of IL instruction on all students

Page 27: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Example - Census or Sample?

All journal articles cited

All Items Published by PACS Faculty

All journal articles published by PACS faculty

Page 28: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Random Samples

• Every Unit of Analysis has an equal and known chance of being included.
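A simple random sample, the most basic way to give every unit an equal and known chance, might be drawn as in this sketch. The sampling frame of article IDs and the function name `simple_random_sample` are hypothetical.

```python
import random

# Hypothetical sampling frame: one ID per unit of analysis (e.g. cited articles).
population = [f"article-{i:04d}" for i in range(1, 1001)]

def simple_random_sample(units, n, seed=None):
    """Draw n units without replacement; each unit's chance is n / len(units)."""
    rng = random.Random(seed)  # a fixed seed makes the draw repeatable
    return rng.sample(units, n)

sample = simple_random_sample(population, 50, seed=42)

# The inclusion probability is known in advance and equal for every unit.
inclusion_probability = 50 / len(population)
print(len(sample), inclusion_probability)
```

Recording the seed alongside the data lets someone else reproduce exactly which units were selected.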

Page 29: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Importance of Randomness

Random Samples

Random, Weighted, etc.

Should be representative of population

Can use inferential statistics

Most useful for testing hypotheses

Non-Random Samples

Convenience, Purposive, etc.

May or may not be representative of population

Use descriptive statistics only

Most useful for generating hypotheses

Page 30: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Results

Bias?

Invalid Measures?

Sampling Error?

Random Error?

Other Factors?

Page 31: Statistics for Librarians, Session 1: What is statistics & Why is it important?

ROLE OF DATA COLLECTION IN STATISTICS

Page 32: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Goal of Data Collection in Statistics

Reliability

Bias

Page 33: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Bias

Systematic (not random) deviation from the true value (Statistics.com)

Selection Bias

Measurement
• Observer Bias
• Non-response Bias

Analysis Bias

Page 34: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Data Collection Forms

Many or Complex Variables

Surveys

1 Unit Per Form

Fewer Variables

Collected all at once

Bibliometric

Space Surveys

Spreadsheet

Page 35: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Data Input

Have a data entry plan

Train the inputters

Use data validation tricks

Double-entry
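Double-entry means keying the same forms twice and comparing the two passes. A minimal sketch of that comparison follows, assuming each pass is stored as a dictionary keyed by row ID; the field names and the function `double_entry_mismatches` are hypothetical.

```python
def double_entry_mismatches(first_pass, second_pass):
    """Return (row_id, field, value_1, value_2) for every disagreement."""
    mismatches = []
    for row_id in sorted(set(first_pass) | set(second_pass)):
        a = first_pass.get(row_id, {})
        b = second_pass.get(row_id, {})
        for field in sorted(set(a) | set(b)):
            if a.get(field) != b.get(field):
                mismatches.append((row_id, field, a.get(field), b.get(field)))
    return mismatches

# Hypothetical data keyed in twice by two different people.
entry_1 = {1: {"department": "History", "cited": 12},
           2: {"department": "Music", "cited": 7}}
entry_2 = {1: {"department": "History", "cited": 21},
           2: {"department": "Music", "cited": 7}}

mismatches = double_entry_mismatches(entry_1, entry_2)
for row_id, field, v1, v2 in mismatches:
    print(f"row {row_id}: {field!r} was entered as {v1!r} and {v2!r}")
```

Each mismatch is then checked against the original form rather than resolved by guessing.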

Page 36: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Organizing Data

One Unit of Analysis per Row

Page 37: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Example Spreadsheets

Page 38: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Results

Bias?

Invalid Measures?

Sampling Error?

Random Error?

Other Factors?

Page 39: Statistics for Librarians, Session 1: What is statistics & Why is it important?

STATISTICAL ANALYSIS

Page 40: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Elements of Statistical Analysis

Central Tendency

Error

Spread

Page 41: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Inferential

• Infer associations

Descriptive

• Describe

Page 42: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Descriptive Analysis

Just the Facts, Ma’am

Summarizes
• Tables
• Charts

Univariate
• One variable at a time

Comparison with Population
• Demonstrates how random the sample is

Page 43: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Measures of Central Tendency

Mean
• Average

Median
• Middle

Mode
• Most Common
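The three measures can be computed with Python's standard `statistics` module. The checkout counts below are made-up illustration data.

```python
import statistics

# Hypothetical data: checkouts per patron in one month.
checkouts = [0, 1, 1, 2, 3, 3, 3, 4, 12]

mean = statistics.mean(checkouts)      # average: sum divided by count
median = statistics.median(checkouts)  # middle value of the sorted data
mode = statistics.mode(checkouts)      # most common value

print(f"mean = {mean:.2f}, median = {median}, mode = {mode}")
```

The one heavy borrower (12 checkouts) pulls the mean above the median, which is why the median is often reported for skewed data.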

Page 44: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Central Tendency by Scales

Interval or Ratio

Mean

Median

Nominal or Rank

Median (ranked data only)

Mode
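For ranked data such as Likert-type responses, the median and mode can be found without treating the labels as numbers with equal spacing. A sketch under that assumption follows; the response labels and data are hypothetical.

```python
import statistics
from collections import Counter

# Hypothetical ordinal scale, listed from lowest to highest rank.
LEVELS = ["Strongly disagree", "Disagree", "Neutral", "Agree", "Strongly agree"]
responses = ["Agree", "Neutral", "Agree", "Strongly agree",
             "Disagree", "Agree", "Neutral"]

# Median: convert labels to rank positions, take the middle one, convert back.
# median_low always returns an actual data point, so the result is a valid rank.
ranks = [LEVELS.index(r) for r in responses]
median_label = LEVELS[statistics.median_low(ranks)]

# Mode: the most frequent label; this also works for purely nominal data.
mode_label = Counter(responses).most_common(1)[0][0]

print(median_label, mode_label)
```

A mean is deliberately not computed here, because averaging rank positions assumes the space between ranks is equal.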

Page 45: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Spread

How variable is the data?

Interval & Ratio
• Range
• Quartiles or Quintiles
• Standard Deviation

Nominal & Rank
• Distribution Tables
• Bar Graphs

Page 46: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Range & Quartiles
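Range and quartiles for a small data set might be computed as below. The values are made up, and `method="inclusive"` is one of several quartile conventions, so other tools may give slightly different cut points.

```python
import statistics

# Hypothetical data: number of citations per article, already sorted.
data = [2, 4, 4, 5, 7, 9, 10, 12, 15, 20, 41]

# Range: distance between the smallest and largest values.
data_range = max(data) - min(data)

# Quartiles: three cut points that divide the sorted data into four parts.
q1, q2, q3 = statistics.quantiles(data, n=4, method="inclusive")

# Interquartile range: spread of the middle half of the data.
iqr = q3 - q1

print(f"range = {data_range}, Q1 = {q1}, median = {q2}, Q3 = {q3}, IQR = {iqr}")
```

The range is driven entirely by the single extreme value (41), while the interquartile range ignores it.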

Page 47: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Standard Deviation

• Measure of dispersion of data
• Square root of the average squared deviation from the mean (the square root of the variance)
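The calculation can be traced step by step on a small made-up data set. This sketch computes the population form (dividing by N); the sample form divides by N - 1 and is shown for comparison.

```python
import math
import statistics

# Hypothetical data set of eight values.
values = [2, 4, 4, 4, 5, 5, 7, 9]

# Step 1: the mean.
mean = sum(values) / len(values)

# Step 2: the average squared deviation from the mean (the variance).
variance = sum((x - mean) ** 2 for x in values) / len(values)

# Step 3: the square root of the variance is the standard deviation.
sd = math.sqrt(variance)

# The standard library gives the same population value, plus the sample version.
print(sd, statistics.pstdev(values), statistics.stdev(values))
```

Squaring the deviations before averaging keeps positive and negative deviations from cancelling out.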

Page 48: Statistics for Librarians, Session 1: What is statistics & Why is it important?

What does the Standard Deviation tell you?

Greater variation, less certainty

Lower variation, more certainty

Page 49: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Presentation of Spread

• Box plots
• Mean
• Upper & lower quintiles
• Outliers
• Cross-tabulations
• Bar graphs

Page 50: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Spread of Nominal data
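For nominal data, spread is usually shown as a distribution table of counts and percentages per category. A minimal sketch follows, using made-up material types.

```python
from collections import Counter

# Hypothetical nominal data: material type of each cited item.
formats = ["Book", "Journal", "Journal", "Web site", "Journal", "Book"]

counts = Counter(formats)
total = sum(counts.values())

# Distribution table: (category, count, percent), most frequent first.
table = [(category, n, round(100 * n / total, 1))
         for category, n in counts.most_common()]

for category, n, percent in table:
    print(f"{category:<10}{n:>3}{percent:>7.1f}%")
```

The same counts are what a bar graph of the categories would plot.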

Page 51: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Bar graphs & plots

Page 52: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Inferential Statistics

Tests of hypotheses
• Associations
• Expectations

Accounts for uncertainty
• Random error
• Confidence interval
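One common way to express uncertainty is a confidence interval around a sample proportion. The sketch below uses the normal-approximation (Wald) interval with z = 1.96 for roughly 95% confidence. The counts are made up, the function name `proportion_ci` is hypothetical, and other interval methods exist that behave better for small samples.

```python
import math

def proportion_ci(successes, n, z=1.96):
    """Normal-approximation confidence interval for a sample proportion."""
    p = successes / n
    standard_error = math.sqrt(p * (1 - p) / n)
    margin = z * standard_error
    return p - margin, p + margin

# Hypothetical sample: 160 of 200 sampled cited articles were accessible.
low, high = proportion_ci(160, 200)
print(f"estimate = {160 / 200:.0%}, 95% CI = {low:.1%} to {high:.1%}")
```

A larger sample shrinks the standard error, so the interval around the same estimate becomes narrower.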

Page 53: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Hypotheses

Your Hypothesis (H1)

Null Hypothesis (H0)

Page 54: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Example Hypothesis

UNT Libraries provides access to…

>=75%* <75%*

*…of journal articles cited by UNT PACS faculty in journal articles published between 2008 and 2011.
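A hypothesis like this one could be tested with a one-sided exact binomial test, sketched below. Several things here are assumptions and not taken from the slides: that ">=75%" is the research hypothesis (H1) and "<75%" the null (H0), that the cited articles were randomly sampled, the counts (165 accessible out of 200), and the 0.05 significance level.

```python
from math import comb

def binomial_upper_tail(k, n, p0):
    """P(X >= k) when X ~ Binomial(n, p0): the one-sided p-value."""
    return sum(comb(n, i) * p0**i * (1 - p0) ** (n - i) for i in range(k, n + 1))

# Hypothetical sample: 165 of 200 randomly sampled cited articles are accessible.
accessible, sampled = 165, 200

# Test at the boundary of the null hypothesis, p0 = 0.75.
p_value = binomial_upper_tail(accessible, sampled, 0.75)

alpha = 0.05  # assumed significance level
reject_null = p_value < alpha
print(f"p = {p_value:.4f}, reject null: {reject_null}")
```

A small p-value says that a result this far above 75% would be unlikely from random error alone if the true access rate were only 75%.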

Page 55: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Hypothesis Testing

p

Sample Size

Central Tendency

Spread

Distribution

Significance Level

Page 56: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Statistical Analysis

Noise

Signal

Page 57: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Results

Bias?

Sampling Error?

Invalid Measures?

Random Error?

Other Factors?

Purpose of Statistics

Page 58: Statistics for Librarians, Session 1: What is statistics & Why is it important?

Role of Validity in Research

Valid
• Measures
• Data Collection
• Sample Selection
• Statistical Methods

Valid
• Data
• Sample
• Statistical Analysis

Valid
• Results