Download - Deanne W. Swan, PhD IMLS / OPRE dswan@imls
![Page 1: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/1.jpg)
Analysis of Library Data at the State & Local Level
2013 SDC ConferenceSt. Louis, MO
December 12, 2013
Deanne W. Swan, PhDIMLS / [email protected]
Frank NelsonIdaho Public Libraries
![Page 2: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/2.jpg)
Why data analysis?
![Page 3: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/3.jpg)
We analyze data…
… to discover useful information.… to answer questions.… to solve problems.… to make better decisions.
… to tell a story.
![Page 4: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/4.jpg)
What is data analysis?
![Page 5: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/5.jpg)
Data analysis is…… a process…
of inspecting, cleaning, transforming, and modeling data…
… with the goal of uncovering information, supporting decision making, and telling stories.
![Page 6: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/6.jpg)
State Problem
Select Method
Find Data
Manage Data
Analyze Data
Present Data
Data Analysis – A Brief Introduction
![Page 7: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/7.jpg)
Let’s start with an example…
![Page 8: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/8.jpg)
Children who start school not ready to learn are at-risk for reading below proficiency at the end of third grade.
Children who can’t read at grade level by the end of third grade have low academic achievement in later grades and are less likely to graduate from high school.
Where should we invest our resources?
The Problem
![Page 9: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/9.jpg)
How big of a problem is this?
![Page 10: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/10.jpg)
Does it affect all children the same way?
![Page 11: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/11.jpg)
What are the differences between these children?
![Page 12: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/12.jpg)
How early can we see evidence of this problem?
![Page 13: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/13.jpg)
Does the magnitude of this problem change over time?
![Page 14: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/14.jpg)
Is there a measurable difference between identifiable groups of children?
![Page 15: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/15.jpg)
Is there some trait that might explain or differentiate this gap?
![Page 16: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/16.jpg)
Are there additional factors that might exacerbate the problem?
![Page 17: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/17.jpg)
Is this contextual factor consistent across geography?
![Page 18: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/18.jpg)
Is there a community resource that could ameliorate this problem?
![Page 19: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/19.jpg)
Is this resource utilized equally across child characteristics?
![Page 20: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/20.jpg)
The Problem restated
• In order to succeed in school, children need to be ready to learn, including having fundamental early literacy skills, when they enter school.
• There is an opportunity gap. Certain children are at-risk for entering school not ready to learn.
• These children include children who are Hispanic, children of immigrant parents, and children living in poverty.
![Page 21: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/21.jpg)
• These children are often not enrolled in early education programs that help prepare children for entry to school, leaving these children and their families underserved.
Question:What is the status of children’s programs in public libraries in areas of high concentration of child poverty and immigrant families?
The Problem restated
![Page 22: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/22.jpg)
Analysis
What is the relationship between attendance at public library children’s programs to high levels of child poverty and immigrant status for the top 100 metropolitan areas?Data:
PLS (IMLS)SAIPE and CPS (Census)Crosswalk of Top 100 MSAs
![Page 23: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/23.jpg)
Analysis
![Page 24: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/24.jpg)
Analysis
![Page 25: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/25.jpg)
Analysis
State and County Estimates for 2010The files in the data directory contain estimates of poverty and income for 2010. There is one data file for each state (or US) with data for ALL with the 2010 statistics.
Excel format:est10ALL.xls – US and all states and countiesest10US.xls – US and states data
![Page 26: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/26.jpg)
Analysis
![Page 27: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/27.jpg)
Analysis
![Page 28: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/28.jpg)
Analysis
![Page 29: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/29.jpg)
Analysis
Join (Merge) all of the files based on the linking variable:FIPSCO (FIPS county)
![Page 30: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/30.jpg)
Analysis
![Page 31: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/31.jpg)
Analysis
![Page 32: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/32.jpg)
Analysis
![Page 33: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/33.jpg)
Analysis
![Page 34: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/34.jpg)
Is this resource available to children who are at-risk?
![Page 35: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/35.jpg)
Is the difference in this resource dispersed equally geographically?
![Page 36: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/36.jpg)
• In some areas with high concentrations of children with highest risk (poverty and COI status), there is lower attendance at children’s programs in public libraries.
Result
![Page 37: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/37.jpg)
Statistics without context have no meaning. They are simply numbers.
In order to make our stories more compelling and powerful, we need to put public library data within context:
– Place Geographic, Spatial Data– Time Temporal Data– Social Demographic Data– Economic Financial / Labor Data– Political Program and Policy Data
![Page 38: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/38.jpg)
Data Analysis
Data analysis is a process…
… of inspecting, cleaning, transforming, and modeling data…
… with the goal of uncovering information, supporting decision making, and telling stories.
![Page 39: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/39.jpg)
State Problem
Select Method
Find Data
Manage Data
Analyze Data
Present Data
Data Analysis
![Page 40: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/40.jpg)
Find Data
Where can I get data to analyze?
Collect your own dataOR
Use data someone else collected.
![Page 41: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/41.jpg)
Find Data
Federal Statistical CollectionsIMLS: www.imls.gov
PLS, SLAA
U.S. Census Bureau: www.census.gov ACS, CPS, SAIPE / Data Ferrett
NCES: www.nces.ed.gov NAEP, NHES, ECLS, CCD, SASS
NCHS: www.cdc.gov/nchs/ NHANES, NHIS, NVSS
BLS: www.bls.gov GDP, CPI, (Un)employment
![Page 42: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/42.jpg)
Find Data
![Page 43: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/43.jpg)
Find Data
![Page 44: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/44.jpg)
Find Data
First rule of analysis club:Read the data documentation.
Second rule of analysis club:Read the data documentation.
![Page 45: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/45.jpg)
Manage Data
Managing data includes all of the activities needed to
obtain, inspect,
clean, scrub,
transform, andmanipulate data.
![Page 46: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/46.jpg)
Manage Data
Tools for Cleaning and Analyzing Data
Statistical Packages: SAS, SPSS, Stata ($$$)Free Statistical Tools:
R: http://www.r-project.org/ Data Applied: http://www.data-applied.com/
![Page 47: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/47.jpg)
Manage Data
Download the Data Determine the best format for your needsRead the data documentation.
ResourcesHarvard University GIS tutorial: http://www.gsd.harvard.edu/gis/manual/data/ Sources of Spatial Data, Data Handling, Effective Cartography, Analytic Techniques
U.S. Census Bureau: Download the database http://quickfacts.census.gov/qfd/download_data.html
![Page 48: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/48.jpg)
Manage Data
Join/Merge DataFIPS code (Federal Information Processing Standard)
State, County, Place
FIPS CrosswalkNational Bureau of Economic Research (NBER):http://www.nber.org/data/ssa-fips-state-county-crosswalk.html
![Page 49: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/49.jpg)
Manage Data
How to merge two data files in R:
Suppose you have two data files, dataset1 and dataset2, that need to be merged into a single data set. First, read both data files in R. Then, use the merge() function to join the two data sets based on a unique id variable that is common to both data sets:
> merged.data <- merge(dataset1, dataset2, by=“FIPSCO")
![Page 50: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/50.jpg)
Manage Data
Explore/Clean Data
![Page 51: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/51.jpg)
Manage Data
“…seeing may be believing or disbelieving, but above all, data analysis involves visual, as well as statistical, understanding.”
~ John W. Tukey
Exploratory Data Analysis
![Page 52: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/52.jpg)
Manage Data
Exploratory Data Analysis is…
… a type of statistical analysis.… an attitude about looking at data.… a state of mind.
Traditional statistics = numerical summariesEDA = numerical summaries + graphical displays
![Page 53: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/53.jpg)
Manage Data
Data = smooth + rough
![Page 54: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/54.jpg)
Manage Data
The goal of EDAto discover patterns in the data.
The role of the analystto listen to the data
in as many ways as possibleuntil the data tell a story.
![Page 55: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/55.jpg)
Manage Data
Data are distributed across a range of values, from the lowest to the highest.
To describe the distribution:location (central tendency)spread (dispersion)shape (normal)systematic relationships
![Page 56: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/56.jpg)
Manage Data
Transform Data
Creating new variables based on original variables, such as…
Visitation per capita:
Adjusting financial data for inflation:
![Page 57: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/57.jpg)
Manage Data
Sometimes a variable will need to be transformed to prepare it for analysis.
Common transformationsnatural log: square: x2
square root:
Resource – common transformations and when to use them:http://oak.ucc.nau.edu/rh232/courses/EPS625/Handouts/Data%20Transformation%20Handout.pdf
![Page 58: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/58.jpg)
Analyze Data
Types of Data AnalysisDescriptiveExploratoryPredictive
![Page 59: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/59.jpg)
Analyze Data
Data = smooth + rough
![Page 60: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/60.jpg)
Analyze Data
Prediction with RegressionThe General Linear Model (GLM)
01ˆ XbmXY
![Page 61: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/61.jpg)
Analyze Data
![Page 62: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/62.jpg)
Analyze Data
![Page 63: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/63.jpg)
Analyze Data
Modeling datato predict a value based on knowledge of another value or values.
General Linear Model (regression)Structural Equation Modeling (SEM)
Multilevel Modeling (MLM/HLM)
If you can uncover the pattern of what was in relation to what is, you can (within reason) predict what will be.
![Page 64: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/64.jpg)
Present Data
“The greatest value of a picture is when it forces us to notice what we never expected to see.”
~ Tukey (1977, p. vi)
![Page 65: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/65.jpg)
Date of Death Name of Deceased Residence19 August 1854 Mr. Samuel Morris 34 Berwick Street21 August 1854 Miss Emma Watkins 54 Cross Street
Miss Susan Taylor 132 Broad Street24 August 1854 Mr. Franklin Ford 9 Cambridge Street
Mr. Thomas Johnson 140 Broad Street27 August 1854 Mrs. Franklin Ford 9 Cambridge Street29 August 1854 Mister Robert Taylor 132 Broad Street30 August 1854 Miss Evelyn Stromwell12 West Street
Mrs. Robert Smith 207 Broad Street31 August 1854 Mr. Stephen Maxwell Poland Street Workhouse
Mr. Frederick Stovall 55 Cross StreetMrs. Frederick Stovall55 Cross Street
![Page 66: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/66.jpg)
Deaths from cholera
0
20
40
60
80
100
120
140
19 21 23 25 27 29 31 2 4 6 8 10 12 14 16 18 20 22 24 26 28
![Page 67: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/67.jpg)
Cumulative Deaths from Cholera
0
100
200
300
400
500
600
700
19 21 23 25 27 29 31 2 4 6 8 10 12 14 16 18 20 22 24 26 28
![Page 68: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/68.jpg)
Mapping Data: 1854 London Cholera Epidemic (Snow)
![Page 69: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/69.jpg)
Representing Space and Time: Napoleon’s March on Moscow (Minard)
![Page 70: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/70.jpg)
Equalizing cartogram: 2004 Presidential election
![Page 71: Deanne W. Swan, PhD IMLS / OPRE dswan@imls](https://reader035.vdocuments.mx/reader035/viewer/2022081507/56816005550346895dcf05a5/html5/thumbnails/71.jpg)
Merry Analysis and a Happy Data Year!
Thank you!Deanne SwanSr. StatisticianIMLS / OPRE
Frank NelsonIdaho Public Libraries