Admin R Ex Tufte R
Lecture 1: Welcome to Data Visualization Using R
January 28, 2019
Take This Class So You Won’t Make This Graphic
From Trachtenberg’s 2018 magazine.
Or This One
“U.S. Metros Ranked on Black-White Income Inequality,” Next City, May 2, 2017
Instead, Aspire to This
See Tufte for citation.
Overview
Course Administration
Some R Examples
Tufte, Grandfather of Visualization
Getting Started with R
Course Administration
1. Syllabus
   • Good/bad/ugly assignments online
   • Propose and trade and notify me if you can’t do your assigned date
2. Bring a name tent to class
3. Questions/issues with readings?
4. Make sure you’re signed up for Piazza
5. Introductions
   • name and degree
   • why this course?
   • what you do now
   • what you’d like to do when you’re done
DC Gains Population Through 1950
Population Losses Start with Desegregation
Continue After Civil Disturbance
Population Turns Up After 2000
Profound Changes: Share African American by Neighborhood
1930, 1940, 1950, 1960, 1970, 1980, 1990, 2000, 2010
By Square Footage, 7th Street is Most Impacted
Roughly Half of Square Footage Damaged to Some Degree
Assessed Value of Most Improvements Drops, 1967 to 1970
Damaged Properties Lose Improvements, A Few Rebuild
1999: Damaged Properties Have Smaller Structures
2005: Damaged Properties Show Some Catch-up
2010: Damaged Properties Approaching Undamaged Ones
2018: Near Convergence of Damaged Properties
Tufte
Edward Tufte
• A quantitative political scientist
• Writing in the mid-1970s
• Became interested in visualization by working with pioneering statistician John Tukey
• Remember that this is the pre-Excel era, in which data graphics are difficult to make
An Argument for Better Visualization
Because good visualizations tell the most compelling story
Tufte’s Types of Graphs
1. Data maps
2. Time series
3. Space-time narrative designs
4. Relational graphs – the holy grail
Data Maps
• Describe the location of numbers
• This can be revealing or obfuscating
• We will make these in this class
Data Map Example
Time Series
• Time on the horizontal axis
• Something else on the vertical axis
• One of the first types of data graphics
Train, Paris to Lyon
See Tufte for citation.
Space-Time Narrative Designs
• Move over space and time at the same time
• A time series plus
Space-Time Narrative Example
Which dimensions?
1. army size
2. army location, N/S
3. army location, E/W
4. direction of movement
5. temperature
6. by date
Relational Graphics
• One variable on the vertical, another on the horizontal
• A conceptual advance in graphics
• A more sophisticated way of thinking
Relational Graphics Example
Tufte’s Main Causes of Distortion in Graphics
1. Data are bad
   • should be per capita and are not
   • data are not consistent over time
   • don’t adjust for inflation
2. Graphics are rotten
   • size doesn’t match the numbers
   • colors and styles are misleading
   • graphic fails to highlight key point
3. Graphics are irrelevant
   • too much extraneous stuff
Size and Number Don’t Match
A Continuing Problem: Graphics are Irrelevant
Tufte’s Six Rules of Graphic Integrity, 1 to 3 of 6
1. The representation of numbers, as physically measured on the surface of the graphic itself, should be directly proportional to the numerical quantities represented.
2. Clear, detailed, and thorough labeling should be used to defeat graphical distortion and ambiguity. Write out explanations of the data on the graphic itself. Label important events in the data.
3. Show data variation, not design variation.
Tufte’s Six Rules of Graphic Integrity, 4 to 6
4. In time-series displays of money, deflated and standardized units of monetary measurement are nearly always better than nominal units.
5. The number of information-carrying (variable) dimensions depicted should not exceed the number of dimensions in the data.
6. Graphics must not quote data out of context.
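Rule 4 can be made concrete with a small R sketch. Every number here is hypothetical and chosen only to show the arithmetic; the column names `nominal`, `cpi`, and `real` are assumptions for illustration, not part of the lecture. A nominal amount is converted to base-year dollars by multiplying it by the base-year price index and dividing by the index for its own year.

```r
# Hypothetical figures for illustration only: not real spending or price data
spending <- data.frame(
  year    = c(2000, 2010, 2020),
  nominal = c(100, 130, 160),  # dollars in each year's own prices
  cpi     = c(100, 125, 160)   # made-up price index
)

# Express every year in 2020 dollars:
# real = nominal * (base-year index / that year's index)
base.cpi <- spending$cpi[spending$year == 2020]
spending$real <- spending$nominal * base.cpi / spending$cpi

spending
```

In this made-up series the nominal column climbs by 60 percent while the deflated column ends where it started, which is the kind of difference the rule warns about.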
R
What is R?
• A programming language
• Developed by statisticians from New Zealand
• Open source, and therefore free
• Based on “S,” developed by Bell Labs
Strengths of R
• Free
• Open-source, so packages by all kinds of users are available
• There are frequently many ways to do the same task
• Very good graphics
• Very flexible
• Can have many datasets in memory at once
Weaknesses of R
• Not always enterprise-ready: packages break and there is no central help
• There are frequently many ways to do the same task
• Syntax can be challenging
Today’s Goals
• When you leave today, you will be able to
  • run an R script
  • create an R dataframe
  • do basic operations with an R dataframe
• Download the R tutorial for this class now.
• You’ll continue work at home on your own and turn in a problem set next lecture
Today
A. An R dataframe
B. Packages
C. Subsetting
D. Summarizing
A. An R dataframe
• a dataframe is the basic building block of data analysis in R
• R has other types of data structures, but this will be the most useful to you
• a dataframe consists of columns
• each column can be
  • numeric: 1, 2, 3.556, -2.6
  • or
  • character: “hello”, “dogs are good”, “”
• refer to rows and columns
Sample dataframe
new.dataframe <- data.frame(class = c(1,2,3),
                            subject = c("basics","merging","graphs"),
                            students = c(19,19,18))
new.dataframe
##   class subject students
## 1     1  basics       19
## 2     2 merging       19
## 3     3  graphs       18
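To check which columns are numeric and which are character, a short sketch like the following may help. It rebuilds the sample dataframe so it runs on its own. The `stringsAsFactors = FALSE` argument is an addition not shown on the slide: in R versions before 4.0.0, `data.frame()` converted text columns to factors by default, so setting it explicitly keeps `subject` as character in either version.

```r
# Rebuild the sample dataframe; stringsAsFactors = FALSE is an added assumption
# so that text stays character in older R versions as well
new.dataframe <- data.frame(class = c(1, 2, 3),
                            subject = c("basics", "merging", "graphs"),
                            students = c(19, 19, 18),
                            stringsAsFactors = FALSE)

# Type of each column
sapply(new.dataframe, class)

# Number of rows, then number of columns
dim(new.dataframe)

# Compact overview: column names, types, and first values
str(new.dataframe)
```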
Referring to parts of the dataframe
new.dataframe[ROWS,COLUMNS]
Just one column, all rows
new.dataframe[,c("students")]
## [1] 19 19 18
Just two rows, all columns
new.dataframe[1:2,]
##   class subject students
## 1     1  basics       19
## 2     2 merging       19
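A few more variations on the `[ROWS,COLUMNS]` pattern, sketched with the same sample dataframe (rebuilt here so the block is self-contained). These are standard base R forms, not taken from the slides; `stringsAsFactors = FALSE` is an added assumption that keeps `subject` as character in older R versions.

```r
new.dataframe <- data.frame(class = c(1, 2, 3),
                            subject = c("basics", "merging", "graphs"),
                            students = c(19, 19, 18),
                            stringsAsFactors = FALSE)

# One column by name with $: same values as new.dataframe[,c("students")]
new.dataframe$students

# One cell: row 2 of the "subject" column
new.dataframe[2, "subject"]

# Several columns, all rows: the result is still a dataframe
new.dataframe[, c("class", "students")]

# drop = FALSE keeps a single column as a dataframe rather than a vector
new.dataframe[, "students", drop = FALSE]
```

Leaving the ROWS or COLUMNS slot empty means "all of them", which is why the comma stays even when only one side is filled in.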
B. Packages
• there is “Base R,” which is a set of basic commands
• and user-written packages that add functionality
• some packages are maintained by teams, frequently updated, and do many things
• some are one-function add-ins
• most famous are those by Hadley Wickham
• today we’ll use his “plyr” package
Installing packages
• install a package once
install.packages("plyr", dependencies = TRUE)
• call a package at the beginning of any program in which you’d like to use the package
library(plyr)
• this makes commands accessible
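One way to avoid reinstalling a package every time a script runs is to check for it first. The helper below is a hypothetical sketch (the name `ensure_package` is invented for illustration); it relies only on the base R functions `requireNamespace()` and `install.packages()`. It is demonstrated on `stats`, which ships with R, so running it installs nothing.

```r
# Hypothetical helper: install a package only if it is missing,
# then report whether it can be loaded
ensure_package <- function(pkg) {
  if (!requireNamespace(pkg, quietly = TRUE)) {
    install.packages(pkg, dependencies = TRUE)
  }
  requireNamespace(pkg, quietly = TRUE)
}

# "stats" comes with R, so this call does not trigger an installation
ensure_package("stats")

# A function can also be called without library() by naming its package
stats::median(c(19, 19, 18))

# For this class the same pattern would be:
# ensure_package("plyr")
# library(plyr)
```

The `package::function` form is also handy when two loaded packages define a function with the same name.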
C. Subsetting
• sometimes you want to work with something smaller than the whole dataframe
• create a new dataframe that has only part of the big one
• here we keep just classes 1 and 2 (the first two rows)
df.smaller <- new.dataframe[1:2,]
df.smaller
##   class subject students
## 1     1  basics       19
## 2     2 merging       19
Subset by attributes
• take only classes with 19 students
df19 <- new.dataframe[which(new.dataframe$students == 19),]
df19
##   class subject students
## 1     1  basics       19
## 2     2 merging       19
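The same subset can be written in a few other ways in base R. This sketch rebuilds the sample dataframe so it runs on its own; the object names `keep`, `df19.logical`, `df19.subset`, and `df.both` are made up for illustration, and `stringsAsFactors = FALSE` is an added assumption for older R versions.

```r
new.dataframe <- data.frame(class = c(1, 2, 3),
                            subject = c("basics", "merging", "graphs"),
                            students = c(19, 19, 18),
                            stringsAsFactors = FALSE)

# The condition alone gives one TRUE or FALSE per row
keep <- new.dataframe$students == 19
keep

# A logical vector can go straight into the ROWS slot, without which()
df19.logical <- new.dataframe[keep, ]

# subset() lets you name columns without the new.dataframe$ prefix
df19.subset <- subset(new.dataframe, students == 19)

# Combine conditions with & (and) or | (or)
df.both <- new.dataframe[new.dataframe$students == 19 &
                           new.dataframe$class > 1, ]
df.both
```

With complete data all three forms agree. When the column has missing values, `which()` and `subset()` drop rows where the condition is `NA`, while bare logical indexing returns a row of `NA`s for each of them.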
D. Summarizing
• frequently, you’d like to know something at a level of aggregation not in your dataset
• in our case, maybe average attendance
• make a new dataframe with this information
• use plyr library
Summarize data: load dplyr package
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:plyr':
##
##     arrange, count, desc, failwith, id, mutate, rename, summarise,
##     summarize
## The following objects are masked from 'package:stats':
##
##     filter, lag
## The following objects are masked from 'package:base':
##
##     intersect, setdiff, setequal, union
Making a new dataset that is a function of the old one
av.attendance <- summarize(new.dataframe,
                           av.at = mean(students, na.rm = TRUE))
av.attendance
##      av.at
## 1 18.66667
• more complicated example in tutorial
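For comparison, here is a base R sketch of the same idea that needs no package. It is not the tutorial's example. The `term` column is hypothetical, added only so there is something to group by, and the names `av.attendance.base` and `av.by.term` are made up for illustration.

```r
new.dataframe <- data.frame(class = c(1, 2, 3),
                            subject = c("basics", "merging", "graphs"),
                            students = c(19, 19, 18),
                            stringsAsFactors = FALSE)

# Overall average, built by hand as a one-row dataframe
av.attendance.base <- data.frame(
  av.at = mean(new.dataframe$students, na.rm = TRUE)
)
av.attendance.base

# Hypothetical grouping column, added only to illustrate a grouped summary
new.dataframe$term <- c("fall", "fall", "spring")

# Average students per term: one row per group
av.by.term <- aggregate(students ~ term, data = new.dataframe, FUN = mean)
av.by.term
```

The overall average matches the 18.66667 shown above. The grouped version is where a summary at a different level of aggregation becomes useful, since the result has one row per term rather than one row per class.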
Next Lecture
• Turn in PS 1, which is at the end of the tutorial
• Read Few Chapters 3 and 5
• Look at “Smart Graph Chart”