Data Transformation I
Week 09, Fall 2024
- Start: Monday, October 21
- End: Friday, October 25
Summary
This week we will dive into data visualization and transformation. In doing so, we will move from using strictly Base R, to supplementing Base R with the tidyverse.
Learning Objectives
After completing this week, you are expected to be able to:
- Understand the difference between a tibbleand adata.frame.
- Visualize data using ggplot2.
- Transform data using dplyr, specifically using the single table verbs:- selectto pick columns (variables) based on their names
- filterto pick rows (observations) based on their values
- mutateto add new columns using functions of existing variables
- summarizeto create single number statistical summaries of columns
- arrangeto change the ordering of rows
 
Reading
Additional Reading
| Link | Source | 
|---|---|
| tibbleVignette | tibbleDocumentation | 
| Tidy Data Vignette | tidyrDocumentation | 
Cheatsheets
| Link | Source | 
|---|---|
| ggplot2 | Posit Cheatsheets | 
| dplyr | Posit Cheatsheets | 
| readr | Posit Cheatsheets | 
| tidyr | Posit Cheatsheets | 
Data
- Package: palmerpenguins
- Package: dplyr::starwars
- Package: Lahman
- Package: nycflights13
Video
| Title | Link | Mirror | 
|---|---|---|
| 9.1 - Welcome to Week 09 | 9.1 - YouTube | 9.1 - ClassTranscribe | 
| 9.2 - Data and Tibbles | 9.2 - YouTube | 9.2 - ClassTranscribe | 
| 9.3 - Data Visualization with ggplot2 | 9.3 - YouTube | 9.3 - ClassTranscribe | 
| 9.4 - Data Manipulation with dplyr | 9.4 - YouTube | 9.4 - ClassTranscribe | 
| 9.5 - Lab 06 | 9.5 - YouTube | 9.5 - ClassTranscribe | 
Assignments
| Assignment | Deadline | Credit | 
|---|---|---|
| Lab 05 | Thursday, October 24 | 100% | 
| Quiz 05 | Thursday, October 24 | 105% | 
| Lab 06 | Thursday, October 31 | 100% | 
| Quiz 06 | Thursday, October 31 | 105% | 
Office Hours
See Syllabus!