This workshop introduces R tools for data management.
We focus on functions found in the tidyverse collection of packages, but also include tools found in base R and other packages.
We will cover the following topics:
Review of dataframes
Review of functions
Data cleaning and missing values
Data management with dplyr
Combining datasets
Tidy data with tidyr
Managing dates with lubridate
Managing strings with stringr
Factors with forcats
tidyverse: a collection of packages with tools for most aspects of data analysis, particularly strong in data import, management, and visualization. We will focus on the following packages within tidyverse:
readr - updated data import functions like read_csv()
dplyr - subsetting, sorting, transforming variables, grouping
tidyr - restructuring rows and columns
lubridate - date and time variable processing
stringr - string variable manipulation
forcats - factor variable manipulation
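
As a rough preview of how these packages fit together, the sketch below uses one function from each on a small made-up dataset. The data, the object names (visits, clean, long), and the column names are hypothetical and chosen only for illustration. Passing literal text to read_csv() through I() assumes readr 2.0 or later.

```r
# Load the six tidyverse packages covered in this workshop.
library(readr)
library(dplyr)
library(tidyr)
library(lubridate)
library(stringr)
library(forcats)

# readr: read_csv() normally takes a file path; here we pass literal text
# via I() (assumes readr >= 2.0). The data are invented for illustration.
visits <- read_csv(
  I("id,name,visit_date,dept,cost
1,ann,03/15/2021,cardio,120
2,bob,04/02/2021,neuro,
3,cara,04/20/2021,cardio,95
"),
  show_col_types = FALSE
)

clean <- visits %>%
  mutate(
    name = str_to_title(name),           # stringr: capitalize names
    visit_date = mdy(visit_date),        # lubridate: parse month/day/year text
    visit_month = month(visit_date),     # lubridate: extract month number
    dept = fct_infreq(factor(dept))      # forcats: order levels by frequency
  ) %>%
  filter(!is.na(cost)) %>%               # dplyr: drop rows with missing cost
  arrange(desc(cost))                    # dplyr: sort by cost, largest first

# tidyr: restructure two numeric columns into name/value pairs (long format).
long <- clean %>%
  pivot_longer(
    cols = c(cost, visit_month),
    names_to = "measure",
    values_to = "value"
  )

print(clean)
print(long)
```

Each step here is covered in more detail in its own section of the workshop, so this is only meant to show the general shape of a tidyverse workflow.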