MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.
Course 2 of 3 in the Expressway to Data Science: R Programming and Tidyverse Specialization.
What You Will Learn
- You will learn to identify and describe tidy data and transform a non-tidy data set to be tidy in R.
- You will learn to analyze data between multiple related data tables.
- You will be learn to apply regular expressions to detect patterns in strings and return matches and replace patterns with new values.
Syllabus
WEEK 1
Projects, Tibbles and Importing Data
When analyzing data, you will often be required to import data from CSV or txt files. In this module, you will learn how to import and parse data in base R and the readr library, a package in the Tidyverse. You will also be introduced to R projects, which help store and organize data files associated with an analysis.
WEEK 2
Tidying Data
Data are stored in tabular forms and are often organized differently depending on its use. In this module, you will learn how to reorganize data to produce a "tidy" data set, where every variable is stored in its own column, every observation is stored in its own row, and each value is stored in a table cell.
WEEK 3
Relational Data
Data analysis rarely involves a single data table and you will be required to combine multiple related tables to answer questions you are interested in. In this module, you will learn and practice mutating variables and filtering observations from relational data.
WEEK 4
String Manipulation and Regular Expressions
This module will introduce string manipulation in R. You will learn the basics of strings, including string creation, merging, and subsetting. Then, you will use regular expressions to describe and view patterns in strings.
WEEK 5
Categorical Variables and Factors
In the last module of the course, you will use the forcats package in the tidyverse to work with categorical variables, variables that have discrete values. The forcats package introduces factors - data objects used to categorize the data in levels. You will practice creating and modifying factors.
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.