Coursera

Introduction to the Tidyverse (Coursera)

Offered by Johns Hopkins University,

This course introduces a powerful set of data science tools known as the Tidyverse. The Tidyverse has revolutionized the way in which data scientists do almost every aspect of their job. We will cover the simple idea of "tidy data" and how this idea serves to organize data for analysis and modeling. We will also cover how non-tidy can be transformed to tidy data, the data science project life cycle, and the ecosystem of Tidyverse R packages that can be used to execute a data science project.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

If you are new to data science, the Tidyverse ecosystem of R packages is an excellent way to learn the different aspects of the data science pipeline, from importing the data, tidying the data into a format that is easy to work with, exploring and visualizing the data, and fitting machine learning models. If you are already experienced in data science, the Tidyverse provides a power system for streamlining your workflow in a coherent manner that can easily connect with other data science tools.
In this course it is important that you be familiar with the R programming language. If you are not yet familiar with R, we suggest you first complete R Programming before returning to complete this course.

What You Will Learn

Distinguish between tidy and non-tidy data
Describe how non-tidy data can be transformed into tidy data
Describe the Tidyverse ecosystem of packages
Organize and initialize a data science project

Course 1 of 5 in the Tidyverse Skills for Data Science in R Specialization

Syllabus

WEEK 1
Tidy Data
Before we can discuss all the ways in which R makes it easy to work with tidy data, we have to first be sure we know what tidy data are. Tidy datasets, by design, are easier to manipulate, model, and visualize because the tidy data principles that we’ll discuss in this course impose a general framework and a consistent set of rules on data. In fact, a well-known quote from Hadley Wickham is that “tidy datasets are all alike but every messy dataset is messy in its own way.” Utilizing a consistent tidy data format allows for tools to be built that work well within this framework, ultimately simplifying the data wrangling, visualization and analysis processes. By starting with data that are already in a tidy format or by spending the time at the beginning of a project to get data into a tidy format, the remaining steps of your data science project will be easier.

WEEK 2
From Non-Tidy –>Tidy
The reason it’s important to discuss what tidy data are an what they look like is because out in the world, most data are untidy. If you are not the one entering the data but are instead handed the data from someone else to do a project, more often than not, those data will be untidy. Untidy data are often referred to simply as messy data. In order to work with these data easily, you’ll have to get them into a tidy data format. This means you’ll have to fully recognize untidy data and understand how to get data into a tidy format. The following common problems seen in messy datasets again come from Hadley Wickham’s paper on tidy data. After briefly reviewing what each common problem is, we will then take a look at a few messy datasets. We’ll finally touch on the concepts of tidying untidy data, but we won’t actually do any practice yet. That’s coming soon!
The Data Science Life Cycle & Tidyverse Ecosystem
With a solid understanding of tidy data and how tidy data fit into the data science life cycle, we’ll take a bit of time to introduce you to the tidyverse and tidyverse-adjacent packages that we’ll be teaching and using throughout this specialization. Taken together, these packages make up what we’re referring to as the tidyverse ecosystem. The purpose for the rest of this course is not for you to understand how to use each of these packages (that’s coming soon!), but rather to help you familiarize yourself with which packages fit into which part of the data science life cycle.
Note that the official tidyverse packages below are bold. All other packages are tidyverse-adjacent, meaning they follow the same conventions as the official tidyverse packages and work well within the tidy framework and structure of data analysis.

WEEK 3
Data Science Project Organization & Workflows
Data science projects vary quite a lot so it can be difficult to give universal rules for how they should be organized. However, there are a few ways to organize projects that are commonly useful. In particular, almost all projects have to deal with files of various sorts—data files, code files, output files, etc. This section talks about how files work and how projects can be organized and customized.
Case Studies
Throughout this specialization, we’re going to make use of a number of case studies from Open Case Studies to demonstrate the concepts introduced in the course. We’ll generally make use of the same case studies throughout the specialization, providing continuity to allow you to focus on the concepts and skills being taught (rather than the context) while working with interesting data. These case studies aim to address a public-health question and all of them use real data.

WEEK 4
Project: Organizing a New Data Science Project
This project will allow you to create a new project and organize the files that will be needed to engage in a future data analysis

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

Edureka

Fundamentals of Data Governance (Coursera)

CS: Information & Technology

Welcome to the Fundamentals of Data Governance course. This program will equip you with fundamental knowledge in data governance, enabling you to create and apply an all-encompassing framework for data governance that ensures data integrity, security, adherence to regulations, and proficient data handling in a company.

Aug 10th 2026

1 Week

Data Management Data Security Data Quality

Coursera

University of California, Santa Cruz

Bayesian Statistics: Techniques and Models (Coursera)

Statistics & Data Analysis Data Science

This is the second of a two-course sequence introducing the fundamentals of Bayesian statistics. It builds on the course Bayesian Statistics: From Concept to Data Analysis, which introduces Bayesian methods through use of simple conjugate models. Real-world data often require more sophisticated models to reach realistic conclusions. This course aims to expand our “Bayesian toolbox” with more general models, and computational techniques to fit them.

Aug 10th 2026

5-12 Weeks

Monte Carlo Method ANOVA Statistical Modeling

Coursera

University of Illinois at Urbana-Champaign

3D Data Visualization for Science Communication (Coursera)

Data Science

This course is an introduction to 3D scientific data visualization, with an emphasis on science communication and cinematic design for appealing to broad audiences. You will develop visualization literacy, through being able to interpret/analyze (read) visualizations and create (write) your own visualizations.

Aug 10th 2026

4 Weeks

3D Communication Computer Graphics

Coursera

Erasmus University Rotterdam

Econometrics: Methods and Applications (Coursera)

Statistics & Data Analysis Data Science

Do you wish to know how to analyze and solve business and economic questions with data analysis tools? Then Econometrics by Erasmus University Rotterdam is the right course for you, as you learn how to translate data into models to make forecasts and to support decision making.

Aug 10th 2026

5-12 Weeks

Econometrics Data Analysis Linear Regression

Coursera

University of Colorado Boulder

Statistical Inference and Hypothesis Testing in Data Science Applications (Coursera)

Statistics & Data Analysis Data Science

This course will focus on theory and implementation of hypothesis testing, especially as it relates to applications in data science. Students will learn to use hypothesis tests to make informed decisions from data. Special attention will be given to the general logic of hypothesis testing, error and error rates, power, simulation, and the correct computation and interpretation of p-values. Attention will also be given to the misuse of testing concepts, especially p-values, and the ethical implications of such misuse.

Aug 10th 2026

5-12 Weeks

Statistics Hypothesis Statistical Inference

Coursera

University of Edinburgh

Data Science in Stratified Healthcare and Precision Medicine (Coursera)

Health & Society Statistics & Data Analysis

An increasing volume of data is becoming available in biomedicine and healthcare, from genomic data, to electronic patient records and data collected by wearable devices. Recent advances in data science are transforming the life sciences, leading to precision medicine and stratified healthcare. In this course, you will learn about some of the different types of data and computational methods involved in stratified healthcare and precision medicine.

Aug 10th 2026

5-12 Weeks

Medicine Healthcare Machine Learning

Coursera

Knowledge Accelerators

From Excel to Power BI (Coursera)

CS: Information & Technology

Are you using Excel to manage, analyze, and visualize your data? Would you like to do more? Perhaps you've considered Power BI as an alternative, but have been intimidated by the idea of working in an advanced environment. The fact is, many of the same tools and mechanisms exist across both these Microsoft products. This means Excel users are actually uniquely positioned to transition to data modeling and visualization in Power BI! Using methods that will feel familiar, you can learn to use Power BI to make data-driven business decisions using large volumes of data.

Aug 10th 2026

1 Week

Excel Data Analysis Microsoft Excel

Coursera

University of Illinois at Urbana-Champaign

Visualization for Data Journalism (Coursera)

Statistics & Data Analysis Data Science

While telling stories with data has been part of the news practice since its earliest days, it is in the midst of a renaissance. Graphics desks which used to be deemed as “the art department,” a subfield outside the work of newsrooms, are becoming a core part of newsrooms’ operation. Those people (they often have various titles: data journalists, news artists, graphic reporters, developers, etc.) who design news graphics are expected to be full-fledged journalists and work closely with reporters and editors.

Aug 10th 2026

5-12 Weeks

Python Storytelling Data Analysis

Coursera

FIA Business School

Ferramentas para Ciência de Dados: Introdução ao R (Coursera)

Statistics & Data Analysis

Nossas boas-vindas ao Curso Ferramentas para Ciência de Dados: Introdução ao R. Neste curso, você aprenderá que o mundo evoluiu muito quando o assunto é tomada de decisão baseada em dados e já não é possível comparar a quantidade de informações a que temos acesso atualmente com o que tínhamos disponíveis décadas atrás.

Aug 10th 2026

4 Weeks

Data Science R Language R Programming

Coursera

Knowledge Accelerators

The ABC's of GDPR: Protecting Privacy in an Online World (Coursera)

Business

This course is aimed at business professionals, web designers, and other interested professionals who collect, process, use, or otherwise encounter personal data subject to the European Union's General Data Protection Regulation. By the end of this course, learners will be familiar with roles involved in assuring compliance with GDPR standards, the rights of GDPR data subjects, responsibilities of organizations regarding GDPR and the consequences for failure to comply.

Aug 10th 2026

1 Week

Data Management Privacy Data Privacy

Coursera

The Hong Kong University of Science and Technology - HKUST

Python and Statistics for Financial Analysis (Coursera)

Economics & Finance Business

Python is now becoming the number 1 programming language for data science. Due to python’s simplicity and high readability, it is gaining its importance in the financial industry. The course combines both python coding and statistical concepts and applies into analyzing financial data, such as stock data.

Aug 10th 2026

4 Weeks

Python Statistics Inference

Coursera

Rutgers University

Supply Chain Analytics Essentials (Coursera)

Business

In this introductory course to Supply Chain Analytics, I will take you on a journey to this fascinating area where supply chain management meets data analytics. You will learn real life examples on how analytics can be applied to various domains of a supply chain, from selling, to logistics, production and sourcing, to generate a significant social / economic impact.

Aug 10th 2026

4 Weeks

Data Analysis Supply Chain Supply Chain Analytics