Data – What It Is, What We Can Do With It (Coursera)

Data – What It Is, What We Can Do With It (Coursera)

This course introduces students to data and statistics. By the end of the course, students should be able to interpret descriptive statistics, causal analyses and visualizations to draw meaningful insights. The course first introduces a framework for thinking about the various purposes of statistical analysis. We’ll talk about how analysts use data for descriptive, causal and predictive inference. We’ll then cover how to develop a research study for causal analysis, compute and interpret descriptive statistics and design effective visualizations. The course will help you to become a thoughtful and critical consumer of analytics.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

If you are in a field that increasingly relies on data-driven decision making, but you feel unequipped to interpret and evaluate data, this course will help you develop these fundamental tools of data literacy.
Course 1 of 5 in the Data Literacy Specialization

Syllabus

WEEK 1
Data and Theories
When most people think about using data, they quickly jump to considering the best way to analyze it with statistical methods. A good analysis, however, begins with a strong theoretical framework. A good theory will guide the collection of data, selection of appropriate statistical methods and interpretation of the results. Further, the theory will determine what kind of research design is needed, such as an observational study or experiment. This module will focus on the development of high-quality theories that can be used to guide descriptive, causal and predictive inference.

WEEK 2
The Causality Framework
Establishing causality is frequently the primary motivation for research. Policymakers often want to understand how the implementation of a new program or other policy tool will affect an outcome of interest. Will smaller class sizes increase student learning? Will the implementation of stricter background checks for gun buyers reduce gun violence? Biomedical researchers often want to understand whether a new medicine will improve a disease outcome. Will taking a drug improve life expectancy, or even cure the disease under study? To answer these and similar questions, analysts must develop research designs that are appropriate for causal inference. Estimating a causal effect is challenging, yet it is essential to understand the impacts of a policy, medicine or any other kind of intervention.

WEEK 3
Descriptive Statistics
Over the next four lessons we'll begin to make sense of raw data. Staring at raw data, such as a spreadsheet, does not reveal much of anything about the key takeaway points. Consider a variable such as a survey question that asks about the level of discrimination in the U.S. (where the answer choices are "a lot," "some," "only a little," "none at all," and "don't know"). Reading the raw data does not tell you about the average respondent or the distribution of responses among the possible answer choices. To better understand the shape of the distribution, we can calculate measures of central tendency, measures of spread and characterize the data's dispersion. These summary statistics allow a researcher to draw some simple yet powerful initial conclusions about what the data tell us in a real-world sense.

WEEK 4
Visualizations
Edward Tufte, a world-renowned expert of data visualization, once said, "There is no such thing as information overload. There is only bad design." When communicating the results of an analysis, and particularly when trying to persuade an audience, a picture is truly worth a thousand words. A well-designed graph can leverage either a small or large amount of data to make a convincing argument. Data visualizations highlight specific points about the underlying information and enable the viewer to draw insights that are nearly invisible when staring at the numbers alone. In short, to be a good at communicating with data, you must become skilled at visualizing data.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Advanced Linear Models for Data Science 1: Least Squares (Coursera) Coursera
Johns Hopkins University

Advanced Linear Models for Data Science 1: Least Squares (Coursera)

Welcome to the Advanced Linear Models for Data Science Class 1: Least Squares. This class is an introduction to least squares from a linear algebraic and mathematical perspective. Before beginning the class make sure that you have the following: a basic understanding of linear algebra and multivariate calculus; a basic understanding of statistics and regression models; at least a little familiarity with proof based mathematics; basic knowledge of the R programming language.

Jun 1st 2026
5-12 Weeks
Data Visualization and Communication with Tableau (Coursera) Coursera
Duke University

Data Visualization and Communication with Tableau (Coursera)

One of the skills that characterizes great business data analysts is the ability to communicate practical implications of quantitative analyses to any kind of audience member. Even the most sophisticated statistical analyses are not useful to a business if they do not lead to actionable advice, or if the answers to those business questions are not conveyed in a way that non-technical people can understand. In this course you will learn how to become a master at communicating business-relevant implications of data analyses.

Jun 1st 2026
5-12 Weeks
The R Programming Environment (Coursera) Coursera
Johns Hopkins University

The R Programming Environment (Coursera)

This course provides a rigorous introduction to the R programming language, with a particular focus on using R for software development in a data science setting. Whether you are part of a data science team or working individually within a community of developers, this course will give you the knowledge of R needed to make useful contributions in those settings.

Jun 1st 2026
4 Weeks
Data Management and Visualisation (Coursera) Coursera
Wesleyan University

Data Management and Visualisation (Coursera)

Whether being used to customize advertising to millions of website visitors or streamline inventory ordering at a small restaurant, data is becoming more integral to success. Too often, we’re not sure how use data to find answers to the questions that will make us more successful in what we do. In this course, you will discover what data is and think about what questions you have that can be answered by the data – even if you’ve never thought about data before. Based on existing data, you will learn to develop a research question, describe the variables and their relationships, calculate basic statistics, and present your results clearly.

Jun 1st 2026
4 Weeks
Managing Data Analysis (Coursera) Coursera
Johns Hopkins University

Managing Data Analysis (Coursera)

This one-week course describes the process of analyzing data and how to manage that process. We describe the iterative nature of data analysis and the role of stating a sharp question, exploratory data analysis, inference, formal statistical modeling, interpretation, and communication. In addition, we will describe how to direct analytic activities within a team and to drive the data analysis process towards coherent and useful results.

Jun 1st 2026
1 Week
Python Project for Data Science (Coursera) Coursera
IBM

Python Project for Data Science (Coursera)

This mini-course is intended to for you to demonstrate foundational Python skills for working with data. The completion of this course involves working on a hands-on project where you will develop a simple dashboard using Python. This course is part of the IBM Data Science Professional Certificate and the IBM Data Analytics Professional Certificate.

Jun 4th 2026
1 Week
The Structured Query Language (SQL) (Coursera) Coursera
University of Colorado Boulder

The Structured Query Language (SQL) (Coursera)

In this course you will learn all about the Structured Query Language ("SQL".) We will review the origins of the language and its conceptual foundations. But primarily, we will focus on learning all the standard SQL commands, their syntax, and how to use these commands to conduct analysis of the data within a relational database. Our scope includes not only the SELECT statement for retrieving data and creating analytical reports, but also includes the DDL ("Data Definition Language") and DML ("Data Manipulation Language") commands necessary to create and maintain database objects.

Jun 2nd 2026
5-12 Weeks
Statistical Inference (Coursera) Coursera
Johns Hopkins University

Statistical Inference (Coursera)

Statistical inference is the process of drawing conclusions about populations or scientific truths from data. There are many modes of performing inference including statistical modeling, data oriented strategies and explicit use of designs and randomization in analyses. Furthermore, there are broad theories (frequentists, Bayesian, likelihood, design based, …) and numerous complexities (missing data, observed and unobserved confounding, biases) for performing inference.

Jun 1st 2026
4 Weeks
Introduction to Machine Learning (Coursera) Coursera
Duke University

Introduction to Machine Learning (Coursera)

This course will provide you a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) as well as demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jun 5th 2026
5-12 Weeks
Data Science in Real Life (Coursera) Coursera
Johns Hopkins University

Data Science in Real Life (Coursera)

Have you ever had the perfect data science experience? The data pull went perfectly. There were no merging errors or missing data. Hypotheses were clearly defined prior to analyses. Randomization was performed for the treatment of interest. The analytic plan was outlined prior to analysis and followed exactly. The conclusions were clear and actionable decisions were obvious. Has that every happened to you? Of course not. Data analysis in real life is messy. How does one manage a team facing real data analyses? In this one-week course, we contrast the ideal with what happens in real life. By contrasting the ideal, you will learn key concepts that will help you manage real life analyses.

Jun 1st 2026
1 Week