Introduction to Probability and Data with R (Coursera)

Offered by Duke University,
Introduction to Probability and Data with R (Coursera)

This course introduces you to sampling and exploring data, as well as basic probability theory and Bayes' rule. You will examine various types of sampling methods, and discuss how such methods can impact the scope of inference. A variety of exploratory data analysis techniques will be covered, including numeric summary statistics and basic data visualization.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

You will be guided through installing and using R and RStudio (free statistical software), and will use this software for lab exercises and a final project. The concepts and techniques in this course will serve as building blocks for the inference and modeling courses in the Specialization.
Course 1 of 5 in the Statistics with R Specialization.

Syllabus

WEEK 1
About Introduction to Probability and Data
This course introduces you to sampling and exploring data, as well as basic probability theory. You will examine various types of sampling methods and discuss how such methods can impact the utility of a data analysis. The concepts in this module will serve as building blocks for our later courses.Each lesson comes with a set of learning objectives that will be covered in a series of short videos. Supplementary readings and practice problems will also be suggested from OpenIntro Statistics, 3rd Edition (a free online introductory statistics textbook, that I co-authored). There will be weekly quizzes designed to assess your learning and mastery of the material covered that week in the videos. In addition, each week will also feature a lab assignment, in which you will use R to apply what you are learning to real data. There will also be a data analysis project designed to enable you to answer research questions of your own choosing.
Since this is a Coursera course, you are welcome to participate as much or as little as you’d like, though I hope that you will begin by participating fully. One of the most rewarding aspects of a Coursera course is participation in forum discussions about the course materials. Please take advantage of other students' feedback and insight and contribute your own perspective where you see fit to do so.
Introduction to Data
Welcome to Introduction to Probability and Data! I hope you are just as excited about this course as I am! In the next five weeks, we will learn about designing studies, explore data via numerical summaries and visualizations, and learn about rules of probability and commonly used probability distributions.

WEEK 2
Exploratory Data Analysis and Introduction to Inference
Welcome to Week 2 of Introduction to Probability and Data! Hope you enjoyed materials from Week 1. This week we will delve into numerical and categorical data in more depth, and introduce inference.

WEEK 3
Introduction to Probability
Welcome to Week 3 of Introduction to Probability and Data! Last week we explored numerical and categorical data. This week we will discuss probability, conditional probability, the Bayes’ theorem, and provide a light introduction to Bayesian inference. Thank you for your enthusiasm and participation, and have a great week! I’m looking forward to working with you on the rest of this course.

WEEK 4
Probability Distributions
Great work so far! Welcome to Week 4 - the last content week of Introduction to Probability and Data! This week we will introduce two probability distributions: the normal and the binomial distributions in particular. As usual, you can evaluate your knowledge in this week's quiz. There will be no labs for this week.

WEEK 5
Data Analysis Project
Well done! You have reached the last week of Introduction to Probability and Data! There will not be any new videos in this week, instead, you will be asked to complete an initial data analysis project with a real-world data set. The project is designed to help you discover and explore research questions of your own, using real data and statistical methods we learn in this class. The project will be graded via peer assessments, meaning that you will need to evaluate three peers' projects after submitting your own.
Get started with your data analysis in this week! It should be interesting and very exciting!

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Principles of fMRI 2 (Coursera) Coursera
Johns Hopkins University,University of Colorado Boulder

Principles of fMRI 2 (Coursera)

Functional Magnetic Resonance Imaging (fMRI) is the most widely used technique for investigating the living, functioning human brain as people perform tasks and experience mental states. It is a convergence point for multidisciplinary work from many disciplines. Psychologists, statisticians, physicists, computer scientists, neuroscientists, medical researchers, behavioral scientists, engineers, public health researchers, biologists, and others are coming together to advance our understanding of the human mind and brain. This course covers the analysis of Functional Magnetic Resonance Imaging (fMRI) data.

Jun 1st 2026
4 Weeks
Statistical Inference (Coursera) Coursera
Johns Hopkins University

Statistical Inference (Coursera)

Statistical inference is the process of drawing conclusions about populations or scientific truths from data. There are many modes of performing inference including statistical modeling, data oriented strategies and explicit use of designs and randomization in analyses. Furthermore, there are broad theories (frequentists, Bayesian, likelihood, design based, …) and numerous complexities (missing data, observed and unobserved confounding, biases) for performing inference.

Jun 1st 2026
4 Weeks
Designing, Running, and Analyzing Experiments (Coursera) Coursera
University of California, San Diego

Designing, Running, and Analyzing Experiments (Coursera)

You may never be sure whether you have an effective user experience until you have tested it with users. In this course, you’ll learn how to design user-centered experiments, how to run such experiments, and how to analyze data from these experiments in order to evaluate and validate user experiences. You will work through real-world examples of experiments from the fields of UX, IxD, and HCI, understanding issues in experiment design and analysis.

Jun 1st 2026
5-12 Weeks
Digital Marketing Analytics in Practice (Coursera) Coursera
University of Illinois at Urbana-Champaign

Digital Marketing Analytics in Practice (Coursera)

Successfully marketing brands today requires a well-balanced blend of art and science. This course introduces students to the science of web analytics while casting a keen eye toward the artful use of numbers found in the digital space. The goal is to provide the foundation needed to apply data analytics to real-world challenges marketers confront daily. Students will learn to identify the web analytic tool right for their specific needs; understand valid and reliable ways to collect, analyze, and visualize data from the web; and utilize data in decision making for agencies, organizations or clients.

Jun 1st 2026
4 Weeks
Reproducible Research (Coursera) Coursera
Johns Hopkins University

Reproducible Research (Coursera)

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations.

Jun 1st 2026
4 Weeks
The Data Scientist's Toolbox (Coursera) Coursera
Johns Hopkins University

The Data Scientist's Toolbox (Coursera)

In this course you will get an introduction to the main tools and ideas in the data scientist's toolbox. The course gives an overview of the data, questions, and tools that data analysts and data scientists work with. There are two components to this course. The first is a conceptual introduction to the ideas behind turning data into actionable knowledge. The second is a practical introduction to the tools that will be used in the program like version control, markdown, git, GitHub, R, and RStudio.

Jun 1st 2026
4 Weeks
Data Science Companion (Coursera) Coursera
MathWorks

Data Science Companion (Coursera)

The Data Science Companion provides an introduction to data science. You will gain a quick background in data science and core machine learning concepts, such as regression and classification. You’ll be introduced to the practical knowledge of data processing and visualization using low-code solutions, as well as an overview of the ways to integrate multiple tools effectively to solve data science problems.

Jun 5th 2026
4 Weeks
Bioinformatic Methods I (Coursera) Coursera
University of Toronto

Bioinformatic Methods I (Coursera)

Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.

Jun 1st 2026
5-12 Weeks
Foundations of marketing analytics (Coursera) Coursera
ESSEC Business School

Foundations of marketing analytics (Coursera)

Who is this course for? This course is designed for students, business analysts, and data scientists who want to apply statistical knowledge and techniques to business contexts. For example, it may be suited to experienced statisticians, analysts, engineers who want to move more into a business role, in particular in marketing. You will find this course exciting and rewarding if you already have a background in statistics, can use R or another programming language and are familiar with databases and data analysis techniques such as regression, classification, and clustering. However, it contains a number of recitals and R Studio tutorials which will consolidate your competences, enable you to play more freely with data and explore new features and statistical functions in R.

Jun 1st 2026
5-12 Weeks
Data Analysis Tools (Coursera) Coursera
Wesleyan University

Data Analysis Tools (Coursera)

In this course, you will develop and test hypotheses about your data. You will learn a variety of statistical tests, as well as strategies to know how to apply the appropriate one to your specific data and question. Using your choice of two powerful statistical software packages (SAS or Python), you will explore ANOVA, Chi-Square, and Pearson correlation analysis. This course will guide you through basic statistical principles to give you the tools to answer questions you have developed. Throughout the course, you will share your progress with others to gain valuable feedback and provide insight to other learners about their work.

Jun 1st 2026
4 Weeks
Regression Models (Coursera) Coursera
Johns Hopkins University

Regression Models (Coursera)

Linear models, as their name implies, relates an outcome to a set of predictors of interest using linear assumptions. Regression models, a subset of linear models, are the most important statistical analysis tool in a data scientist’s toolkit. This course covers regression analysis, least squares and inference using regression models.

Jun 1st 2026
4 Weeks