Basic Statistics (Coursera)

Basic Statistics (Coursera)

Understanding statistics is essential to understand research in the social and behavioral sciences. In this course you will learn the basics of statistics; not just how to calculate them, but also how to evaluate them. This course will also prepare you for the next course in the specialization - the course Inferential Statistics. In the first part of the course we will discuss methods of descriptive statistics. You will learn what cases and variables are and how you can compute measures of central tendency (mean, median and mode) and dispersion (standard deviation and variance). Next, we discuss how to assess relationships between variables, and we introduce the concepts correlation and regression.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

The second part of the course is concerned with the basics of probability: calculating probabilities, probability distributions and sampling distributions. You need to know about these things in order to understand how inferential statistics work.
The third part of the course consists of an introduction to methods of inferential statistics - methods that help us decide whether the patterns we see in our data are strong enough to draw conclusions about the underlying population we are interested in. We will discuss confidence intervals and significance tests.
You will not only learn about all these statistical concepts, you will also be trained to calculate and generate these statistics yourself using freely available statistical software.
Course 3 of 5 in the Methods and Statistics in Social Sciences Specialization.

Syllabus

WEEK 1
Before we get started...
In this module we'll consider the basics of statistics. But before we start, we'll give you a broad sense of what the course is about and how it's organized. Are you new to Coursera or still deciding whether this is the course for you? Then make sure to check out the 'Course introduction' and 'What to expect from this course' sections below, so you'll have the essential information you need to decide and to do well in this course! If you have any questions about the course format, deadlines or grading, you'll probably find the answers here. Are you a Coursera veteran and ready to get started? Then you might want to skip ahead to the first course topic: 'Exploring data'. You can always check the general information later. Veterans and newbies alike: Don't forget to introduce yourself in the 'meet and greet' forum!
Exploring Data
In this first module, we’ll introduce the basic concepts of descriptive statistics. We’ll talk about cases and variables, and we’ll explain how you can order them in a so-called data matrix. We’ll discuss various levels of measurement and we’ll show you how you can present your data by means of tables and graphs. We’ll also introduce measures of central tendency (like mode, median and mean) and dispersion (like range, interquartile range, variance and standard deviation). We’ll not only tell you how to interpret them; we’ll also explain how you can compute them. Finally, we’ll tell you more about z-scores. In this module we’ll only discuss situations in which we analyze one single variable. This is what we call univariate analysis. In the next module we will also introduce studies in which more variables are involved.

WEEK 2
Correlation and Regression
In this second module we’ll look at bivariate analyses: studies with two variables. First we’ll introduce the concept of correlation. We’ll investigate contingency tables (when it comes to categorical variables) and scatterplots (regarding quantitative variables). We’ll also learn how to understand and compute one of the most frequently used measures of correlation: Pearson's r. In the next part of the module we’ll introduce the method of OLS regression analysis. We’ll explain how you (or the computer) can find the regression line and how you can describe this line by means of an equation. We’ll show you that you can assess how well the regression line fits your data by means of the so-called r-squared. We conclude the module with a discussion of why you should always be very careful when interpreting the results of a regression analysis.

WEEK 3
Probability
This module introduces concepts from probability theory and the rules for calculating with probabilities. This is not only useful for answering various kinds of applied statistical questions but also to understand the statistical analyses that will be introduced in subsequent modules. We start by describing randomness, and explain how random events surround us. Next, we provide an intuitive definition of probability through an example and relate this to the concepts of events, sample space and random trials. A graphical tool to understand these concepts is introduced here as well, the tree-diagram.Thereafter a number of concepts from set theory are explained and related to probability calculations. Here the relation is made to tree-diagrams again, as well as contingency tables. We end with a lesson where conditional probabilities, independence and Bayes rule are explained. All in all, this is quite a theoretical module on a topic that is not always easy to grasp. That's why we have included as many intuitive examples as possible.

WEEK 4
Probability Distributions
Probability distributions form the core of many statistical calculations. They are used as mathematical models to represent some random phenomenon and subsequently answer statistical questions about that phenomenon. This module starts by explaining the basic properties of a probability distribution, highlighting how it quantifies a random variable and also pointing out how it differs between discrete and continuous random variables. Subsequently the cumulative probability distribution is introduced and its properties and usage are explained as well. In a next lecture it is shown how a random variable with its associated probability distribution can be characterized by statistics like a mean and variance, just like observational data. The effects of changing random variables by multiplication or addition on these statistics are explained as well.The lecture thereafter introduces the normal distribution, starting by explaining its functional form and some general properties. Next, the basic usage of the normal distribution to calculate probabilities is explained. And in a final lecture the binomial distribution, an important probability distribution for discrete data, is introduced and further explained. By the end of this module you have covered quite some ground and have a solid basis to answer the most frequently encountered statistical questions. Importantly, the fundamental knowledge about probability distributions that is presented here will also provide a solid basis to learn about inferential statistics in the next modules.

WEEK 5
Sampling Distributions
Methods for summarizing sample data are called descriptive statistics. However, in most studies we’re not interested in samples, but in underlying populations. If we employ data obtained from a sample to draw conclusions about a wider population, we are using methods of inferential statistics. It is therefore of essential importance that you know how you should draw samples. In this module we’ll pay attention to good sampling methods as well as some poor practices. To draw conclusions about the population a sample is from, researchers make use of a probability distribution that is very important in the world of statistics: the sampling distribution. We’ll discuss sampling distributions in great detail and compare them to data distributions and population distributions. We’ll look at the sampling distribution of the sample mean and the sampling distribution of the sample proportion.

WEEK 6
Confidence Intervals
We can distinguish two types of statistical inference methods. We can: (1) estimate population parameters; and (2) test hypotheses about these parameters. In this module we’ll talk about the first type of inferential statistics: estimation by means of a confidence interval. A confidence interval is a range of numbers, which, most likely, contains the actual population value. The probability that the interval actually contains the population value is what we call the confidence level. In this module we’ll show you how you can construct confidence intervals for means and proportions and how you should interpret them. We’ll also pay attention to how you can decide how large your sample size should be.

WEEK 7
Significance Tests
In this module we’ll talk about statistical hypotheses. They form the main ingredients of the method of significance testing. An hypothesis is nothing more than an expectation about a population. When we conduct a significance test, we use (just like when we construct a confidence interval) sample data to draw inferences about population parameters. The significance test is, therefore, also a method of inferential statistics. We’ll show that each significance test is based on two hypotheses: the null hypothesis and the alternative hypothesis. When you do a significance test, you assume that the null hypothesis is true unless your data provide strong evidence against it. We’ll show you how you can conduct a significance test about a mean and how you can conduct a test about a proportion. We’ll also demonstrate that significance tests and confidence intervals are closely related. We conclude the module by arguing that you can make right and wrong decisions while doing a test. Wrong decisions are referred to as Type I and Type II errors.

WEEK 8
Exam time!
This is the final module, where you can apply everything you've learned until now in the final exam. Please note that you can only take the final exam once a month, so make sure you are fully prepared to take the test. Please follow the honor code and do not communicate or confer with others while taking this exam. Good luck!

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Introduction to Probability and Data with R (Coursera) Coursera
Duke University

Introduction to Probability and Data with R (Coursera)

This course introduces you to sampling and exploring data, as well as basic probability theory and Bayes' rule. You will examine various types of sampling methods, and discuss how such methods can impact the scope of inference. A variety of exploratory data analysis techniques will be covered, including numeric summary statistics and basic data visualization.

Jul 6th 2026
5-12 Weeks
Machine Learning: Regression (Coursera) Coursera
University of Washington

Machine Learning: Regression (Coursera)

Case Study - Predicting Housing Prices. In our first case study, predicting house prices, you will create models that predict a continuous value (price) from input features (square footage, number of bedrooms and bathrooms,...). This is just one of the many places where regression can be applied. Other applications range from predicting health outcomes in medicine, stock prices in finance, and power usage in high-performance computing, to analyzing which regulators are important for gene expression.

Jul 6th 2026
5-12 Weeks
Practical Predictive Analytics: Models and Methods (Coursera) Coursera
University of Washington

Practical Predictive Analytics: Models and Methods (Coursera)

Statistical experiment design and analytics are at the heart of data science. In this course you will design statistical experiments and analyze the results using modern methods. You will also explore the common pitfalls in interpreting statistical arguments, especially those associated with big data. Collectively, this course will help you internalize a core set of practical and effective machine learning methods and concepts, and apply them to solve some real world problems.

Jul 6th 2026
4 Weeks
Data Analytics for Lean Six Sigma (Coursera) Coursera
University of Amsterdam

Data Analytics for Lean Six Sigma (Coursera)

Welcome to this course on Data Analytics for Lean Six Sigma. In this course you will learn data analytics techniques that are typically useful within Lean Six Sigma improvement projects. At the end of this course you are able to analyse and interpret data gathered within such a project. You will be able to use Minitab to analyse the data. I will also briefly explain what Lean Six Sigma is.

Jul 6th 2026
5-12 Weeks
Doping : Sports, Organizations and Sciences (Coursera) Coursera
University of Lausanne

Doping : Sports, Organizations and Sciences (Coursera)

The objective of this course is to encourage a critical understanding of doping. To achieve this goal, this course will rely on a multidisciplinary approach that allow you to see how different disciplines get into a single object, in different perspectives and in often complementary ways. This approach will also allow us to appreciate the complexity of a subject like doping. Doping in sports is a complex practice whose definition and identification is the result of socially and historically constructed norms. This course offers to shed light on the processes that led to the use and prohibition of doping substances.

Jul 6th 2026
4 Weeks
Big Data, Genes, and Medicine (Coursera) Coursera
The State University of New York

Big Data, Genes, and Medicine (Coursera)

This course distills for you expert knowledge and skills mastered by professionals in Health Big Data Science and Bioinformatics. You will learn exciting facts about the human body biology and chemistry, genetics, and medicine that will be intertwined with the science of Big Data and skills to harness the avalanche of data openly available at your fingertips and which we are just starting to make sense of.

Jul 6th 2026
5-12 Weeks
Linear Regression and Modeling (Coursera) Coursera
Duke University

Linear Regression and Modeling (Coursera)

This course introduces simple and multiple linear regression models. These models allow you to assess the relationship between variables in a data set and a continuous response variable. Is there a relationship between the physical attractiveness of a professor and their student evaluation scores? Can we predict the test score for a child based on certain characteristics of his or her mother? In this course, you will learn the fundamental theory behind linear regression and, through data examples, learn to fit, examine, and utilize regression models to examine relationships between multiple variables, using the free statistical software R and RStudio.

Jul 6th 2026
4 Weeks
ADHD: Everyday Strategies for Elementary Students (Coursera) Coursera
University at Buffalo,The State University of New York

ADHD: Everyday Strategies for Elementary Students (Coursera)

This course will provide an overview of ADHD diagnosis and treatment. Course participants can expect to learn about ADHD as a developmental disorder that begins early in childhood, and participants will also learn about evidence-based approaches for diagnosing ADHD. Following that, two evidence-based treatment approaches (the Daily Report Card and Parenting Strategies) will be introduced. (Note these course activities are informational and are not intended to replace working with a licensed/trained professional).

Jul 6th 2026
4 Weeks
Inferential and Predictive Statistics for Business (Coursera) Coursera
University of Illinois at Urbana-Champaign

Inferential and Predictive Statistics for Business (Coursera)

This course provides an analytical framework to help you evaluate key problems in a structured fashion and will equip you with tools to better manage the uncertainties that pervade and complicate business processes. The course aim to cover statistical ideas that apply to managers. We will consider two basic themes: first, is recognizing and describing variations present in everything around us, and then modeling and making decisions in the presence of these variations.

Jul 6th 2026
4 Weeks
Practical Machine Learning (Coursera) Coursera
Johns Hopkins University

Practical Machine Learning (Coursera)

One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates.

Jul 6th 2026
4 Weeks
Advanced Linear Models for Data Science 2: Statistical Linear Models (Coursera) Coursera
Johns Hopkins University

Advanced Linear Models for Data Science 2: Statistical Linear Models (Coursera)

Welcome to the Advanced Linear Models for Data Science Class 2: Statistical Linear Models. This class is an introduction to least squares from a linear algebraic and mathematical perspective. Before beginning the class make sure that you have the following: a basic understanding of linear algebra and multivariate calculus; a basic understanding of statistics and regression models; at least a little familiarity with proof based mathematics; basic knowledge of the R programming language.

Jul 6th 2026
4 Weeks