Coursera

Statistical Inference and Hypothesis Testing in Data Science Applications (Coursera)

Offered by University of Colorado Boulder

This course will focus on theory and implementation of hypothesis testing, especially as it relates to applications in data science. Students will learn to use hypothesis tests to make informed decisions from data. Special attention will be given to the general logic of hypothesis testing, error and error rates, power, simulation, and the correct computation and interpretation of p-values. Attention will also be given to the misuse of testing concepts, especially p-values, and the ethical implications of such misuse.

This course can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform. The MS-DS is an interdisciplinary degree that brings together faculty from CU Boulder’s departments of Applied Mathematics, Computer Science, Information Science, and others. With performance-based admissions and no application process, the MS-DS is ideal for individuals with a broad range of undergraduate education and/or professional experience in computer science, information science, mathematics, and statistics.

What You Will Learn

Define a composite hypothesis and the level of significance for a test with a composite null hypothesis.
Define a test statistic, level of significance, and the rejection region for a hypothesis test. Give the form of a rejection region.
Perform tests concerning a true population variance.
Compute the sampling distributions for the sample mean and sample minimum of the exponential distribution.

Course 3 of 3 in the Data Science Foundations: Statistical Inference Specialization

Syllabus

WEEK 1
Fundamental Concepts of Hypothesis Testing
In this module, we will define a hypothesis test and develop the intuition behind designing a test. We will learn the language of hypothesis testing, which includes definitions of a null hypothesis, an alternative hypothesis, and the level of significance of a test. We will walk through a very simple test.
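
As an illustration of what such a very simple test can look like (a minimal sketch, not taken from the course materials), consider a one-sided test of H0: mu = mu0 against H1: mu > mu0 for normally distributed data with known standard deviation. The function name `z_test_upper`, the data values, and the choices mu0 = 5.0 and sigma = 0.5 are all assumptions made up for this example.

```python
import math
from statistics import NormalDist


def z_test_upper(sample, mu0, sigma, alpha=0.05):
    """One-sided z-test of H0: mu = mu0 vs. H1: mu > mu0 (sigma assumed known)."""
    n = len(sample)
    xbar = sum(sample) / n
    # Critical value c chosen so that P(sample mean > c | H0 true) = alpha.
    c = mu0 + NormalDist().inv_cdf(1 - alpha) * sigma / math.sqrt(n)
    return {"xbar": xbar, "critical_value": c, "reject": xbar > c}


# Hypothetical measurements, invented purely for illustration.
data = [5.1, 4.9, 5.6, 5.8, 5.3, 5.7, 5.2, 5.9]
result = z_test_upper(data, mu0=5.0, sigma=0.5)
print(result)
```

The rejection region here has the form "sample mean greater than c", and the level of significance alpha is the probability of landing in that region when the null hypothesis is true.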

WEEK 2
Composite Tests, Power Functions, and P-Values
In this module, we will expand the lessons of Module 1 to composite hypotheses for both one- and two-tailed tests. We will define the “power function” for a test and discuss its interpretation and how it can lead to the idea of a “uniformly most powerful” test. We will discuss and interpret “p-values” as an alternative approach to hypothesis testing.
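
To make the power function and the p-value concrete, here is a hedged sketch (not from the course) for one particular setting: a one-sided z-test of H0: mu <= mu0 against H1: mu > mu0 with known standard deviation sigma and sample size n, which rejects when the standardized sample mean exceeds the upper-alpha normal quantile. The function names `power` and `p_value_upper` and all numeric inputs are illustrative assumptions.

```python
import math
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal, mean 0 and standard deviation 1


def power(mu, mu0, sigma, n, alpha=0.05):
    """Probability of rejecting H0 when the true mean is mu."""
    z_alpha = STD_NORMAL.inv_cdf(1 - alpha)
    shift = (mu - mu0) * math.sqrt(n) / sigma
    return 1 - STD_NORMAL.cdf(z_alpha - shift)


def p_value_upper(xbar, mu0, sigma, n):
    """Probability, under mu = mu0, of a sample mean at least as large as xbar."""
    z = (xbar - mu0) * math.sqrt(n) / sigma
    return 1 - STD_NORMAL.cdf(z)


# Power evaluated at a few hypothetical true means (values chosen for illustration).
for mu in (5.0, 5.2, 5.4, 5.6):
    print(mu, round(power(mu, mu0=5.0, sigma=0.5, n=8), 4))

print(p_value_upper(5.4375, mu0=5.0, sigma=0.5, n=8))
```

At mu = mu0 the power equals alpha, and it increases as the true mean moves further into the alternative. Rejecting when the p-value falls below alpha gives the same decision as the rejection-region form of this test.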

WEEK 3
t-Tests and Two-Sample Tests
In this module, we will learn about the chi-squared and t distributions and their relationships to sampling distributions. We will learn to identify when hypothesis tests based on these distributions are appropriate. We will review the concept of sample variance and derive the “t-test”. Additionally, we will derive our first two-sample test and apply it to make some decisions about real data.
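
For readers who want to see the t statistic in code, the following is a minimal sketch for the one-sample case only, assuming approximately normal data with unknown variance. It is not the course's own implementation; `one_sample_t` and the data are hypothetical. Python's standard library has no t-distribution quantile function, so the sketch stops at the statistic and its degrees of freedom, which would then be compared against a t table or statistical software.

```python
import math
from statistics import mean, stdev


def one_sample_t(sample, mu0):
    """Return the t statistic for H0: mu = mu0 and its degrees of freedom."""
    n = len(sample)
    xbar = mean(sample)
    s = stdev(sample)  # sample standard deviation (divides by n - 1)
    t = (xbar - mu0) / (s / math.sqrt(n))
    return t, n - 1


# Hypothetical measurements, invented purely for illustration.
data = [5.1, 4.9, 5.6, 5.8, 5.3, 5.7, 5.2, 5.9]
t_stat, df = one_sample_t(data, mu0=5.0)
print(t_stat, df)
```

The only change from a known-variance z statistic is that the sample standard deviation replaces sigma, which is why the reference distribution becomes a t distribution with n - 1 degrees of freedom.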

WEEK 4
Beyond Normality
In this module, we will consider some problems where the assumption of an underlying normal distribution is not appropriate and will expand our ability to construct hypothesis tests for this case. We will define the concept of a “uniformly most powerful” (UMP) test, examine whether such a test exists for specific problems, and revisit some of our earlier tests from Modules 1 and 2 through the UMP lens. We will also introduce the F-distribution and its role in testing whether two population variances are equal.

WEEK 5
Likelihood Ratio Tests and Chi-Squared Tests
In this module, we develop a formal approach to hypothesis testing, based on a “likelihood ratio,” that can be applied more generally than any of the tests we have discussed so far. We will pay special attention to the large-sample properties of the likelihood ratio, especially Wilks’ Theorem, which will allow us to come up with approximate (but easy) tests when we have a large sample size. We will close the course with two chi-squared tests that can be used to test whether the distributional assumptions we have been making throughout this course are valid.
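
As one hedged illustration of a large-sample likelihood ratio test (an example chosen here, not one confirmed to appear in the course), take an exponential sample with rate lambda and test H0: lambda = lambda0 against a two-sided alternative. With sample mean xbar, the statistic is -2 log(Lambda) = 2n(lambda0 * xbar - 1 - log(lambda0 * xbar)), which by Wilks’ Theorem is approximately chi-squared with 1 degree of freedom. The function name `exp_rate_lrt` and the data are assumptions; the chi-squared(1) critical value is obtained as the square of a standard normal quantile.

```python
import math
from statistics import NormalDist


def exp_rate_lrt(sample, rate0, alpha=0.05):
    """Approximate likelihood ratio test of H0: rate = rate0 for exponential data."""
    n = len(sample)
    xbar = sum(sample) / n
    r = rate0 * xbar  # equals 1 exactly when the MLE 1/xbar matches rate0
    stat = 2 * n * (r - 1 - math.log(r))
    # A chi-squared(1) variable is a squared standard normal, so its upper-alpha
    # quantile is the squared upper-(alpha/2) standard normal quantile.
    critical = NormalDist().inv_cdf(1 - alpha / 2) ** 2
    return {"statistic": stat, "critical_value": critical, "reject": stat > critical}


# Hypothetical waiting times, invented purely for illustration.
waits = [0.8, 2.5, 1.9, 3.2, 0.4, 2.7, 1.1, 3.6]
print(exp_rate_lrt(waits, rate0=1.0))
```

The approximation relies on a large sample size, so with only a handful of observations, as in this toy input, the stated level should be read as rough.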

Related Courses

Coursera

McMaster University

Experimentation for Improvement (Coursera)

Statistics & Data Analysis Data Science

We are always using experiments to improve our lives, our community, and our work. Are you doing it efficiently? Or are you (incorrectly) changing one thing at a time and hoping for the best? In this course, you will learn how to plan efficient experiments - testing with many variables. Our goal is to find the best results using only a few experiments. A key part of the course is how to optimize a system.

Aug 17th 2026

5-12 Weeks

Statistics Data Science Regression Models

Coursera

IBM

Machine Learning Rapid Prototyping with IBM Watson Studio (Coursera)

Data Science

An emerging trend in AI is the availability of technologies in which automation is used to select a best-fit model, perform feature engineering and improve model performance via hyperparameter optimization. This automation will provide rapid-prototyping of models and allow the Data Scientist to focus their efforts on applying domain knowledge to fine-tune models. This course will take the learner through the creation of an end-to-end automated pipeline built by Watson Studio’s AutoAI experiment tool, explaining the underlying technology at work as developed by IBM Research.

Aug 17th 2026

4 Weeks

Python Artificial Intelligence Prototyping

Coursera

Georgia Institute of Technology

Materials Data Sciences and Informatics (Coursera)

Engineering Sci: Chemistry

This course aims to provide a succinct overview of the emerging discipline of Materials Informatics at the intersection of materials science, computational science, and information science. Attention is drawn to specific opportunities afforded by this new field in accelerating materials development and deployment efforts.

Aug 10th 2026

5-12 Weeks

Statistics Informatics Chemistry

Coursera

Eindhoven University of Technology

Improving Your Statistical Questions (Coursera)

Statistics & Data Analysis Data Science

This course aims to help you ask better statistical questions when performing empirical research. We will discuss how to design studies that are informative both when your predictions are correct and when they are wrong. We will question norms and reflect on how we can improve research practices to ask more interesting questions.

Aug 10th 2026

5-12 Weeks

Statistics Statistical Inference Meta-Analysis

Coursera

FIA Business School

Ferramentas para Ciência de Dados: Introdução ao R (Coursera)

Statistics & Data Analysis

Welcome to the course Ferramentas para Ciência de Dados: Introdução ao R (Tools for Data Science: Introduction to R). In this course, you will learn that the world has come a long way in data-driven decision making, and that the amount of information we have access to today can no longer be compared with what was available decades ago.

Aug 10th 2026

4 Weeks

Data Science R Language R Programming

Coursera

University of Michigan

Applied Plotting, Charting & Data Representation in Python (Coursera)

Statistics & Data Analysis Data Science

This course will introduce the learner to information visualization basics, with a focus on reporting and charting using the matplotlib library. The course will start with a design and information literacy perspective, touching on what makes a visualization good or bad and how statistical measures translate into visualizations. The second week will focus on matplotlib, the technology used to make visualizations in Python, and will introduce users to best practices for creating basic charts and realizing design decisions in the framework.

Aug 10th 2026

4 Weeks

Python Data Analysis Data Science

Coursera

Stanford University

Introduction to Statistics (Coursera)

Statistics & Data Analysis Data Science

Stanford's "Introduction to Statistics" teaches you statistical thinking concepts that are essential for learning from data and communicating insights. By the end of the course, you will be able to perform exploratory data analysis, understand key principles of sampling, and select appropriate tests of significance for multiple contexts. You will gain the foundational skills that prepare you to pursue more advanced topics in statistical thinking and machine learning.

Aug 10th 2026

5-12 Weeks

Statistics Analysis Probability

Coursera

University of Illinois at Urbana-Champaign

Visualization for Data Journalism (Coursera)

Statistics & Data Analysis Data Science

While telling stories with data has been part of the news practice since its earliest days, it is in the midst of a renaissance. Graphics desks, which used to be deemed “the art department,” a subfield outside the work of newsrooms, are becoming a core part of newsrooms’ operations. The people who design news graphics (they often have various titles: data journalists, news artists, graphic reporters, developers, etc.) are expected to be full-fledged journalists and work closely with reporters and editors.

Aug 10th 2026

5-12 Weeks

Python Storytelling Data Analysis

Coursera

Johns Hopkins University

Statistics for Genomic Data Science (Coursera)

Statistics & Data Analysis Data Science

An introduction to the statistics behind the most popular genomic data science projects. This is the sixth course in the Genomic Big Data Science Specialization from Johns Hopkins University.

Aug 17th 2026

4 Weeks

Statistics Biostatistics Data Analysis

Coursera

Georgia Institute of Technology

Fundamentals of Engineering Exam Review (Coursera)

Engineering Sci: Physics

The purpose of this course is to review the material covered in the Fundamentals of Engineering (FE) exam to enable the student to pass it. It will be presented in modules corresponding to the FE topics, particularly those in Civil and Mechanical Engineering. Each module will review main concepts, illustrate them with examples, and provide extensive practice problems.

Aug 10th 2026

5-12 Weeks

Math Statistics Probability

Coursera

Johns Hopkins University

Principles of fMRI 1 (Coursera)

Statistics & Data Analysis

Functional Magnetic Resonance Imaging (fMRI) is the most widely used technique for investigating the living, functioning human brain as people perform tasks and experience mental states. It is a convergence point for work from many disciplines. Psychologists, statisticians, physicists, computer scientists, neuroscientists, medical researchers, behavioral scientists, engineers, public health researchers, biologists, and others are coming together to advance our understanding of the human mind and brain. This course covers the design, acquisition, and analysis of fMRI data, including psychological inference, MR physics, k-space, experimental design, pre-processing of fMRI data, and Generalized Linear Models (GLMs).

Aug 17th 2026

4 Weeks

Medicine Statistics Data Analysis

Coursera

IBM

Machine Learning Introduction for Everyone (Coursera)

Data Science

This three-module course introduces machine learning and data science for everyone with a foundational understanding of machine learning models. You’ll learn about the history of machine learning, applications of machine learning, the machine learning model lifecycle, and tools for machine learning. You’ll also learn about supervised versus unsupervised learning, classification, regression, evaluating machine learning models, and more.

Aug 17th 2026

3 Weeks

ML Artificial Intelligence Machine Learning