Coursera

Mathematics for Machine Learning: Multivariate Calculus (Coursera)

Offered by Imperial College London,

This course offers a brief introduction to the multivariate calculus required to build many common machine learning techniques. We start at the very beginning with a refresher on the “rise over run” formulation of a slope, before converting this to the formal definition of the gradient of a function. We then start to build up a set of tools for making calculus easier and faster. Next, we learn how to calculate vectors that point up hill on multidimensional surfaces and even put this into action using an interactive game.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

We take a look at how we can use calculus to build approximations to functions, as well as helping us to quantify how accurate we should expect those approximations to be. We also spend some time talking about where calculus comes up in the training of neural networks, before finally showing you how it is applied in linear regression models. This course is intended to offer an intuitive understanding of calculus, as well as the language necessary to look concepts up yourselves when you get stuck. Hopefully, without going into too much detail, you’ll still come away with the confidence to dive into some more focused machine learning courses in future.
Course 2 of 3 in the Mathematics for Machine Learning Specialization.

Syllabus

WEEK 1
What is calculus?
Understanding calculus is central to understanding machine learning! You can think of calculus as simply a set of tools for analysing the relationship between functions and their inputs. Often, in machine learning, we are trying to find the inputs which enable a function to best match the data. We start this module from the basics, by recalling what a function is and where we might encounter one. Following this, we talk about the how, when sketching a function on a graph, the slope describes the rate of change of the output with respect to an input. Using this visual intuition we next derive a robust mathematical definition of a derivative, which we then use to differentiate some interesting functions. Finally, by studying a few examples, we develop four handy time saving rules that enable us to speed up differentiation for many common scenarios.

WEEK 2
Multivariate calculus
Building on the foundations of the previous module, we now generalise our calculus tools to handle multivariable systems. This means we can take a function with multiple inputs and determine the influence of each of them separately. It would not be unusual for a machine learning method to require the analysis of a function with thousands of inputs, so we will also introduce the linear algebra structures necessary for storing the results of our multivariate calculus analysis in an orderly fashion.

WEEK 3
Multivariate chain rule and its applications
Having seen that multivariate calculus is really no more complicated than the univariate case, we now focus on applications of the chain rule. Neural networks are one of the most popular and successful conceptual structures in machine learning. They are build up from a connected web of neurons and inspired by the structure of biological brains. The behaviour of each neuron is influenced by a set of control parameters, each of which needs to be optimised to best fit the data. The multivariate chain rule can be used to calculate the influence of each parameter of the networks, allow them to be updated during training.

WEEK 4
Taylor series and linearisation
The Taylor series is a method for re-expressing functions as polynomial series. This approach is the rational behind the use of simple linear approximations to complicated functions. In this module, we will derive the formal expression for the univariate Taylor series and discuss some important consequences of this result relevant to machine learning. Finally, we will discuss the multivariate case and see how the Jacobian and the Hessian come in to play.

WEEK 5
Intro to optimisation
If we want to find the minimum and maximum points of a function then we can use multivariate calculus to do this, say to optimise the parameters (the space) of a function to fit some data. First we’ll do this in one dimension and use the gradient to give us estimates of where the zero points of that function are, and then iterate in the Newton-Raphson method. Then we’ll extend the idea to multiple dimensions by finding the gradient vector, Grad, which is the vector of the Jacobian. This will then let us find our way to the minima and maxima in what is called the gradient descent method. We’ll then take a moment to use Grad to find the minima and maxima along a constraint in the space, which is the Lagrange multipliers method.

WEEK 6
Regression
In order to optimise the fitting parameters of a fitting function to the best fit for some data, we need a way to define how good our fit is. This goodness of fit is called chi-squared, which we’ll first apply to fitting a straight line - linear regression. Then we’ll look at how to optimise our fitting function using chi-squared in the general case using the gradient descent method. Finally, we’ll look at how to do this easily in Python in just a few lines of code, which will wrap up the course.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

Karlsruhe Institute of Technology - KIT

Machine Translation (Coursera)

Data Science

Welcome to the CLICS-Machine Translation MOOC. This MOOC explains the basic principles of machine translation. Machine translation is the task of translating from one natural language to another natural language. Therefore, these algorithms can help people communicate in different languages. Such algorithms are used in common applications, from Google Translate to apps on your mobile device.

Aug 3rd 2026

5-12 Weeks

Machine Learning Translation Data Science

Coursera

Duke University

Image and video processing: From Mars to Hollywood with a stop at the hospital (Coursera)

Engineering Sci: Mathematics

In this course, you will learn the science behind how digital images and video are made, altered, stored, and used. We will look at the vast world of digital imaging, from how computers and digital cameras form images to how digital special effects are used in Hollywood movies to how the Mars Rover was able to send photographs across millions of miles of space.

Aug 10th 2026

5-12 Weeks

Math Image Video

Coursera

University of Pennsylvania

Single Variable Calculus (Coursera)

Sci: Mathematics

Calculus is one of the grandest achievements of human thought, explaining everything from planetary orbits to the optimal size of a city to the periodicity of a heartbeat. This brisk course covers the core ideas of single-variable Calculus with emphases on conceptual understanding and applications. The course is ideal for students beginning in the engineering, physical, and social sciences. Distinguishing features of the course include: 1) the introduction and use of Taylor series and approximations from the beginning; 2) a novel synthesis of discrete and continuous forms of Calculus; 3) an emphasis on the conceptual over the computational; and 4) a clear, dynamic, unified approach.

Aug 3rd 2026

5-12 Weeks

Math Calculus Sequences

Coursera

University of London,Goldsmiths, University of London

Foundations of Data Science: K-Means Clustering in Python (Coursera)

Data Science

This MOOC, designed by an academic team from Goldsmiths, University of London, will quickly introduce you to the core concepts of Data Science to prepare you for intermediate and advanced Data Science courses. It focuses on the basic mathematics, statistics and programming skills that are necessary for typical data analysis tasks.

Aug 10th 2026

5-12 Weeks

Programming Python Machine Learning

Coursera

EDUCBA

Regression & Forecasting for Data Scientists using Python (Coursera)

CS: Information & Technology Data Science

This course provides comprehensive training in regression analysis and forecasting techniques for data science, emphasizing Python programming. You will master time-series analysis, forecasting, linear regression, and data preprocessing, enabling you to make data-driven decisions across industries.

Aug 10th 2026

4 Weeks

Python Regression Linear Regression

Coursera

University of Pennsylvania

Calculus: Single Variable Part 2 - Differentiation (Coursera)

Sci: Mathematics

Aug 3rd 2026

3 Weeks

Math Calculus Differential

Coursera

Stanford University

Probabilistic Graphical Models 1: Representation (Coursera)

Statistics & Data Analysis Data Science

Probabilistic graphical models (PGMs) are a rich framework for encoding probability distributions over complex domains: joint (multivariate) distributions over large numbers of random variables that interact with each other. These representations sit at the intersection of statistics and computer science, relying on concepts from probability theory, graph algorithms, machine learning, and more. They are the basis for the state-of-the-art methods in a wide variety of applications, such as medical diagnosis, image understanding, speech recognition, natural language processing, and many, many more. They are also a foundational tool in formulating many machine learning problems.

Aug 3rd 2026

5-12 Weeks

MATLAB Octave Machine Learning

Coursera

University of Pennsylvania

Calculus: Single Variable Part 4 - Applications (Coursera)

Sci: Mathematics

Aug 3rd 2026

5-12 Weeks

Math Calculus Probability

Coursera

Universitat Autònoma de Barcelona

Pre-Calculus (Coursera)

Sci: Mathematics

Curso diseñado para facilitar la entrada del estudiante en los cursos de cálculo de primer semestre de prácticamente cualquier grado universitario, con especial énfasis en Ciencias e Ingeniería.

Aug 3rd 2026

5-12 Weeks

Math Calculus Trigonometry

Coursera

DeepLearning.AI

AI For Everyone (Coursera)

Business

AI is not only for engineers. If you want your organization to become better at using AI, this is the course to tell everyone--especially your non-technical colleagues--to take.

Aug 10th 2026

4 Weeks

Artificial Intelligence Machine Learning Neural Networks

Coursera

The University of Sydney

Introduction to Calculus (Coursera)

Sci: Mathematics

The focus and themes of the Introduction to Calculus course address the most important foundations for applications of mathematics in science, engineering and commerce. The course emphasises the key ideas and historical motivation for calculus, while at the same time striking a balance between theory and application, leading to a mastery of key threshold concepts in foundational mathematics.

Aug 10th 2026

5-12 Weeks

Math Calculus Logic

Coursera

Alibaba Cloud Academy

Alibaba Cloud Native Solutions and Container Service (Coursera)

CS: Information & Technology

This course demonstrates how to use Alibaba Cloud Container Service and Container Registry Service to design and develop architectures related to cloud native applications, services, and security solutions. This course helps you understand the basic concepts of cloud native, the commercial implementation of container technology, and Kubernetes technology as well as extra benefits provided by Alibaba Cloud. This course is intended to prepare users to take the Alibaba Cloud Native ACA certification exam.

Aug 10th 2026

5-12 Weeks

Machine Learning Cloud Computing Kubernetes