Coursera

Mathematics for Machine Learning: PCA (Coursera)

Offered by Imperial College London,

This intermediate-level course introduces the mathematical foundations to derive Principal Component Analysis (PCA), a fundamental dimensionality reduction technique. We'll cover some basic statistics of data sets, such as mean values and variances, we'll compute distances and angles between vectors using inner products and derive orthogonal projections of data onto lower-dimensional subspaces. Using all these tools, we'll then derive PCA as a method that minimizes the average squared reconstruction error between data points and their reconstruction.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

At the end of this course, you'll be familiar with important mathematical concepts and you can implement PCA all by yourself. If you’re struggling, you'll find a set of jupyter notebooks that will allow you to explore properties of the techniques and walk you through what you need to do to get on track. If you are already an expert, this course may refresh some of your knowledge.
The lectures, examples and exercises require:

Some ability of abstract thinking
Good background in linear algebra (e.g., matrix and vector algebra, linear independence, basis)
Basic background in multivariate calculus (e.g., partial derivatives, basic optimization)
Basic knowledge in python programming and numpy

Disclaimer: This course is substantially more abstract and requires more programming than the other two courses of the specialization. However, this type of abstract thinking, algebraic manipulation and programming is necessary if you want to understand and develop machine learning algorithms.
What You Will Learn

Implement mathematical concepts using real-world data
Derive PCA from a projection perspective
Understand how orthogonal projections work
Master PCA

Course 3 of 3 in the Mathematics for Machine Learning Specialization.

Syllabus

WEEK 1
Statistics of Datasets
Principal Component Analysis (PCA) is one of the most important dimensionality reduction algorithms in machine learning. In this course, we lay the mathematical foundations to derive and understand PCA from a geometric point of view. In this module, we learn how to summarize datasets (e.g., images) using basic statistics, such as the mean and the variance. We also look at properties of the mean and the variance when we shift or scale the original data set. We will provide mathematical intuition as well as the skills to derive the results. We will also implement our results in code (jupyter notebooks), which will allow us to practice our mathematical understand to compute averages of image data sets.

WEEK 2
Inner Products
Data can be interpreted as vectors. Vectors allow us to talk about geometric concepts, such as lengths, distances and angles to characterise similarity between vectors. This will become important later in the course when we discuss PCA. In this module, we will introduce and practice the concept of an inner product. Inner products allow us to talk about geometric concepts in vector spaces. More specifically, we will start with the dot product (which we may still know from school) as a special case of an inner product, and then move toward a more general concept of an inner product, which play an integral part in some areas of machine learning, such as kernel machines (this includes support vector machines and Gaussian processes). We have a lot of exercises in this module to practice and understand the concept of inner products.

WEEK 3
Orthogonal Projections
In this module, we will look at orthogonal projections of vectors, which live in a high-dimensional vector space, onto lower-dimensional subspaces. This will play an important role in the next module when we derive PCA. We will start off with a geometric motivation of what an orthogonal projection is and work our way through the corresponding derivation. We will end up with a single equation that allows us to project any vector onto a lower-dimensional subspace. However, we will also understand how this equation came about. As in the other modules, we will have both pen-and-paper practice and a small programming example with a jupyter notebook.

WEEK 4
Principal Component Analysis
We can think of dimensionality reduction as a way of compressing data with some loss, similar to jpg or mp3. Principal Component Analysis (PCA) is one of the most fundamental dimensionality reduction techniques that are used in machine learning. In this module, we use the results from the first three modules of this course and derive PCA from a geometric point of view. Within this course, this module is the most challenging one, and we will go through an explicit derivation of PCA plus some coding exercises that will make us a proficient user of PCA.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

Duke University

Introduction to Machine Learning (Coursera)

Data Science

This course will provide you a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) as well as demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jun 5th 2026

5-12 Weeks

ML NLP Machine Learning

Coursera

University of Minnesota

Matrix Factorization and Advanced Techniques (Coursera)

Data Science

In this course you will learn a variety of matrix factorization and hybrid machine learning techniques for recommender systems. Starting with basic matrix factorization, you will understand both the intuition and the practical details of building recommender systems based on reducing the dimensionality of the user-product preference space. Then you will learn about techniques that combine the strengths of different algorithms into powerful hybrid recommenders.

Jun 1st 2026

5-12 Weeks

Machine Learning Matrix Recommender Systems

Coursera

EIT Digital

Marketing Strategy for Entrepreneurs (Coursera)

Management & Leadership Marketing & Communication

You live a hands-on-life, and you intend continuing doing so! That is why I guess you already have checked where the QR-code (the logo) for this course lead to, right? And it is in such kind of setting you prefer hands-on-learning. Things you can do, already today, is something you value. You actually did not start this course when enrolling it. You started it long ago, Either as a customer somewhere, or just maybe thinking about marketing for some time. Or maybe you are already working on marketing, of you yourself or maybe your own company. Or as employed somewhere of course. In all these situations what matter is action. Action that contribute to your marketing-journey.

Jun 1st 2026

5-12 Weeks

Analysis Marketing Entrepreneur

Coursera

Johns Hopkins University

Exploratory Data Analysis (Coursera)

Statistics & Data Analysis Data Science

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.

Jun 1st 2026

4 Weeks

Statistics Data Analysis Data Science

Coursera

Duke University

Bayesian Statistics (Coursera)

Statistics & Data Analysis Data Science

This course describes Bayesian statistics, in which one's inferences about parameters or hypotheses are updated as evidence accumulates. You will learn to use Bayes’ rule to transform prior probabilities into posterior probabilities, and be introduced to the underlying theory and perspective of the Bayesian paradigm.

Jun 1st 2026

5-12 Weeks

Statistics Data Analysis R Programming

Coursera

Icahn School of Medicine at Mount Sinai

Experimental Methods in Systems Biology (Coursera)

Sci: Biology & Life Sciences

Learn about the technologies underlying experimentation used in systems biology, with particular focus on RNA sequencing, mass spec-based proteomics, flow/mass cytometry and live-cell imaging. A key driver of the systems biology field is the technology allowing us to delve deeper and wider into how cells respond to experimental perturbations. This in turns allows us to build more detailed quantitative models of cellular function, which can give important insight into applications ranging from biotechnology to human disease. This course gives a broad overview of a variety of current experimental techniques used in modern systems biology, with focus on obtaining the quantitative data needed for computational modeling purposes in downstream analyses.

Jun 1st 2026

5-12 Weeks

Biology Cell Systems Biology

Coursera

University of Washington

Machine Learning: Classification (Coursera)

Statistics & Data Analysis Data Science

Case Studies: Analyzing Sentiment & Loan Default Prediction. In our case study on analyzing sentiment, you will create models that predict a class (positive/negative sentiment) from input features (text of the reviews, user profile information,...). In our second case study for this course, loan default prediction, you will tackle financial data, and predict when a loan is likely to be risky or safe for the bank.

Jun 1st 2026

5-12 Weeks

Python Machine Learning Classification

Coursera

Icahn School of Medicine at Mount Sinai

Big Data Science with the BD2K-LINCS Data Coordination and Integration Center (Coursera)

Health & Society Science

In this course we briefly introduce the DCIC and the various Centers that collect data for LINCS. We then cover metadata and how metadata is linked to ontologies. We then present data processing and normalization methods to clean and harmonize LINCS data. This follow discussions about how data is served as RESTful APIs. Most importantly, the course covers computational methods including: data clustering, gene-set enrichment analysis, interactive data visualization, and supervised learning. Finally, we introduce crowdsourcing/citizen-science projects where students can work together in teams to extract expression signatures from public databases and then query such collections of signatures against LINCS data for predicting small molecules as potential therapeutics.

Jun 1st 2026

5-12 Weeks

Network Analysis Big Data

Coursera

University of California, San Diego,Higher School of Economics - HSE University

Advanced Algorithms and Complexity (Coursera)

CS: Software Engineering CS: Theory

You've learned the basic algorithms now and are ready to step into the area of more complex problems and algorithms to solve them. Advanced algorithms build upon basic ones and use new ideas. We will start with networks flows which are used in more typical applications such as optimal matchings, finding disjoint paths and flight scheduling as well as more surprising ones like image segmentation in computer vision.

Jun 1st 2026

5-12 Weeks

Algorithms Data Structures Machine Learning

Coursera

University of Washington

Social Media Data Analytics (Coursera)

Statistics & Data Analysis Data Science

Learner Outcomes: After taking this course, you will be able to: utilize various Application Programming Interface (API) services to collect data from different social media sources such as YouTube, Twitter, and Flickr; process the collected data - primarily structured - using methods involving correlation, regression, and classification to derive insights about the sources and people who generated that data; analyze unstructured data - primarily textual comments - for sentiments expressed in them; use different tools for collecting, analyzing, and exploring social media data for research and development purposes.

Jun 1st 2026

4 Weeks

Data Analysis Social Media Twitter

Coursera

Johns Hopkins University

Practical Machine Learning (Coursera)

Statistics & Data Analysis Data Science

One of the most common tasks performed by data scientists and data analysts are prediction and machine learning. This course will cover the basic components of building and applying prediction functions with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and tests sets, overfitting, and error rates.

Jun 1st 2026

4 Weeks

Algorithms Machine Learning Regression

Coursera

Rice University

Investment Strategies and Portfolio Analysis (Coursera)

Economics & Finance Business

In this course, you will learn about latest investment strategies and performance evaluation. You will start by learning portfolio performance measures and discuss best practices in portfolio performance evaluation. You will explore different evaluation techniques such as style analysis and attribution analysis and apply them to evaluate different investment strategies. Special emphasis will be given to recent financial market innovations and current investment trends.

Jun 1st 2026

3 Weeks

Business Analysis Measurement