Machine Learning Capstone (Coursera)

Offered by IBM,
Machine Learning Capstone (Coursera)

In this Machine Learning Capstone course, you will be using various Python-based machine learning libraries such as Pandas, scikit-learn, Tensorflow/Keras, to: build a course recommender system; analyze course related datasets, calculate cosine similarity, and create a similarity matrix; create recommendation systems by applying your knowledge of KNN, PCA, and non-negative matrix collaborative filtering; build similarity-based recommender systems; predict course ratings by training a neural network and constructing regression and classification models; build a Streamlit app that displays your work, and; share your work then evaluate your peers.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Course 6 of 6 in the IBM Machine Learning Professional Certificate.

What You Will Learn

  • Compare and contrast different machine learning algorithms by creating recommender systems in Python
  • Develop a final project using machine learning methods and evaluate your peers’ projects
  • Predict course ratings by training a neural network and constructing regression and classification models
  • Create recommendation systems by applying your knowledge of KNN, PCA, and non-negative matrix collaborative filtering

Syllabus

WEEK 1
Capstone Overview
In this module, you will be introduced to the idea of recommender systems in the first video. All labs in subsequent modules are based on this concept. You will also be provided with an overview of the capstone project. In the last two exercises, you will obtain an IBM Cloud feature code and use that code to create an IBM Watson Studio account.

WEEK 2
Exploratory Data Analysis and Feature Engineering
In module 2, you will perform exploratory data analysis to find preliminary insights such as data patterns. You will also use it to check assumptions with the help of summary statistics and graphical representations of online course-related data sets such as course titles, course genres, and course enrollments. Next, you will extract a word-count vector called a “bag of words” (BoW) from course titles and descriptions. The BoW feature is probably the simplest but most effective feature characterizing textual data. It is widely used in many textual machine learning tasks. Finally, you will apply the cosine similarity measurement to calculate the course similarity using the extracted BoW feature vectors.

WEEK 3
Unsupervised-Learning Based Recommender System
In module 3, you will create three course recommendation systems using different methods. In lab 1, you will create a course recommendation system based on user profile and course genre matrices by computing an interest score for each course and recommend the courses with the highest interest scores. In the second lab, you will generate a course similarity matrix to create the recommendation system. In the third lab, you will implement a clustering-based recommender system algorithm using K-means clustering and principal component analysis based on group members’ course enrollment history. In labs four and five you will use collaborative filtering to make predictions about a user’s interest based on a collection of other users’ similar preferences. In lab 4, you will perform KNN-based collaborative filtering and in lab 5, you will use non-negative matrix factorization.

WEEK 4
Supervised-Learning Based Recommender Systems
In this module, you will predict course ratings using neural networks. In the first lab, you will train neural networks to predict course ratings while simultaneously extracting users' and items' latent features. In lab 2, you will be given course interaction feature vectors as input data. Using regression analysis, you will calculate numerical rating scores that predict whether a student will audit or complete a course. Lab 3 is similar to lab 2 but instead of using regression you will use a classification model. You will extract user and item embedding feature vectors from a neural network. With those embedding feature vectors, you will create an interaction feature vector and use that to build a classification model. The model maps the interaction feature vector to a rating mode that predicts whether a learner will audit or complete a course.

WEEK 5
Share and Present Your Recommender Systems
In this module, you will be introduced to Streamlit and have the opportunity to build a Streamlit app to showcase your work in previous modules. You will review guidelines and best practices for creating successful reports. As well you may wish to review instructions on creating PowerPoint presentations and how to save a PowerPoint as a PDF.

WEEK 6
Final Submission
In this final module you will complete your submission of screenshots from the hands-on labs for your peers to review. Once you have completed your submission you will then review the submission of one of your peers and grade their submission.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Advanced Algorithms and Complexity (Coursera) Coursera
University of California, San Diego,Higher School of Economics - HSE University

Advanced Algorithms and Complexity (Coursera)

You've learned the basic algorithms now and are ready to step into the area of more complex problems and algorithms to solve them. Advanced algorithms build upon basic ones and use new ideas. We will start with networks flows which are used in more typical applications such as optimal matchings, finding disjoint paths and flight scheduling as well as more surprising ones like image segmentation in computer vision.

Jun 22nd 2026
5-12 Weeks
Computational Neuroscience (Coursera) Coursera
University of Washington

Computational Neuroscience (Coursera)

This course provides an introduction to basic computational methods for understanding what nervous systems do and for determining how they function. We will explore the computational principles governing various aspects of vision, sensory-motor control, learning, and memory. Specific topics that will be covered include representation of information by spiking neurons, processing of information in neural networks, and algorithms for adaptation and learning.

Jun 22nd 2026
5-12 Weeks
Probabilistic Graphical Models 2: Inference (Coursera) Coursera
Stanford University

Probabilistic Graphical Models 2: Inference (Coursera)

Probabilistic graphical models (PGMs) are a rich framework for encoding probability distributions over complex domains: joint (multivariate) distributions over large numbers of random variables that interact with each other. These representations sit at the intersection of statistics and computer science, relying on concepts from probability theory, graph algorithms, machine learning, and more.

Jun 22nd 2026
5-12 Weeks
Preparing for the Google Cloud Professional Data Engineer Exam (Coursera) Coursera
Google Cloud

Preparing for the Google Cloud Professional Data Engineer Exam (Coursera)

From the course: "The best way to prepare for the exam is to be competent in the skills required of the job." This course uses a top-down approach to recognize knowledge and skills already known, and to surface information and skill areas for additional preparation. You can use this course to help create your own custom preparation plan. It helps you distinguish what you know from what you don't know. And it helps you develop and practice skills required of practitioners who perform this job.

Jun 27th 2026
5-12 Weeks
Machine Learning: Clustering & Retrieval (Coursera) Coursera
University of Washington

Machine Learning: Clustering & Retrieval (Coursera)

Case Studies: Finding Similar Documents. A reader is interested in a specific news article and you want to find similar articles to recommend. What is the right notion of similarity? Moreover, what if there are millions of other documents? Each time you want to a retrieve a new document, do you need to search through all other documents? How do you group similar documents together? How do you discover new, emerging topics that the documents cover?

Jun 22nd 2026
5-12 Weeks
Recommender Systems: Evaluation and Metrics (Coursera) Coursera
University of Minnesota

Recommender Systems: Evaluation and Metrics (Coursera)

In this course you will learn how to evaluate recommender systems. You will gain familiarity with several families of metrics, including ones to measure prediction accuracy, rank accuracy, decision-support, and other factors such as diversity, product coverage, and serendipity. You will learn how different metrics relate to different user goals and business goals.

Jun 22nd 2026
4 Weeks
Python for Data Science, AI & Development (Coursera) Coursera
IBM

Python for Data Science, AI & Development (Coursera)

Kickstart your learning of Python for data science, as well as programming in general, with this beginner-friendly introduction to Python. Python is one of the world’s most popular programming languages, and there has never been greater demand for professionals with the ability to apply Python fundamentals to drive business solutions across industries.

Jun 23rd 2026
5-12 Weeks
Using Python to Interact with the Operating System (Coursera) Coursera
Google

Using Python to Interact with the Operating System (Coursera)

By the end of this course, you’ll be able to manipulate files and processes on your computer’s operating system. You’ll also have learned about regular expressions -- a very powerful tool for processing text files -- and you’ll get practice using the Linux command line on a virtual machine. And, this might feel like a stretch right now, but you’ll also write a program that processes a bunch of errors in an actual log file and then generates a summary file. That’s a super useful skill for IT Specialists to know.

Jun 23rd 2026
5-12 Weeks
Practical Predictive Analytics: Models and Methods (Coursera) Coursera
University of Washington

Practical Predictive Analytics: Models and Methods (Coursera)

Statistical experiment design and analytics are at the heart of data science. In this course you will design statistical experiments and analyze the results using modern methods. You will also explore the common pitfalls in interpreting statistical arguments, especially those associated with big data. Collectively, this course will help you internalize a core set of practical and effective machine learning methods and concepts, and apply them to solve some real world problems.

Jun 22nd 2026
4 Weeks