Introduction to Computer Vision (Coursera)

Introduction to Computer Vision (Coursera)

Introduction to Computer Vision guides learners through the essential algorithms and methods to help computers 'see' and interpret visual data. You will first learn the core concepts and techniques that have been traditionally used to analyze images. Then, you will learn modern deep learning methods, such as neural networks and specific models designed for image recognition, and how it can be used to perform more complex tasks like object detection and image segmentation. Additionally, you will learn the creation and impact of AI-generated images and videos, exploring the ethical considerations of such technology.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

What you'll learn

  • Understand the fundamental principles and algorithms of classical computer vision.
  • Apply deep learning models to various computer vision tasks.
  • Evaluate and implement computer vision solutions for real-world applications.

Syllabus

Week 1
This module introduces foundational concepts related to common image types and functions. It offers a comprehensive overview of different formats and their unique characteristics. This section establishes the context for understanding how images are represented and processed in various applications. Next, the module delves into image functions, explaining the basic operations that can be performed on images to enhance or manipulate them, such as cropping, resizing, or adjusting brightness. It also covers more advanced operations like filtering and thresholding, illustrating how these functions play a crucial role in image processing. Then the module explores the underlying mathematics of image transformations. It starts with linear transforms, highlighting their application in image scaling, rotation, and translation. The module then introduces homogeneous coordinates, providing a simplified approach to represent complex transformations with additional dimensions. This leads into a deeper exploration of homogeneous transformations, demonstrating how they are used to perform multiple transformations in a single step.

Week 2
This module provides a deep dive into image analysis and similarity assessment techniques. It starts by exploring the basic concept of comparing pixels, highlighting how individual pixel values can be used to gauge similarity. This is followed by a detailed discussion on comparing multiple images by their features, emphasizing the advantages of feature-based analysis over pixel-by-pixel comparison. The module introduces the concept of image moments, revealing how these statistical properties help identify shapes and patterns within images. The module then addresses similarity and distance, offering a quick overview of how these concepts are calculated and applied in image processing. You'll also learn about converting pixels into distributions, an essential technique for more complex analysis. This leads to a comprehensive explanation of cross-entropy, providing insights into its role in measuring the dissimilarity between distributions. You'll explore cross-correlation in 1D, followed by a deeper examination of cross-correlation as matrix multiplication. The module wraps up by exploring cross-correlation in more detail, with a focus on the mathematics behind it.

Week 3
This module delves into multiview geometry, a pivotal concept in computer vision and 3D modeling. It starts with a brief overview of the motivation behind multiview systems, highlighting the advantages of capturing scenes from multiple viewpoints. The module then discusses multiple coordinate systems, exploring how different reference frames can describe points and transformations in 3D space. You'll also learn about multiple viewing planes, which play a crucial role in multiview setups by providing unique perspectives for scene reconstruction. The focus shifts to multiview projection, examining how distinct images from multiple cameras can be used to create a cohesive 3D scene. You'll gain insights into the principles of translation and rotation in 3D, crucial for understanding camera movement and orientation. The module also covers camera translation and camera rotation, offering practical examples to illustrate how camera motion affects the geometry and visual representation of a scene.

Week 4
This module delves into key concepts of camera models and their role in computer vision and photogrammetry. Learn about the Extrinsic Matrix, exploring how it defines the position and orientation of a camera in 3D space. Understand the Pinhole Camera Model, a simplified optical system that forms the basis for many computer vision applications, alongside the Intrinsic Matrix, which captures the internal parameters of the camera. Epipolar geometry is examined, with a focus on its significance in 3D reconstruction and stereo vision. The module covers the motivation behind epipolar geometry, breaking down its basic components, and explaining the Essential Matrix, which encapsulates the geometric relationship between camera views, as well as the Fundamental Matrix, a core component in epipolar geometry that represents the relationship between two cameras in stereo vision.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Image Segmentation, Filtering, and Region Analysis (Coursera) Coursera
MathWorks

Image Segmentation, Filtering, and Region Analysis (Coursera)

In this course, you will build on the skills learned in Introduction to Image Processing to work through common complications such as noise. You’ll use spatial filters to deal with different types of artifacts. You’ll learn new approaches to segmentation such as edge detection and clustering. You’ll also analyze regions of interest and calculate properties such as size, orientation, and location.

Jun 8th 2026
4 Weeks
Neural Networks and Deep Learning (Coursera) Coursera
DeepLearning.AI

Neural Networks and Deep Learning (Coursera)

If you want to break into cutting-edge AI, this course will help you do so. Deep learning engineers are highly sought after, and mastering deep learning will give you numerous new career opportunities. Deep learning is also a new "superpower" that will let you build AI systems that just weren't possible a few years ago. In this course, you will learn the foundations of deep learning.

Jun 8th 2026
4 Weeks
Introduction to Image Processing (Coursera) Coursera
MathWorks

Introduction to Image Processing (Coursera)

In this introduction to image processing, you'll take your first steps in accessing and adjusting digital images for analysis and processing. You will load, save, and adjust image size and orientation while also understanding how digital images are recognized. You will then perform basic segmentation and quantitative analysis. Lastly, you will enhance the contrast of images to make objects of interest easier to identify.

Jun 8th 2026
4 Weeks
Computer Vision with Embedded Machine Learning (Coursera) Coursera
Edge Impulse

Computer Vision with Embedded Machine Learning (Coursera)

Computer vision (CV) is a fascinating field of study that attempts to automate the process of assigning meaning to digital images or videos. In other words, we are helping computers see and understand the world around us! A number of machine learning (ML) algorithms and techniques can be used to accomplish CV tasks, and as ML becomes faster and more efficient, we can deploy these techniques to embedded systems.

Jun 8th 2026
3 Weeks
Introduction to Deep Learning & Neural Networks with Keras (Coursera) Coursera
IBM

Introduction to Deep Learning & Neural Networks with Keras (Coursera)

Looking to start a career in Deep Learning? Look no further. This course will introduce you to the field of deep learning and help you answer many questions that people are asking nowadays, like what is deep learning, and how do deep learning models compare to artificial neural networks? You will learn about the different deep learning models and build your first deep learning model using the Keras library.

Jun 8th 2026
5-12 Weeks
Structuring Machine Learning Projects (Coursera) Coursera
DeepLearning.AI

Structuring Machine Learning Projects (Coursera)

You will learn how to build a successful machine learning project. If you aspire to be a technical leader in AI, and know how to set direction for your team's work, this course will show you how. Much of this content has never been taught elsewhere, and is drawn from my experience building and shipping many deep learning products. This course also has two "flight simulators" that let you practice decision-making as a machine learning project leader. This provides "industry experience" that you might otherwise get only after years of ML work experience.

Jun 8th 2026
2 Weeks
AI Capstone Project with Deep Learning (Coursera) Coursera
IBM

AI Capstone Project with Deep Learning (Coursera)

In this capstone, learners will apply their deep learning knowledge and expertise to a real world challenge. They will use a library of their choice to develop and test a deep learning model. They will load and pre-process data for a real problem, build the model and validate it. Learners will then present a project report to demonstrate the validity of their model and their proficiency in the field of Deep Learning.

Jun 8th 2026
4 Weeks
NLP Modelos y Algoritmos (Coursera) Coursera
Universidad Austral

NLP Modelos y Algoritmos (Coursera)

Este curso te brindará los conocimientos necesarios para la implementación de algoritmos de NLP. Mediante el uso de los últimos algoritmos más populares en NLP se procederá a dar solución a un conjunto de problemas propios del área. Para realizar este curso es necesario contar con conocimientos de programación de nivel básico a medio, deseablemente conocimiento básico del lenguaje Python y es recomendable conocer los Jupyter Notebooks en el entorno Anaconda.

Jun 8th 2026
4 Weeks
Object Localization with TensorFlow (Coursera) Coursera
Coursera Project Network

Object Localization with TensorFlow (Coursera)

Welcome to this 2 hour long guided project on creating and training an Object Localization model with TensorFlow. In this guided project, we are going to use TensorFlow's Keras API to create a convolutional neural network which will be trained to classify as well as localize emojis in images. Localization, in this context, means the position of the emojis in the images. This means that the network will have one input and two outputs.

Jun 8th 2026
Self-Paced
Getting Started with Machine Learning at the Edge on Arm (Coursera) Coursera
Arm

Getting Started with Machine Learning at the Edge on Arm (Coursera)

The age of machine learning has arrived! Arm technology is powering a new generation of connected devices with sophisticated sensors that can collect a vast range of environmental, spatial and audio/visual data. Typically this data is processed in the cloud using advanced machine learning tools that are enabling new applications reshaping the way we work, travel, live and play.

Jun 8th 2026
5-12 Weeks
Machine Learning and NLP Basics (Coursera) Coursera
Edureka

Machine Learning and NLP Basics (Coursera)

Welcome to the "Machine Learning and NLP Basics" course, a comprehensive learning resource designed for enthusiasts keen on mastering the foundational aspects of machine learning (ML) and natural language processing (NLP). This course is structured to provide a deep dive into the core concepts, algorithms, and applications of ML and NLP, preparing you for advanced exploration and application in these fields.

Jun 8th 2026
4 Weeks