Coursera

Comparing Genes, Proteins, and Genomes (Bioinformatics III) (Coursera)

Offered by University of California, San Diego,

Once we have sequenced genomes in the previous course, we would like to compare them to determine how species have evolved and what makes them different. In the first half of the course, we will compare two short biological sequences, such as genes (i.e., short sequences of DNA) or proteins. We will encounter a powerful algorithmic tool called dynamic programming that will help us determine the number of mutations that have separated the two genes/proteins.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

In the second half of the course, we will "zoom out" to compare entire genomes, where we see large scale mutations called genome rearrangements, seismic events that have heaved around large blocks of DNA over millions of years of evolution. Looking at the human and mouse genomes, we will ask ourselves: just as earthquakes are much more likely to occur along fault lines, are there locations in our genome that are "fragile" and more susceptible to be broken as part of genome rearrangements? We will see how combinatorial algorithms will help us answer this question.
Finally, you will learn how to apply popular bioinformatics software tools to solve problems in sequence alignment, including BLAST.
Course 3 of 7 in the Bioinformatics Specialization.

Syllabus

WEEK 1
Introduction to Sequence Alignment
If you joined us in the previous course in this Specialization, then you became an expert at assembling genomes and sequencing antibiotics. The next natural question to ask is how to compare DNA and amino acid sequences. This question will motivate this week's discussion of sequence alignment, which is the first of two questions that we will ask in this class (the algorithmic methods used to answer them are shown in parentheses): How Do We Compare DNA Sequences? (Dynamic Programming)Are There Fragile Regions in the Human Genome? (Combinatorial Algorithms)

WEEK 2
From Finding a Longest Path to Aligning DNA Strings
Last week, we saw how touring around Manhattan and making change in a Roman shop help us find a longest common subsequence of two DNA or protein strings. This week, we will study how to find a highest scoring alignment of two strings. We will see that regardless of the underlying assumptions that we make regarding how the strings should be aligned, we will be able to phrase our alignment problem as an instance of finding the longest path in a directed acyclic graph.

WEEK 3
Advanced Topics in Sequence Alignment
Last week, we saw how a variety of different applications of sequence alignment can all be reduced to finding the longest path in a Manhattan-like graph. This week, we will conclude the current chapter by considering a few advanced topics in sequence alignment. For example, if we need to align long strings, our current algorithm will consume a huge amount of memory. Can we find a more memory-efficient approach? And what should we do when we move from aligning just two strings at a time to aligning many strings?

WEEK 4
Genome Rearrangements and Fragility
You now know how to compare two DNA (or protein) strings. But what if we wanted to compare entire genomes? When we "zoom out" to the genome level, we find that substitutions, insertions, and deletions don't tell the whole story of evolution: we need to model more dramatic evolutionary events known as genome rearrangements, which wrench apart chromosomes and put them back together in a new order. A natural question to ask is whether there are "fragile regions" hidden in your genome where chromosome breakage has occurred more often over millions of years. This week, we will begin addressing this question by asking how we can compute the number of rearrangements on the evolutionary path connecting two species.

WEEK 5
Applying Genome Rearrangement Analysis to Find Genome Fragility
Last week, we asked whether there are fragile regions in the human genome. Then, we took a lengthy detour to see how to compute a distance between species genomes, a discussion that we will continue this week. It is probably unclear how computing the distance between two genomes can help us understand whether fragile regions exist. If so, please stay tuned -- we will see that the connection between these two concepts will yield a surprising conclusion to the class.

WEEK 6
Bioinformatics Application Challenge
In the sixth and final week of the course, we will apply sequence alignment algorithms to infer the non-ribosomal code.

Suggested Readings:
Bioinformatics Algorithms An Active Learning Approach

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

University of Virginia

Bacterial Bioinformatics (Coursera)

Health & Society CS: Information & Technology

This course provides demonstrations and exercises for performing common genomics-based analysis tasks of bacterial sequence data. It uses PATRIC, the PathoSystems Resource Integration Center, as the platform for analysis. PATRIC is the NIH/NIAID-funded bacterial Bioinformatics Resource Center, providing comprehensive bacterial genomic data with integrated analysis tools and visualizations.

Aug 17th 2026

5-12 Weeks

Bioinformatics Genomic Data Bacterial Genomes

Coursera

University of Michigan

Uso de bases de datos con Python (Coursera)

CS: Software Engineering Computer Science

Este curso presentará a los estudiantes los conceptos básicos del lenguaje de consulta estructurado (Structured Query Language, SQL), así como el diseño básico de bases de datos para almacenar datos como parte de una iniciativa de varios pasos para recopilar, analizar y procesar datos. El curso utilizará SQLite3 como base de datos. También crearemos rastreadores web y procesos de visualización y recopilación de datos de varios pasos. Utilizaremos la biblioteca D3.js para realizar la visualización básica de datos.

Aug 17th 2026

5-12 Weeks

Programming Python Databases

Coursera

Edureka

Introduction to PySpark (Coursera)

CS: Software Engineering

Welcome to Introduction to PySpark, a short course strategically crafted to empower you with the skills needed to assess the concepts of Big Data Management and efficiently perform data analysis using PySpark. Throughout this short course, you will acquire the expertise to perform data processing with PySpark, enabling you to efficiently handle large-scale datasets, conduct advanced analytics, and derive valuable insights from diverse data sources.

Aug 17th 2026

1 Week

Python Big Data Data Analysis

Coursera

Illinois Tech

Introduction to Open Source Application Development (Coursera)

CS: Software Engineering

This course introduces basic concepts of systems programming using a modern open source language. You will learn to apply basic programming concepts toward solving problems, writing pseudocode, working with and effectively using basic data types, abstract data types, control structures, code modularization and arrays. You will learn to detect errors, work with variables and loops, and discover how functions, methods, and operators work with different data types. You will also be introduced to the object paradigm including classes, inheritance, and polymorphism.

Aug 17th 2026

5-12 Weeks

Programming Python Object-Oriented Programming

Coursera

Johns Hopkins University

Python for Genomic Data Science (Coursera)

Statistics & Data Analysis Data Science

This class provides an introduction to the Python programming language and the iPython notebook. This is the third course in the Genomic Big Data Science Specialization from Johns Hopkins University.

Aug 17th 2026

4 Weeks

Programming Python Big Data

Coursera

Edureka

Gen AI for Code Generation for Python (Coursera)

CS: Software Engineering

Welcome to the 'Gen AI for Code Generation for Python' course, where you'll embark on a journey to explore and develop your skills in the art of code generation with Generative AI. Throughout this short course, you will delve into various techniques for generating Python code effortlessly, ranging from simple scripts to complete end-to-end projects.

Aug 17th 2026

1 Week

Python Artificial Intelligence Coding

Coursera

University of California, San Diego

Finding Mutations in DNA and Proteins (Bioinformatics VI) (Coursera)

Sci: Biology & Life Sciences Computer Science

In previous courses in the Specialization, we have discussed how to sequence and compare genomes. This course will cover advanced topics in finding mutations lurking within DNA and proteins. In the first half of the course, we would like to ask how an individual's genome differs from the "reference genome" of the species.

Aug 17th 2026

5-12 Weeks

DNA Bioinformatics Proteins

Coursera

University of Illinois at Urbana-Champaign

Accounting Data Analytics with Python (Coursera)

Statistics & Data Analysis Data Science

This course focuses on developing Python skills for assembling business data. It will cover some of the same material from Introduction to Accounting Data Analytics and Visualization, but in a more general purpose programming environment (Jupyter Notebook for Python), rather than in Excel and the Visual Basic Editor. These concepts are taught within the context of one or more accounting data domains (e.g., financial statement data from EDGAR, stock data, loan data, point-of-sale data).

Aug 17th 2026

5-12 Weeks

Python SQL Linear Regression

Coursera

University of Cape Town

Julia Scientific Programming (Coursera)

Statistics & Data Analysis Data Science

This four-module course introduces users to Julia as a first language. Julia is a high-level, high-performance dynamic programming language developed specifically for scientific computing. This language will be particularly useful for applications in physics, chemistry, astronomy, engineering, data science, bioinformatics and many more.

Aug 17th 2026

4 Weeks

IPython Data Analysis Computer Programming

Coursera

University of Minnesota

Introduction to Human Behavioral Genetics (Coursera)

Sci: Biology & Life Sciences Health & Society

Behavioral genetic methodologies from twin and adoption studies through DNA analysis will be described and applied to address longstanding questions about the origins of individual differences in behavioral traits.

Aug 10th 2026

5-12 Weeks

Genetics Psychology Behavior

Coursera

Johns Hopkins University

Introduction to Genomic Technologies (Coursera)

Statistics & Data Analysis Data Science

This course introduces you to the basic biology of modern genomics and the experimental tools that we use to measure it. We'll introduce the Central Dogma of Molecular Biology and cover how next-generation sequencing can be used to measure DNA, RNA, and epigenetic patterns. You'll also get an introduction to the key concepts in computing and data science that you'll need to understand how data from next-generation sequencing experiments are generated and analyzed.

Aug 17th 2026

4 Weeks

Biology DNA RNA

Coursera

Nanyang Technological University

Introduction to Forensic Science (Coursera)

Sci: Biology & Life Sciences Sci: Chemistry

We have all seen forensic scientists in TV shows, but how do they really work? What is the science behind their work? The course aims to explain the scientific principles and techniques behind the work of forensic scientists and will be illustrated with numerous case studies from Singapore and around the world.

Aug 24th 2026

5-12 Weeks

Science Biology Chemistry