Coursera

Finding Mutations in DNA and Proteins (Bioinformatics VI) (Coursera)

Offered by University of California, San Diego,

In previous courses in the Specialization, we have discussed how to sequence and compare genomes. This course will cover advanced topics in finding mutations lurking within DNA and proteins. In the first half of the course, we would like to ask how an individual's genome differs from the "reference genome" of the species.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Our goal is to take small fragments of DNA from the individual and "map" them to the reference genome. We will see that the combinatorial pattern matching algorithms solving this problem are elegant and extremely efficient, requiring a surprisingly small amount of runtime and memory.
In the second half of the course, we will learn how to identify the function of a protein even if it has been bombarded by so many mutations compared to similar proteins with known functions that it has become barely recognizable. This is the case, for example, in HIV studies, since the virus often mutates so quickly that researchers can struggle to study it. The approach we will use is based on a powerful machine learning tool called a hidden Markov model.
Finally, you will learn how to apply popular bioinformatics software tools applying hidden Markov models to compare a protein against a related family of proteins.
Course 6 of 7 in the Bioinformatics Specialization.

Syllabus

WEEK 1
Introduction to Read Mapping
In this class, we will consider the following two central biological questions (the computational approaches needed to solve them are shown in parentheses): How Do We Locate Disease-Causing Mutations? (Combinatorial Pattern Matching)Why Have Biologists Still Not Developed an HIV Vaccine?(Hidden Markov Models)

WEEK 2
The Burrows-Wheeler Transform
This week, we will introduce a paradigm called the Burrows-Wheeler transform; after seeing how it can be used in string compression, we will demonstrate that it is also the foundation of modern read-mapping algorithms.

WEEK 3
Speeding Up Burrows-Wheeler Read Mapping
Last week, we saw how the Burrows-Wheeler transform could be applied to multiple pattern matching. This week, we will speed up our algorithm and generalize it to the case that patterns have errors, which models the biological problem of mapping reads with errors to a reference genome.

WEEK 4
Introduction to Hidden Markov Models
This week, we will start examining the case of aligning sequences with many mutations -- such as related genes from different HIV strains -- and see that our problem formulation for sequence alignment is not adequate for highly diverged sequences. To improve our algorithms, we will introduce a machine-learning paradigm called a hidden Markov model and see how dynamic programming helps us answer questions about these models.

WEEK 5
Profile HMMs for Sequence Alignment
Last week, we introduced hidden Markov models. This week, we will see how hidden Markov models can be applied to sequence alignment with a profile HMM. We will then consider some advanced topics in this area, which are related to advanced methods that we considered in a previous course for clustering.

WEEK 6
Bioinformatics Application Challenge
This week brings our Application Challenge, in which we apply the HMM sequence alignment algorithms that we have developed.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

University of California, San Diego

Finding Hidden Messages in DNA (Bioinformatics I) (Coursera)

Sci: Biology & Life Sciences Health & Society

This course begins a series of classes illustrating the power of computing in modern biology. Please join us on the frontier of bioinformatics to look for hidden messages in DNA without ever needing to put on a lab coat. In the first half of the course, we investigate DNA replication, and ask the question, where in the genome does DNA replication begin? We will see that we can answer this question for many bacteria using only some straightforward algorithms to look for hidden messages in the genome.

Jul 20th 2026

5-12 Weeks

Biology Algorithms DNA

Coursera

University of Illinois at Urbana-Champaign

Genomics: Decoding the Universal Language of Life (Coursera)

Sci: Biology & Life Sciences

What is a genome? A genome contains all of the information that a cell needs to develop, function, and reproduce itself, and all the information needed for those cells to come together to form a person, plant, or animal. Genomes contain an organism’s complete set of genes, and also the even tinier genetic structures that help regulate when and how those genes are used. The ability to regrow a torn ligament, the clues that might predict the onset of mental illness, the nutritional potential of crops, and even the history of life itself, are all encoded in genomes. By taking this course, you will discover how scientists are deciphering the language of genomes to learn how to develop sustainable food and fuel supplies, improve disease treatment and prevention, and protect our environment.

Aug 3rd 2026

5-12 Weeks

Health Biology Genes

Coursera

University of California, San Diego

Hacking COVID-19 — Course 5: Tracing SARS-CoV-2's Evolution (Coursera)

Health & Society

INTROIn this course, you will follow in the footsteps of the bioinformaticians investigating the COVID-19 outbreak by tracing the evolution of SARS-CoV-2. Whether you’re new to the world of computational biology, or you’re a bioinformatics expert seeking to learn about its applications in the COVID-19 pandemic, or somewhere in between, this course is for you! As you go through this journey, we will introduce and explain genomic concepts and give you many opportunities to practice your skills, and we will provide a series of problems with gradually increasing complexity.

Jul 27th 2026

2 Weeks

Bioinformatics Genomics COVID-19

Coursera

Johns Hopkins University

Introduction to Genomic Technologies (Coursera)

Statistics & Data Analysis Data Science

This course introduces you to the basic biology of modern genomics and the experimental tools that we use to measure it. We'll introduce the Central Dogma of Molecular Biology and cover how next-generation sequencing can be used to measure DNA, RNA, and epigenetic patterns. You'll also get an introduction to the key concepts in computing and data science that you'll need to understand how data from next-generation sequencing experiments are generated and analyzed.

Jul 20th 2026

4 Weeks

Biology DNA RNA

Coursera

Johns Hopkins University

Python for Genomic Data Science (Coursera)

Statistics & Data Analysis Data Science

This class provides an introduction to the Python programming language and the iPython notebook. This is the third course in the Genomic Big Data Science Specialization from Johns Hopkins University.

Jul 20th 2026

4 Weeks

Programming Python Big Data

Coursera

University of Virginia

Bacterial Bioinformatics (Coursera)

Health & Society CS: Information & Technology

This course provides demonstrations and exercises for performing common genomics-based analysis tasks of bacterial sequence data. It uses PATRIC, the PathoSystems Resource Integration Center, as the platform for analysis. PATRIC is the NIH/NIAID-funded bacterial Bioinformatics Resource Center, providing comprehensive bacterial genomic data with integrated analysis tools and visualizations.

Jul 20th 2026

5-12 Weeks

Bioinformatics Genomic Data Bacterial Genomes

Coursera

University of Geneva

Classical papers in molecular genetics (Coursera)

Sci: Biology & Life Sciences Health & Society

You have all heard about the DNA double helix and genes. Many of you know that mutations occur randomly, that the DNA sequence is read by successive groups of three bases (the codons), that many genes encode enzymes, and that gene expression can be regulated. These concepts were proposed on the basis of astute genetic experiments, as well as often on biochemical results. The original articles were these concepts appeared are however not frequently part of the normal curriculum of biologists, biochemists and medical students.

Jul 27th 2026

5-12 Weeks

Genetics Genes DNA

Coursera

University of Toronto

Plant Bioinformatics (Coursera)

Health & Society Computer Science

The past 15 years have been exciting ones in plant biology. Hundreds of plant genomes have been sequenced, RNA-seq has enabled transcriptome-wide expression profiling, and a proliferation of "-seq"-based methods has permitted protein-protein and protein-DNA interactions to be determined cheaply and in a high-throughput manner. These data sets in turn allow us to generate hypotheses at the click of a mouse.

Jul 27th 2026

5-12 Weeks

Bioinformatics Plant Biology Plant Bioinformatic Methods Specialization

Coursera

Johns Hopkins University

Bioconductor for Genomic Data Science (Coursera)

Statistics & Data Analysis Data Science

Learn to use tools from the Bioconductor project to perform analysis of genomic data. This is the fifth course in the Genomic Big Data Specialization from Johns Hopkins University.

Jul 20th 2026

4 Weeks

Bioinformatics Data Analysis Data Science

Coursera

McMaster University

DNA Decoded (Coursera)

Health & Society Science

Are you a living creature? Then, congratulations! You’ve got DNA. But how much do you really know about the microscopic molecules that make you unique? Why is DNA called the “blueprint of life”? What is a “DNA fingerprint”? How do scientists clone DNA? What can DNA teach you about your family history? Are Genetically Modified Organisms (GMOs) safe? Is it possible to revive dinosaurs by cloning their DNA?

Aug 10th 2026

4 Weeks

Genetics DNA Translation

Coursera

University of Geneva

Chemical Biology (Coursera)

Sci: Chemistry

Chemical biology is a burgeoning field that has rapidly risen to prominence. This surge of interest has been fuelled by chemical biology’s applicability to understanding critical processes in live cells or model organisms in real time. This success has arisen because chemical biology straddles a nexus between chemistry, biology, and physics. Thus, chemical biology can harness rapid chemistry to observe or perturb biological processes, that are in turn reported using physical assays, all in an otherwise unperturbed living entity. Although its boundaries are endless, the multidisciplinary nature of chemical biology can make the field seem daunting; we beg to differ! Here, we deconstruct chemical biology into its core components, and repackage the material.

Aug 3rd 2026

5-12 Weeks

Biology Chemistry Physical Sciences

Coursera

University of Colorado Boulder

The Little Stuff: Energy, Cells, and Genetics (Coursera)

Sci: Biology & Life Sciences

In this course, we will explore the smaller side of biology: molecular biology. We’ll cover basic topics including cell biology and how cells can go “rogue” and turn into cancer, how energy from the sun is transferred to fuel our bodies, basics of genetics and inheritance, and genetic technologies. At the end of this course, we will discuss ethical and moral implications of several exciting and new genetic technologies.

Jul 27th 2026

4 Weeks

Biology Genetics Energy