Bioinformatic Methods II (Coursera)

Offered by University of Toronto,
Bioinformatic Methods II (Coursera)

Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Topics covered include multiple sequence alignments, phylogenetics, gene expression data analysis, and protein interaction networks, in two separate parts. The first part, Bioinformatic Methods I, dealt with databases, Blast, multiple sequence alignments, phylogenetics, selection analysis and metagenomics. This, the second part, Bioinformatic Methods II, will cover motif searching, protein-protein interactions, structural bioinformatics, gene expression data analysis, and cis-element predictions. This pair of courses is useful to any student considering graduate school in the biological sciences, as well as students considering molecular medicine.
Course 2 of 4 in the Plant Bioinformatic Methods Specialization.

Syllabus

Week 1
Protein Motifs
In this module we'll be exploring conserved regions within protein families. Such regions can help us understand the biology of a sequence, in that they are likely important for biological function, and also be used to help ascribe function to sequences where we can't identify any homologs in the databases. There are various ways of describing the conserved regions from simple regular expressions to profiles to profile hidden Markov models (HMMs).

Week 2
Protein-Protein Interactions
In this module we'll be exploring protein-protein interactions (PPIs). Protein-protein interactions are important as proteins don't act in isolation, and often an examination of the interaction partners (determined in an unbiased, perhaps high throughput way) of a given protein can tell us a lot about its biology. We'll talk about some different methods used to determine PPIs and go over their strengths and weaknesses. In the lab we'll use 3 different tools and two different databases to examine interaction partners of BRCA2, a protein that we examined in last module's lab. Finally, we'll touch on a "foundational" concept, Gene Ontology (GO) term enrichment analysis, to help us understand in an overview way the proteins interacting with our example.

Week 3
Protein Structure
The determination of a protein's tertiary structure in three dimensions can tell us a lot about the biology of that protein. In this module's mini-lecture, we'll talk about some different methods used to determine a protein's tertiary structure and cover the main database for protein structure data, the PDB. In the lab we'll explore the PDB and an online tool for searching for structural (as opposed to sequence) similarity, VAST. We'll then use a nice piece of stand-alone software, PyMOL, to explore several protein structures in more detail.

Week 4
Review: Protein Motifs, Protein-Protein Interactions, and Protein Structure

Week 5
Gene Expression Analysis I
When and where genes are expressed (active) in tissues or cells is one of the main determinants of what makes that tissue or cell the way it is, both in terms of morphology and in terms of response to external stimuli. Several different methods exist for generating gene expression levels for all of the genes in the genome in tissues or even at cell-type-specific resolution. In this class we'll be processing and then examining some gene expression data generated using RNA-seq. We'll explore one of the main databases for RNA-seq expression data, the Sequence Read Archive (SRA), and then use an open-source suite of programs in R called BioConductor to process the raw reads from 4 RNA-seq data sets, to summarize their expression levels, to select significantly differentially expressed genes, and finally to visualize these as a heat map.

Week 6
Gene Expression Analysis II
When and where genes are expressed (active) in tissues or cells is one of the main determinants of what makes that tissue or cell the way it is, both in terms of morphology and in terms of response to external stimuli. Several different methods exist for generating gene expression levels for all of the genes in the genome in tissues or even at cell-type-specific resolution. In this class we'll be hierarchically clustering our significantly differentially expressed genes from last time using BioConductor and the built-in function of an online tool, called Expression Browser. Then we'll be using another online tool that uses a similarity metric, the Pearson correlation coefficient, to identify genes responding in a similar manner to our gene of interest, in this case AP3. We'll use a second tool, ATTED-II to corroborate our gene list. We'll also be exploring some online databases of gene expression and an online tool for doing a Gene Ontology enrichment analysis.

Week 7
Cis Regulatory Systems
When and where genes are expressed in tissues or cells is one of the main determinants of what makes that tissue or cell the way it is, both in terms of morphology and in terms of response to external stimuli. Gene expression is controlled in part by the presence of short sequences in the promoters (and other parts) of genes, called cis-elements, which permit transcription factors and other regulatory proteins to bind to direct the patterns of expression in certain tissues or cells or in response to environmental stimuli: We'll explore a couple of sets of promoters of genes that are coexpressed with AP3 from Arabidopsis, and with INSULIN from human, for the presence of known cis-elements, and we'll also try to predict some new ones using a couple of different methods.

Week 8
Review: Gene Expression Analysis and Cis Regulatory Systems + Final Assignment

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Market Research and Consumer Behavior (Coursera) Coursera
IE Business School

Market Research and Consumer Behavior (Coursera)

Your marketing quest begins here! The first course in this specialization lays the neccessary groundwork for an overall successful marketing strategy. It is separated into two sections: Market Research and Consumer Behavior. Gain the tools and techniques to translate a decision problem into a research question in the Market Research module. Learn how to design a research plan, analyze the data gathered and accurately interpret and communicate survey reports, translating the results into practical recommendations.

Jun 8th 2026
4 Weeks
Graph Analytics for Big Data (Coursera) Coursera
University of California, San Diego

Graph Analytics for Big Data (Coursera)

Want to understand your data network structure and how it changes under different conditions? Curious to know how to identify closely interacting clusters within a graph? Have you heard of the fast-growing area of graph analytics and want to learn more? This course gives you a broad overview of the field of graph analytics so you can learn new ways to model, store, retrieve and analyze graph-structured data.

Jun 8th 2026
5-12 Weeks
Infonomics II: Business Information Management and Measurement (Coursera) Coursera
University of Illinois at Urbana-Champaign

Infonomics II: Business Information Management and Measurement (Coursera)

Even decades into the Information Age, accounting practices yet fail to recognize the financial value of information. Moreover, traditional asset management practices fail to recognize information as an asset to be managed with earnest discipline. This has led to a business culture of complacence, and the inability for most organizations to fully leverage available information assets. This second course in the two-part Infonomics series explores how and why to adapt well-honed asset management principles and practices to information, and how to apply accepted and new valuation models to gauge information’s potential and realized economic benefits.

Jun 10th 2026
4 Weeks
Framework for Data Collection and Analysis (Coursera) Coursera
University of Maryland, College Park

Framework for Data Collection and Analysis (Coursera)

This course will provide you with an overview over existing data products and a good understanding of the data collection landscape. With the help of various examples you will learn how to identify which data sources likely matches your research question, how to turn your research question into measurable pieces, and how to think about an analysis plan.

Jun 8th 2026
4 Weeks
Machine Learning Foundations: A Case Study Approach (Coursera) Coursera
University of Washington

Machine Learning Foundations: A Case Study Approach (Coursera)

Do you have data and wonder what it can tell you? Do you need a deeper understanding of the core ways in which machine learning can improve your business? Do you want to be able to converse with specialists about anything from regression and classification to deep learning and recommender systems? In this course, you will get hands-on experience with machine learning from a series of practical case-studies.

Jun 8th 2026
5-12 Weeks
Introducción a Data Science: Programación Estadística con R (Coursera) Coursera
Universidad Nacional Autónoma de México

Introducción a Data Science: Programación Estadística con R (Coursera)

Este curso te proporcionará las bases del lenguaje de programación estadística R, la lengua franca de la estadística, el cual te permitirá escribir programas que lean, manipulen y analicen datos cuantitativos. Te explicaremos la instalación del lenguaje; también verás una introducción a los sistemas base de gráficos y al paquete para graficar ggplot2, para visualizar estos datos. Además también abordarás la utilización de uno de los IDEs más populares entre la comunidad de usuarios de R, llamado RStudio.

Jun 8th 2026
4 Weeks
Paleontology: Early Vertebrate Evolution (Coursera) Coursera
University of Alberta

Paleontology: Early Vertebrate Evolution (Coursera)

Paleontology: Early Vertebrate Evolution is a four-lesson course teaching a comprehensive overview of the origin of vertebrates. Students will explore the diversity of Palaeozoic lineages within a phylogenetic and evolutionary framework .This course examines the evolution of major vertebrate novelties including the origin of fins, jaws, and tetrapod limbs. Students also explore key Canadian fossil localities, including the Burgess Shale (British Columbia), Miguasha (Quebec), and Man On The Hill (Northwest Territories).

Jun 10th 2026
4 Weeks
Network Analysis in Systems Biology (Coursera) Coursera
Icahn School of Medicine at Mount Sinai

Network Analysis in Systems Biology (Coursera)

An introduction to data integration and statistical methods used in contemporary Systems Biology, Bioinformatics and Systems Pharmacology research. The course covers methods to process raw data from genome-wide mRNA expression studies (microarrays and RNA-seq) including data normalization, differential expression, clustering, enrichment analysis and network construction. The course contains practical tutorials for using tools and setting up pipelines, but it also covers the mathematics behind the methods applied within the tools.

Jun 8th 2026
5-12 Weeks
Exploratory Data Analysis (Coursera) Coursera
Johns Hopkins University

Exploratory Data Analysis (Coursera)

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.

Jun 8th 2026
4 Weeks
Business Intelligence Concepts, Tools, and Applications (Coursera) Coursera
University of Colorado System

Business Intelligence Concepts, Tools, and Applications (Coursera)

This is the fourth course in the Data Warehouse for Business Intelligence specialization. Ideally, the courses should be taken in sequence. In this course, you will gain the knowledge and skills for using data warehouses for business intelligence purposes and for working as a business intelligence developer. You’ll have the opportunity to work with large data sets in a data warehouse environment and will learn the use of MicroStrategy's Online Analytical Processing (OLAP) and Visualization capabilities to create visualizations and dashboards.

Jun 8th 2026
5-12 Weeks