Plant Bioinformatics (Coursera)

Offered by University of Toronto,
Plant Bioinformatics (Coursera)

The past 15 years have been exciting ones in plant biology. Hundreds of plant genomes have been sequenced, RNA-seq has enabled transcriptome-wide expression profiling, and a proliferation of "-seq"-based methods has permitted protein-protein and protein-DNA interactions to be determined cheaply and in a high-throughput manner. These data sets in turn allow us to generate hypotheses at the click of a mouse.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

For instance, knowing where and when a gene is expressed can help us narrow down the phenotypic search space when we don't see a phenotype in a gene mutant under "normal" growth conditions. Coexpression analyses and association networks can provide high-quality candidate genes involved in a biological process of interest. Using Gene Ontology enrichment analysis and pathway visualization tools can help us make sense of our own 'omics experiments and answer the question "what processes/pathways are being perturbed in our mutant of interest?"

Tools covered:
Module 1: Genomic DBs / Precomputed gene tress / Protein tools. Araport, TAIR, Gramene, EnsemblPlants Compara, PLAZA; SUBA4 and Cell eFP Browser, 1001 Genomes Browser
Module 2: Expression tools. eFP Browser / eFP-Seq Browser, Araport, Genevestigator, TravaDB, NCBI Genome Data Viewer for exploring RNA-seq data for many plant species other than Arabidopsis, MPSS database for small RNAs
Module 3: Coexpresseion tools. ATTED II, Expression Angler, AraNet, AtCAST2
Module 4: Promoter analysis. Cistome, Athena, ePlant
Module 5: Go enrichment analysis and pathway visualisation, AgriGO, AmiGO, Classification SuperViewer, TAIR, g:profiler, AraCyc, MapMan (optional: Plant Reactome)
Module 6: Network exploration. Arabidopsis Interactions Viewer 2, ePlant, TF2Network, Virtual Plant, GeneMANIA
Course 3 of 4 in the Plant Bioinformatic Methods Specialization.

Syllabus

Week 1
Plant Genomic Databases, and useful sites for info about proteins
In this module we'll be exploring several plant databases including Ensembl Plants, Gramene, PLAZA, SUBA, TAIR and Araport. The information in these databases allows us to easily identify functional regions within gene products, view subcellular localization, find homologs in other species, and even explore pre-computed gene trees to see if our gene of interest has undergone a gene duplication event in another species, all at the click of a mouse!

Week 2
Expression Analysis
Vast databases of gene expression and nifty visualization tools allow us to explore where and when a gene is expressed. Often this information can be used to help guide a search for a phenotype if we don't see a phenotype in a gene mutant under "normal" growth conditions. We explore several tools for Arabidopsis data (eFP Browser, Genevestigator, TraVA DB, Araport) along with NCBI's Genome Data Viewer for RNA-seq data for other plant species. We also examine the MPSS database of small RNAs and degradation products to see if our example gene has any potential microRNA targets.

Week 3
Coexpression Tools
Being able to group genes by similar patterns of expression across expression data sets using algorithms like WGCNA is a very useful way of organizing the data. Clusters of genes with similar patterns of expression can then be subject to Gene Ontology term enrichment analysis (see Module 5) or examined to see if they are part of the same pathway. What's even more powerful is being able to identify genes with similar patterns of expression without doing a single expression profiling experiment, by mining gene expression databases! There are several tools that allow you to do this in many plant species simply by entering a query gene identifier. The genes that are returned are often in the same biological process as the query gene, and thus this "guilt-by-association" paradigm is a excellent tool for hypothesis generation.

Week 4
Sectional Quiz 1

Week 5
Promoter Analysis
The regulation of gene expression is one of the main ways by which a plant can control the abundance of a gene product (post-translational modifications and protein degradation are some others). When and where a gene is expressed is controlled to a large extent by the presence of short sequence motifs, called cis-elements, present in the promoter of the gene. These in turn are regulated by transcription factors that perhaps get induced in response to environmental stresses or during specific developmental programs. Thus understanding which transcription factors can bind to which promoters can help us understand the role the downstream genes might be playing in a biological system.

Week 6
Functional Classification and Pathway Vizualization
Often the results of 'omics experiments are large lists of genes, such as those that are differentially expressed. We can use a "cherry picking" approach to explore individual genes in those lists but it's nice to be able to have an automated way of analyzing them. Here tools for performing Gene Ontology enrichment analysis are invaluable and can tell you if any particular biological processes or molecular functions are over-represented in your gene list. We'll explore AgriGO, AmiGO, tools at TAIR and the BAR, and g:Profiler, which all allow you to do such analyses. Another useful analysis is to be able to map your gene lists (along with associated e.g. expression values) onto pathway representations, and we'll use AraCyc and MapMan to do this. In this way it is easy to see if certain biosynthetic reactions are upregulated, which can help you interpret your 'omics data!

Week 7
Network Exploration (PPIs, PDIs, GRNs)
Molecules inside the cell rarely operate in isolation. Proteins act together to form complexes, or are part of signal transduction cascades. Transcription factors bind to cis-elements in promoters or elsewhere and can act as activators or repressors of transcription. MicroRNAs can affect transcription in other ways. One of the main themes to have emerged in the past two decades in biology is that of networks. In terms of protein-protein interaction networks, often proteins that are highly connected with others are crucial for biological function – when these “hubs” are perturbed, we see large phenotypic effects. The way that transcription factors interact with downstream promoters, some driving the expression of other transcription factors that in turn regulate genes combinatorially with upstream transcription factors can have an important biological effect in terms of modulating the kind of output achieved. The tools described in this lab can help us to explore molecular interactions in a network context, perhaps with the eventual goal of modeling the behaviour of a given system.

Week 8
Sectional Quiz 2 and Final Assignment

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Fundamental Skills in Bioinformatics (Coursera) Coursera
King Abdullah University of Science and Technology (KAUST)

Fundamental Skills in Bioinformatics (Coursera)

The course provides a broad and mainly practical overview of fundamental skills for bioinformatics (and, in general, data analysis). The aim is to support the simultaneous development of quantitative and programming skills for biological and biomedical students with little or no background in programming or quantitative analysis.

Jun 1st 2026
4 Weeks
Artificial Intelligence in Bioinformatics (FutureLearn) FutureLearn
Taipei Medical University

Artificial Intelligence in Bioinformatics (FutureLearn)

Discover the future of bioinformatics and learn how AI models of bioinformatics data help us to understand biological processes. Study the use of AI, machine learning, and deep learning in bioinformatics. This course will teach you the fundamentals of how AI is applied in the field of bioinformatics.

May 24th 2021
3 Weeks
Genomic Medicine: Transforming Patient Care in Diabetes (FutureLearn) FutureLearn
University of Exeter

Genomic Medicine: Transforming Patient Care in Diabetes (FutureLearn)

Learn how developments in genomics are transforming our knowledge and treatment of conditions such as diabetes. This course introduces genomics, using the University of Exeter’s research expertise in diabetes with patient cases, to illustrate the clinical application and impact of current knowledge.

Nov 2nd 2020
4 Weeks
Genomic Medicine Gets Personal (edX) EdX
Georgetown University,GeorgetownX

Genomic Medicine Gets Personal (edX)

This course will provide an introduction to genomic medicine and a better understanding of the issues associated with personal genomic information. While the advances in genomics promise to usher a new era in medical practice and create a major paradigm shift in patient care, the ethical, legal and social impact of genomic medicine will be equally significant. The information and potential use of genomic discoveries are no longer issues left for scientists and medical professionals to handle, but have become ones for the public at large.

No sessions available
4 Weeks
Bioinformatic Methods II (Coursera) Coursera
University of Toronto

Bioinformatic Methods II (Coursera)

Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.

Jun 8th 2026
5-12 Weeks
Bioinformatic Methods I (Coursera) Coursera
University of Toronto

Bioinformatic Methods I (Coursera)

Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.

Jun 1st 2026
5-12 Weeks
Genome Sequencing (Bioinformatics II) (Coursera) Coursera
University of California, San Diego

Genome Sequencing (Bioinformatics II) (Coursera)

You may have heard a lot about genome sequencing and its potential to usher in an era of personalized medicine, but what does it mean to sequence a genome? Biologists still cannot read the nucleotides of an entire genome as you would read a book from beginning to end. However, they can read short pieces of DNA. In this course, we will see how graph theory can be used to assemble genomes from these short pieces. We will further learn about brute force algorithms and apply them to sequencing mini-proteins called antibiotics.

May 25th 2026
5-12 Weeks
Big Data Science with the BD2K-LINCS Data Coordination and Integration Center (Coursera) Coursera
Icahn School of Medicine at Mount Sinai

Big Data Science with the BD2K-LINCS Data Coordination and Integration Center (Coursera)

In this course we briefly introduce the DCIC and the various Centers that collect data for LINCS. We then cover metadata and how metadata is linked to ontologies. We then present data processing and normalization methods to clean and harmonize LINCS data. This follow discussions about how data is served as RESTful APIs. Most importantly, the course covers computational methods including: data clustering, gene-set enrichment analysis, interactive data visualization, and supervised learning. Finally, we introduce crowdsourcing/citizen-science projects where students can work together in teams to extract expression signatures from public databases and then query such collections of signatures against LINCS data for predicting small molecules as potential therapeutics.

Jun 1st 2026
5-12 Weeks
Plant Bioinformatics Capstone (Coursera) Coursera
University of Toronto

Plant Bioinformatics Capstone (Coursera)

The past 15 years have been exciting ones in plant biology. Hundreds of plant genomes have been sequenced, RNA-seq has enabled transcriptome-wide expression profiling, and a proliferation of "-seq"-based methods has permitted protein-protein and protein-DNA interactions to be determined cheaply and in a high-throughput manner. These data sets in turn allow us to generate hypotheses at the click of a mouse or tap of a finger. In Plant Bioinformatics on Coursera.org, we covered 33 plant-specific online tools from genome browsers to transcriptomic data mining to promoter/network analyses and others, and in this Plant Bioinformatics Capstone we'll use these tools to hypothesize a biological role for a gene of unknown function, summarized in a written lab report.

Jun 8th 2026
5-12 Weeks
Making sense of genomic data: COVID-19 web-based bioinformatics (FutureLearn) FutureLearn
Wellcome Connecting Science,COG-Train

Making sense of genomic data: COVID-19 web-based bioinformatics (FutureLearn)

Explore the tools for web-based SARS-CoV-2 sequencing analyses and learn techniques to prepare data and share genomic outputs. Discover how to analyse SARS-CoV-2 sequences using web-based tools. Over the past decade, sequencing technologies have become more accessible and sequencing data has increased exponentially. However, bioinformatics capacity is still limited. On this three-week course, you will learn the basics of bioinformatics concepts and explore the web-based tools used for viral data and sequencing analyses.

Jul 25th 2022
3 Weeks