EdX

Case Studies in Functional Genomics (edX)

Offered by HarvardX, Harvard University,

Perform RNA-Seq, ChIP-Seq, and DNA methylation data analyses, using open source software, including R and Bioconductor. We will explain how to perform the standard processing and normalization steps, starting with raw data, to get to the point where one can investigate relevant biological questions.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

Throughout the case studies, we will make use of exploratory plots to get a general overview of the shape of the data and the result of the experiment. We start with RNA-seq data analysis covering basic concepts and a first look at FASTQ files. We will also go over quality control of FASTQ files; aligning RNA-seq reads; visualizing alignments and move on to analyzing RNA-seq at the gene-level : counting reads in genes; Exploratory Data Analysis and variance stabilization for counts; count-based differential expression; normalization and batch effects. Finally, we cover RNA-seq at the transcript-level : inferring expression of transcripts (i.e. alternative isoforms); differential exon usage. We will learn the basic steps in analyzing DNA methylation data, including reading the raw data, normalization, and finding regions of differential methylation across multiple samples. The course will end with a brief description of the basic steps for analyzing ChIP-seq datasets, from read alignment, to peak calling, and assessing differential binding patterns across multiple samples.

Given the diversity in educational background of our students we have divided the series into seven parts. You can take the entire series or individual courses that interest you. If you are a statistician you should consider skipping the first two or three courses, similarly, if you are biologists you should consider skipping some of the introductory biology lectures. Note that the statistics and programming aspects of the class ramp up in difficulty relatively quickly across the first three courses. By the third course will be teaching advanced statistical concepts such as hierarchical models and by the fourth advanced software engineering skills, such as parallel computing and reproducible research concepts.
These courses make up two Professional Certificates and are self-paced:
Data Analysis for Life Sciences:
PH525.1x: Statistics and R for the Life Sciences
PH525.2x: Introduction to Linear Models and Matrix Algebra
PH525.3x: Statistical Inference and Modeling for High-throughput Experiments
PH525.4x: High-Dimensional Data Analysis
Genomics Data Analysis:
PH525.5x: Introduction to Bioconductor
PH525.6x: Case Studies in Functional Genomics
PH525.7x: Advanced Bioconductor

What you'll learn:

Mapping reads
Quality assessment of Next Generation Data
Analyzing RNA-seq data
Analyzing DNA methylation data
Analyzing ChIP Seq data

Prerequisites:
Statistical Inference and Modeling for High-throughput Experiments
High-Dimensional Data Analysis

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Advanced Bayesian Statistics Using R (edX)

EdX

University of Canterbury,UCx

Advanced Bayesian Statistics Using R (edX)

Statistics & Data Analysis Data Science

Now that you know the basics of Bayesian inference, dive deeper to explore its richness and flexibility more fully. Let’s take a closer look at modeling latent variables, Bayesian model averaging, generalised linear models, and MCMC methods. Advanced Bayesian Data Analysis Using R is part two of the Bayesian Data Analysis in R professional certificate.

Self Paced

Self-Paced

Calculus Algorithms Machine Learning

EdX

Davidson College,DavidsonX

RShiny for Everyone (edX)

Statistics & Data Analysis

Use R’s Shiny package to create data-driven, interactive web applications. In this course, you will use R Shiny to create an interactive web application that highlights the biodiversity of America’s National Parks. Your application will feature an interactive map, biodiversity calculator, trail journal and species images. Using R Shiny, you will expand your data analysis and visualization skills while developing your workflow through web application deployment.

Self Paced

Self-Paced

Data Analysis Web Applications Data Visualization

Inteligencia empresarial con Tableau (edX)

EdX

University of Naples Federico II,FedericaX

Inteligencia empresarial con Tableau (edX)

Statistics & Data Analysis

Aprovecha al máximo las funciones de Tableau, analiza los datos. Este curso se dirige a usuarios de Tableau que han madurado un sólido conocimiento del software en los cursos de nivel básico e intermedio.

Self Paced

Self-Paced

Data Analysis Business Intelligence Data Visualization

Analyzing and Visualizing Data with Power BI (edX)

EdX

Davidson College,DavidsonX

Analyzing and Visualizing Data with Power BI (edX)

Statistics & Data Analysis Computer Science

Step up your analytics game and learn one of the most in-demand job skills in the United States. Power BI is a robust business analytics and visualization tool from Microsoft that helps data professionals bring their data to life and tell more meaningful stores. This four-week course is a beginner's guide to working with data in Power BI and is perfect for professionals. You'll become confident in working with data, creating data visualizations, and preparing reports and dashboards.

Self Paced

Self-Paced

Data Analysis Data Visualization Power BI

EdX

University of Adelaide,AdelaideX

Programming for Data Science (edX)

CS: Programming Data Science

Learn how to apply fundamental programming concepts, computational thinking and data analysis techniques to solve real-world data science problems. There is a rising demand for people with the skills to work with Big Data sets and this course can start you on your journey through our Big Data MicroMasters program towards a recognised credential in this highly competitive area. Using practical activities you will learn how digital technologies work and will develop your coding skills through engaging and collaborative assignments.

Self Paced

Self-Paced

Programming Big Data Data Analysis

Mathematical Methods for Data Analysis (edX)

EdX

The Hong Kong University of Science and Technology - HKUST,HKUSTx

Mathematical Methods for Data Analysis (edX)

Statistics & Data Analysis Computer Science

Learn mathematical methods for data analysis including mathematical formulations and computational methods. Some well-known machine learning algorithms such as k-means are introduced in the examples.

Self Paced

Self-Paced

Data Analysis Differentiation Linear Functions

Introduction to Bayesian Statistics Using R (edX)

EdX

University of Canterbury,UCx

Introduction to Bayesian Statistics Using R (edX)

Statistics & Data Analysis Data Science

Learn the fundamentals of Bayesian approach to data analysis, and practice answering real life questions using R. Basics of Bayesian Data Analysis Using R is part one of the Bayesian Data Analysis in R professional certificate. Bayesian approach is becoming increasingly popular in all fields of data analysis, including but not limited to epidemiology, ecology, economics, and political sciences. It also plays an increasingly important role in data mining and deep learning. Let this course be your first step into Bayesian statistics.

Self Paced

Self-Paced

Data Mining Data Analysis ANOVA

Introduction to Linear Models and Matrix Algebra (edX)

EdX

HarvardX,Harvard University

Introduction to Linear Models and Matrix Algebra (edX)

Sci: Biology & Life Sciences Statistics & Data Analysis

Learn to use R programming to apply linear models to analyze data in life sciences. Matrix Algebra underlies many of the current tools for experimental design and the analysis of high-dimensional data. In this introductory data analysis course, we will use matrix algebra to represent the linear models that commonly used to model differences between experimental units. We perform statistical inference on these differences. Throughout the course we will use the R programming language.

Self Paced

Self-Paced

Algebra Linear Algebra Matrix

Dynamic Programming: Applications In Machine Learning and Genomics (edX)

EdX

University of California, San Diego,UC San DiegoX

Dynamic Programming: Applications In Machine Learning and Genomics (edX)

Sci: Biology & Life Sciences Sci: Mathematics

Learn how dynamic programming and Hidden Markov Models can be used to compare genetic strings and uncover evolution. If you look at two genes that serve the same purpose in two different species, how can you rigorously compare these genes in order to see how they have evolved away from each other?

Self Paced

Self-Paced

Machine Learning Genomics Dynamic Programming

Big Data Technology Capstone Project (edX)

EdX

The Hong Kong University of Science and Technology - HKUST,HKUSTx

Big Data Technology Capstone Project (edX)

Statistics & Data Analysis Computer Science

The Big Data Technology Capstone Project will allow you to apply the techniques and theory you have gained from the four courses in this MicroMasters program to a medium-scale project. In this capstone course, you will get an opportunity to apply the knowledge and skills that you have gained throughout this MicroMasters program.

Self Paced

Self-Paced

Big Data Data Mining Data Analysis

EdX

HarvardX,Harvard University

High-Dimensional Data Analysis (edX)

Sci: Biology & Life Sciences Statistics & Data Analysis

A focus on several techniques that are widely used in the analysis of high-dimensional data. If you’re interested in data analysis and interpretation, then this is the data science course for you. We start by learning the mathematical definition of distance and use this to motivate the use of the singular value decomposition (SVD) for dimension reduction and multi-dimensional scaling and its connection to principle component analysis.

Self Paced

Self-Paced

Machine Learning Clustering Data Analysis

Observation Theory: Estimating the Unknown (edX)

EdX

Delft University of Technology,DelftX

Observation Theory: Estimating the Unknown (edX)

Engineering Sci: Mathematics

Learn how to estimate parameters from observational data for real-world engineering applications and assess the quality of the results. Are you an engineer, scientist or technician? Are you dealing with measurements or big data, but are you unsure about how to proceed? This is the course that teaches you how to find the best estimates of the unknown parameters from noisy observations. You will also learn how to assess the quality of your results.

Self Paced

Self-Paced

Data Analysis Mathematical Models Observation Theory