Big Data Analytics in Healthcare (Udacity)

Data science plays an important role in many industries. Faced with massive amounts of heterogeneous data, data scientists depend on scalable machine learning and data mining algorithms and systems. The growth in the volume, complexity, and speed of data drives the need for scalable data analytic algorithms and systems. In this course, we study such algorithms and systems in the context of healthcare applications.

In healthcare, large amounts of heterogeneous medical data have become available across healthcare organizations (payers, providers, pharmaceutical companies). This data could be an enabling resource for deriving insights that improve care delivery and reduce waste. The sheer size and complexity of these datasets make them hard to analyze and hard to apply in a practical clinical environment.
In this course, we introduce the characteristics of medical data and the data mining challenges associated with such data. We cover various algorithms and systems for big data analytics, focusing on these techniques in the context of concrete healthcare analytic applications such as predictive modeling, computational phenotyping, and patient similarity. We also study big data analytic technology:
  • Scalable machine learning algorithms such as online learning and fast similarity search
  • Big data analytic systems such as the Hadoop family (Hive, Pig, HBase), Spark, and graph databases

What You Will Learn

Lesson 1
Big Data

  • Predictive Modeling
  • Dimensionality Reduction & Tensor Factorization
  • Graph Analysis

Lesson 2
Healthcare

  • Computational Phenotyping
  • Patient Similarity Metrics
  • Medical Ontology

Lesson 3
Technologies

  • MapReduce
  • Spark
  • Hadoop

Prerequisites and Requirements
  • Basic machine learning and data mining concepts such as classification and clustering
  • Proficient programming and system skills in Python, Java, and Scala
  • Proficient knowledge of and experience in dealing with data (recommended skills include SQL and NoSQL systems such as MongoDB)

Related Courses

Statistical Inference (Coursera)
Johns Hopkins University

Statistical inference is the process of drawing conclusions about populations or scientific truths from data. There are many modes of performing inference, including statistical modeling, data-oriented strategies, and explicit use of designs and randomization in analyses. Furthermore, there are broad theories (frequentist, Bayesian, likelihood, design-based, …) and numerous complexities (missing data, observed and unobserved confounding, biases) in performing inference.

Jun 1st 2026
4 Weeks

Data Science Interview Prep (Udacity)
Udacity

Confidently take on the tech interview. Data science job interviews can be daunting. Technical interviewers often ask you to design an experiment or model. You may need to solve problems using Python and SQL. You will likely need to show how you connect data skills to business decisions and strategy. In this course, you'll review the common questions asked in data science, data analyst, and machine learning interviews.

Self-Paced

Introduction to Machine Learning Course (Udacity)
Udacity

This class will teach you the end-to-end process of investigating data through a machine learning lens. Learn online, with Udacity. Machine Learning is a first-class ticket to the most exciting careers in data analysis today. As data sources proliferate along with the computing power to process them, going straight to the data is one of the most straightforward ways to quickly gain insights and make predictions.

Self-Paced

Guided Imagery (Coursera)
University of Minnesota

In this course, you will learn how you can use imagery and imagery interventions to help with symptom management and healing, as well as to enhance overall health and wellbeing. You will experience a variety of imagery interventions and evaluate how they might be helpful in providing relief or enhancing quality of life.

Jun 3rd 2026
5-12 Weeks

Introduction to Machine Learning (Coursera)
Duke University

This course will provide you with a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) and demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jun 5th 2026
5-12 Weeks

Google Data Analytics Capstone: Complete a Case Study (Coursera)
Google

This course is the eighth course in the Google Data Analytics Certificate. You’ll have the opportunity to complete an optional case study, which will help prepare you for the data analytics job hunt. Case studies are commonly used by employers to assess analytical skills. For your case study, you’ll choose an analytics-based scenario. You’ll then ask questions, prepare, process, analyze, visualize and act on the data from the scenario.

Jun 2nd 2026
4 Weeks

Practical Machine Learning (Coursera)
Johns Hopkins University

Prediction and machine learning are among the most common tasks performed by data scientists and data analysts. This course will cover the basic components of building and applying prediction functions, with an emphasis on practical applications. The course will provide basic grounding in concepts such as training and test sets, overfitting, and error rates.

Jun 1st 2026
4 Weeks

Public Involvement in Research (Coursera)
Imperial College London

This course focuses on participatory approaches in research, known as 'public involvement' in the UK. Specifically, you'll consider why citizens and patients would be involved in research and explore participatory approaches across and within the research cycle in more detail, diving into questions such as: What kinds of participation can be undertaken at each of the 7 stages of the cycle? How can you utilise participation in research? What examples of using participatory approaches exist in research?

Jun 3rd 2026
4 Weeks

Regression Models (Coursera)
Johns Hopkins University

Linear models, as their name implies, relate an outcome to a set of predictors of interest using linear assumptions. Regression models, a subset of linear models, are the most important statistical analysis tool in a data scientist’s toolkit. This course covers regression analysis, least squares, and inference using regression models.

Jun 1st 2026
4 Weeks