EdX

Cluster Analysis (edX)

Cluster Analysis (edX)

Learn how to conduct a cluster analysis to discover important patterns in student behavior using the popular Weka data mining toolkit. In this course, you will learn the basics of cluster analysis, one of the most popular data mining methods for the discovery of patterns in learning data, and its application in learning analytics.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

Cluster analysis enables the identification of common, archetypal patterns of student interactions, which can lead to better understanding of student learning behaviors and provision of personalized feedback and interventions.
This course will have a strong hands-on component, as you will learn how to conduct a cluster analysis using the popular Weka data mining toolkit.
We will cover K-means and Hierarchical clustering techniques, which are two simple, yet widely used, cluster analysis methods. We will also review some of the published learning analytics studies that adopted cluster analysis and learn how to interpret the cluster analysis results.
Finally, we will also examine some of the more advanced techniques and identify certain practical challenges with cluster analysis, such as the selection of the optimal number of clusters and the validation of cluster analysis results.

What you'll learn

  • Understand clustering and its use in learning analytics
  • How to use the Weka toolkit to conduct cluster analysis
  • Popular clustering algorithms (k-means, hierarchical clustering, EM clustering)
  • How to interpret cluster analysis results
  • How to use clustering in learning analytics to solve problems, such as improving student learning experiences and learning outcomes, increasing retention, or providing personalized feedback and support to students
  • How to determine an optimal number of clusters for the analysis

Syllabus

Week 1: Introduction
Lectures:
Introduction to unsupervised machine learning methods
Introduction to clustering
Overview of clustering uses for learning analytics
Labs:
Introduction to Weka toolkit

Week 2: Overview of k-means and hierarchical clustering methods
Lectures:
K-means clustering theory
K-means full example
Hierarchical clustering theory
Hierarchical clustering full example
Labs:
Conducting k-means clustering using Weka
Conducting hierarchical clustering using Weka

Week 3: Practical considerations
Lectures:
How to choose the number of clusters
How to interpret clustering results
Overview of more advanced clustering methods
Labs:
Real-world cluster analysis walkthrough

Prerequisites
We highly recommend that you take the previous course in the series before beginning this course:
Social Network Analysis
This course is intended for those who have a bachelor’s degree and are interested in developing learning and data science skills for employment in education, corporate, nonprofit, and military sectors. Experience with programming and statistics will be beneficial to participants.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Analytics in Python (edX) EdX
Columbia University,ColumbiaX

Analytics in Python (edX)

Learn the fundamental of programming in Python and develop the ability to analyze data and make data-driven decisions. Data is the lifeblood of an organization. Competency in programming is an essential skill for successfully extracting information and knowledge from data. The goal of this course is to introduce learners to the basics of programming in Python and to give a working knowledge of how to use programs to deal with data.

This course is archived
5-12 Weeks
Data Warehousing and BI Analytics (edX) EdX
IBM

Data Warehousing and BI Analytics (edX)

This course introduces you to designing, implementing and populating a data warehouse and analyzing its data using SQL & Business Intelligence (BI) tools. Today’s businesses are investing significantly in capabilities to harness the massive amounts of data that fuel Business Intelligence (BI). Working knowledge of Data Warehouses and BI Analytics tools are a crucial skill for Data Engineers, Data Warehousing Specialists and BI Analysts, making who are amongst, the most valued resources for organizations.

Self Paced
Self-Paced
Process Mining: Data science in Action (Coursera) Coursera
Eindhoven University of Technology

Process Mining: Data science in Action (Coursera)

Process mining is the missing link between model-based process analysis and data-oriented analysis techniques. Through concrete data sets and easy to use software the course provides data science knowledge that can be applied directly to analyze and improve processes in a variety of domains. Data science is the profession of the future, because organizations that are unable to use (big) data in a smart way will not survive. It is not sufficient to focus on data storage and data analysis. The data scientist also needs to relate data to process analysis.

Jun 1st 2026
5-12 Weeks
GIS Data Formats, Design and Quality (Coursera) Coursera
University of California, Davis

GIS Data Formats, Design and Quality (Coursera)

In this course, the second in the Geographic Information Systems (GIS) Specialization. What you will learn: design data tables and use separating and joining data in a relational database; write query strings to subset data; create and work with raster data; create web maps.

Jun 1st 2026
4 Weeks
Learning Analytics Fundamentals (edX) EdX
University of Texas at Arlington,UTArlingtonX

Learning Analytics Fundamentals (edX)

Learn about the growing field of learning analytics and how to analyze basic data sets to generate insights. The demand for data science and learning science skills has continued to increase as classrooms, labs, and organizations look to optimize their data and improve learning environments for students and employees. The UTArlingtonX Learning Analytics courses will give you the opportunity to gain invaluable knowledge and expertise in this growing field.

No sessions available
4 Weeks
Data, Analytics and Learning (edX) EdX
University of Texas at Arlington,UTArlingtonX

Data, Analytics and Learning (edX)

An introduction to the logic and methods of analysis of data to improve teaching and learning. Capturing and analyzing data has changed how decisions are made and resources are allocated in businesses, journalism, government, and military and intelligence fields. Through better use of data, leaders are able to plan and enact strategies with greater clarity and confidence.

No sessions available
4 Weeks
Analítica avanzada y seguridad cibernética (edX) EdX
Galileo University,GalileoX

Analítica avanzada y seguridad cibernética (edX)

La digitalización del sector energético brinda una gran oportunidad para alcanzar una matriz energética diversificada y sostenible. Sin embargo existen grandes retos por delante, los cuales pueden ser superados gracias a los avances en los sistemas de analítica avanzada. Por otra parte, la digitalización del sector energético requiere la implementación de las mejores prácticas para proteger los sistemas y la información de ciberataques y así, mejorar la seguridad operativa y la confiabilidad de los sistemas.

Self Paced
Self-Paced
Feature Engineering for Improving Learning Environments (edX) EdX
University of Texas at Arlington,UTArlingtonX

Feature Engineering for Improving Learning Environments (edX)

Every model used to predict a future outcome depends upon the quality of features used. This course focuses on developing better features to create better models. How can data-intensive research methods be used to create more equitable and effective learning environments? In this course, you will learn how data from digital learning environments and administrative data systems can be used to help better understand relevant learning environments, identify students in need of support, and assess changes made to learning environments.

No sessions available
3 Weeks
Data Science for Smart Cities (edX) EdX
Purdue University,PurdueX

Data Science for Smart Cities (edX)

Learn various scientific techniques that will allow the analysis, inference and prediction of large-scale data (e.g. GPS vehicular data, social media data, mobile phone data, individual social network data etc.) that are present in city networks. The availability of low cost and ubiquitous sensors in city infrastructure provides high granular data at unprecedented spatiotemporal scales. “Smart Cities” envision to utilize this data to provide a healthy, happy and sustainable urban ecosystem by integrating the information and communication technology (ICT), Internet of things (IoT) and citizen participation to effectively manage and utilize city infrastructure and services.

No sessions available
13-24 Weeks
Introduction to Bayesian Statistics Using R (edX) EdX
University of Canterbury,UCx

Introduction to Bayesian Statistics Using R (edX)

Learn the fundamentals of Bayesian approach to data analysis, and practice answering real life questions using R. Basics of Bayesian Data Analysis Using R is part one of the Bayesian Data Analysis in R professional certificate. Bayesian approach is becoming increasingly popular in all fields of data analysis, including but not limited to epidemiology, ecology, economics, and political sciences. It also plays an increasingly important role in data mining and deep learning. Let this course be your first step into Bayesian statistics.

Self Paced
Self-Paced
Analytics for the Classroom Teacher (edX) EdX
Curtin University,CurtinX

Analytics for the Classroom Teacher (edX)

This course is ideal for school teachers who want to improve their teaching through valuable data-driven insights. Do you want to be more reflective in your teaching practice and wonder if there are technologies that can help? Are you curious about how data-driven, evidence-based teaching practices can improve your students’ learning? This is the course for you!

No sessions available
5-12 Weeks