A Step-by-Step Introduction to Process Mining (openHPI)

A Step-by-Step Introduction to Process Mining (openHPI)

Process mining is widely used in organizations to improve the understanding of business processes, based on data. Therefore, process mining is also called “data science for business processes”. While process mining has gone mainstream, there are many underlying concepts and techniques, and these are complex. The goal of this online course is to provide a general understanding of the concepts and techniques behind process mining. The course will be most valuable for domain experts, whose business processes are investigated, and for professionals in IT and in business consulting. We aim at providing a common understanding and a common language that facilitates communication between all stakeholders involved in process mining projects.

Course information

Week 1 introduces the main concepts in process mining, using a sample business process. We explore the data generated during the execution of this process and we transform data items to events that tell us about the execution of process activities. Process discovery shows us how business processes are actually executed. After week 1 you will have a good understanding of fundamental concepts in process mining, including event log generation and process discovery.

Week 2 focuses on process mining techniques beyond process discovery. First, we explore how data about the process execution can be used to detect undesired behavior and potential compliance issues. Second, we take a look at how process mining can help to understand more detailed aspects of the process execution. This includes understanding what decisions have been made in the process and why and what factors determine the overall completion time of a process. After week 2 you will have an overview and a good understanding of the potential of process mining beyond process discovery.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Advanced Linear Models for Data Science 2: Statistical Linear Models (Coursera) Coursera
Johns Hopkins University

Advanced Linear Models for Data Science 2: Statistical Linear Models (Coursera)

Welcome to the Advanced Linear Models for Data Science Class 2: Statistical Linear Models. This class is an introduction to least squares from a linear algebraic and mathematical perspective. Before beginning the class make sure that you have the following: a basic understanding of linear algebra and multivariate calculus; a basic understanding of statistics and regression models; at least a little familiarity with proof based mathematics; basic knowledge of the R programming language.

Jul 6th 2026
4 Weeks
Machine Learning Using SAS Viya (Coursera) Coursera
SAS

Machine Learning Using SAS Viya (Coursera)

This course covers the theoretical foundation for different techniques associated with supervised machine learning models. In addition, a business case study is defined to guide participants through all steps of the analytical life cycle, from problem understanding to model deployment, through data preparation, feature selection, model training and validation, and model assessment. A series of demonstrations and exercises is used to reinforce the concepts and the analytical approach to solving business problems.

Jul 6th 2026
5-12 Weeks
Data Science Bootcamp (openHPI) OpenHPI
Hasso-Plattner-Institut

Data Science Bootcamp (openHPI)

The ultimate goal of the bootcamp is to cultivate strong data science skills with an emphasis on machine learning techniques to satisfactorily meet and exceed the requests of the Data science world. In the process, we will develop good habits for operating independently as data scientists and for operating as members of productive data science teams.

Jun 7th 2023
4 Weeks
Data Engineering und Data Science – Klarheit in den Schlagwort-Dschungel (openHPI) OpenHPI
Hasso-Plattner-Institut

Data Engineering und Data Science – Klarheit in den Schlagwort-Dschungel (openHPI)

Die Schlagwörter Künstliche Intelligenz, Data Science, Data Engineering, und Big Data dominieren seit einigen Jahren nicht nur die IT-Schlagzeilen. In unserem Kurs wollen wir diese Wörter mit grundlegendem Inhalt füllen und die typischen Arbeitsschritte eines Data Scientists nachvollziehen. Insbesondere schauen wir hinter die Kulissen und betrachten den oft mühsamen Weg der Daten bis sie endlich genutzt werden können um z.B. mittels maschinellem Lernen Modelle trainieren zu können. Dazu gehören die Datenbeschaffung, die Datenreinigung, und die Datenintegration. Anschließend lernen wir, wie man aus diesen Daten und auch aus Texten neue Erkenntnisse mittels Data Mining und maschinellem Lernen gewinnt. Der Abschluss bildet eine Diskussion über Ethik und Fairness bei der automatisierten Datenanalyse.

Self Paced
Self-Paced
Understanding China, 1700-2000: A Data Analytic Approach, Part 2 (Coursera) Coursera
The Hong Kong University of Science and Technology - HKUST

Understanding China, 1700-2000: A Data Analytic Approach, Part 2 (Coursera)

The purpose of this course is to summarize new directions in Chinese history and social science produced by the creation and analysis of big historical datasets based on newly opened Chinese archival holdings, and to organize this knowledge in a framework that encourages learning about China in comparative perspective. Our course demonstrates how a new scholarship of discovery is redefining what is singular about modern China and modern Chinese history.

Jul 6th 2026
4 Weeks
AI Workflow: Feature Engineering and Bias Detection (Coursera) Coursera
IBM

AI Workflow: Feature Engineering and Bias Detection (Coursera)

This is the third course in the IBM AI Enterprise Workflow Certification specialization. You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones. Course 3 introduces you to the next stage of the workflow for our hypothetical media company. In this stage of work you will learn best practices for feature engineering, handling class imbalances and detecting bias in the data.

Jul 6th 2026
2 Weeks
Identifying Patient Populations (Coursera) Coursera
University of Colorado System

Identifying Patient Populations (Coursera)

This course teaches you the fundamentals of computational phenotyping, a biomedical informatics method for identifying patient populations. In this course you will learn how different clinical data types perform when trying to identify patients with a particular disease or trait. You will also learn how to program different data manipulations and combinations to increase the complexity and improve the performance of your algorithms.

Jul 6th 2026
5-12 Weeks
Reproducible Research (Coursera) Coursera
Johns Hopkins University

Reproducible Research (Coursera)

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations.

Jul 6th 2026
4 Weeks
Introduction to Machine Learning (Coursera) Coursera
Duke University

Introduction to Machine Learning (Coursera)

This course will provide you a foundational understanding of machine learning models (logistic regression, multilayer perceptrons, convolutional neural networks, natural language processing, etc.) as well as demonstrate how these models can solve complex problems in a variety of industries, from medical diagnostics to image recognition to text prediction.

Jul 10th 2026
5-12 Weeks