Coursera

Data Wrangling with Python Project (Coursera)

Offered by University of Colorado Boulder,

The "Data Wrangling Project" course provides students with an opportunity to apply the knowledge gained throughout the specialization in a real-life data wrangling project of their interest. Participants will follow the data wrangling pipeline step by step, from identifying data sources to processing and integrating data, to achieve a fine dataset ready for analysis. This course enables students to gain hands-on experience in the data wrangling process and prepares them to handle complex data challenges in real-world scenarios.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Throughout the course, students will work on their data wrangling project, applying the knowledge and skills gained in each module to achieve a refined and well-prepared dataset. By the end of the course, participants will be proficient in the data wrangling process and ready to tackle real-world data challenges in diverse domains.
This course is part of the Data Wrangling with Python Specialization.

What you'll learn

Initiate and conduct a data wrangling project from raw data to a refined dataset for analysis.
Apply data wrangling techniques learned in the specialization to handle real-life data scenarios.
Utilize Python libraries and tools effectively for data wrangling tasks. Communicate and present data wrangling results effectively to stakeholders.

Syllabus

Data Wrangling Pipeline
Module 1
In this introductory week, you will gain an understanding of the data wrangling pipeline, which serves as a structured approach to transform raw data into a cleaned and organized dataset for analysis. You will learn the key stages involved in the pipeline, setting the foundation for the rest of the course.

Identify Your Data
Module 2
In this week, you will learn how to identify and define the scope and objectives of your data wrangling project. You will explore various data sources, understand their structure, and assess the suitability of each source for the project.

Data Collection and Integration
Module 3
This week covers the data collection and integration stage of the data wrangling process. You will learn techniques for data collection, validate the collected data, and integrate data from multiple sources.

Data Understanding and Visualization
Module 4
This week focuses on gaining a comprehensive understanding of the dataset through statistical analysis and data visualization. You will learn how to perform descriptive statistics, create informative visualizations, and conduct exploratory data analysis (EDA).

Data Processing and Manipulation
Module 5
In this week, you will delve into essential data processing and manipulation techniques. You will learn how to handle missing values, detect and handle outliers, perform data sampling and dimensionality reduction, apply data scaling and discretization, and explore data cubes and pivot tables.

Go to Class

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Coursera

PwC

Data Visualization with Advanced Excel (Coursera)

Statistics & Data Analysis Data Science

In this course, you will get hands-on instruction of advanced Excel 2013 functions. You’ll learn to use PowerPivot to build databases and data models. We’ll show you how to perform different types of scenario and simulation analysis and you’ll have an opportunity to practice these skills by leveraging some of Excel's built in tools including, solver, data tables, scenario manager and goal seek.

Jul 6th 2026

4 Weeks

Databases Excel Data Analysis

Coursera

Google

Using Python to Interact with the Operating System (Coursera)

CS: Information & Technology Computer Science

By the end of this course, you’ll be able to manipulate files and processes on your computer’s operating system. You’ll also have learned about regular expressions -- a very powerful tool for processing text files -- and you’ll get practice using the Linux command line on a virtual machine. And, this might feel like a stretch right now, but you’ll also write a program that processes a bunch of errors in an actual log file and then generates a summary file. That’s a super useful skill for IT Specialists to know.

Jul 7th 2026

5-12 Weeks

Programming Python Regular Expressions

Coursera

University of California, San Diego

Code Free Data Science (Coursera)

Data Science

The Code Free Data Science class is designed for learners seeking to gain or expand their knowledge in the area of Data Science. Participants will receive the basic training in effective predictive analytic approaches accompanying the growing discipline of Data Science without any programming requirements. Machine Learning methods will be presented by utilizing the KNIME Analytics Platform to discover patterns and relationships in data.

Jul 6th 2026

4 Weeks

Machine Learning Big Data Data Science

Coursera

University of Michigan

Applied Social Network Analysis in Python (Coursera)

Statistics & Data Analysis Data Science

This course will introduce the learner to network analysis through the NetworkX library. The course begins with an understanding of what network analysis is and motivations for why we might model phenomena as networks. The second week introduces the concept of connectivity and network robustness.. The third week will explore ways of measuring the importance or centrality of a node in a network. The final week will explore the evolution of networks over time and cover models of network generation and the link prediction problem.

Jul 6th 2026

4 Weeks

Python Networks Social Networks

Coursera

Google

Crash Course on Python (Coursera)

CS: Information & Technology Computer Science

This course is designed to teach you the foundations in order to write simple programs in Python using the most common structures. No previous exposure to programming is needed. By the end of this course, you'll understand the benefits of programming in IT roles; be able to write simple programs using Python; figure out how the building blocks of programming fit together; and combine all of this knowledge to solve a complex programming problem.

Jul 7th 2026

5-12 Weeks

Python Data Structures Object-Oriented Programming

Coursera

University of Colorado System

Data Warehouse Concepts, Design, and Data Integration (Coursera)

CS: Design & Product

This is the second course in the Data Warehousing for Business Intelligence specialization. Ideally, the courses should be taken in sequence. In this course, you will learn exciting concepts and skills for designing data warehouses and creating data integration workflows. These are fundamental skills for data warehouse developers and administrators. You will have hands-on experience for data warehouse design and use open source products for manipulating pivot tables and creating data integration workflows.

Jul 6th 2026

5-12 Weeks

Business Intelligence Data Warehousing Data Warehouse

Coursera

University of Amsterdam

Data Analytics for Lean Six Sigma (Coursera)

Statistics & Data Analysis Data Science

Welcome to this course on Data Analytics for Lean Six Sigma. In this course you will learn data analytics techniques that are typically useful within Lean Six Sigma improvement projects. At the end of this course you are able to analyse and interpret data gathered within such a project. You will be able to use Minitab to analyse the data. I will also briefly explain what Lean Six Sigma is.

Jul 6th 2026

5-12 Weeks

Testing Lean Six Sigma Data Visualization

Coursera

Nanjing University

Data Processing Using Python (Coursera)

CS: Software Engineering Statistics & Data Analysis

This course is mainly for non-computer majors. It starts with the basic syntax of Python, to how to acquire data in Python locally and from network, to how to present data, then to how to conduct basic and advanced statistic analysis and visualization of data, and finally to how to design a simple GUI to present and process data, advancing level by level.

Jul 6th 2026

5-12 Weeks

Python Data Structures Data Analysis

Coursera

University of Illinois at Urbana-Champaign

Pattern Discovery in Data Mining (Coursera)

Statistics & Data Analysis Data Science

Learn the general concepts of data mining along with basic methodologies and applications. Then dive into one subfield in data mining: pattern discovery. Learn in-depth concepts, methods, and applications of pattern discovery in data mining. We will also introduce methods for data-driven phrase mining and some interesting applications of pattern discovery. This course provides you the opportunity to learn skills and content to practice and engage in scalable pattern discovery methods on massive transactional data, discuss pattern evaluation measures, and study methods for mining diverse kinds of patterns, sequential patterns, and sub-graph patterns.

Jul 6th 2026

4 Weeks

Algorithms Data Mining Data Analysis

Coursera

University of Washington

Machine Learning: Classification (Coursera)

Statistics & Data Analysis Data Science

Case Studies: Analyzing Sentiment & Loan Default Prediction. In our case study on analyzing sentiment, you will create models that predict a class (positive/negative sentiment) from input features (text of the reviews, user profile information,...). In our second case study for this course, loan default prediction, you will tackle financial data, and predict when a loan is likely to be risky or safe for the bank.

Jul 6th 2026

5-12 Weeks

Python Machine Learning Classification

Coursera

Google Cloud

Getting Started with Google Sheets (Coursera)

Business

Google Sheets is a robust, cloud-based application that empowers you to create sophisticated spreadsheets. Whether you are working at your desk—or from your smartphone or tablet on-the-go—Google Sheets helps you organize, analyze, and share your most important data. In this course for Sheets users, you’ll learn how to make your own supercharged spreadsheets.

Jul 6th 2026

5-12 Weeks

Spreadsheets Data Analysis Data Visualization

Coursera

University of Michigan

Programming for Everybody (Getting Started with Python) (Coursera)

CS: Programming

This course aims to teach everyone the basics of programming computers using Python. We cover the basics of how one constructs a program from a series of simple instructions in Python. The course has no pre-requisites and avoids all but the simplest mathematics. Anyone with moderate computer experience should be able to master the materials in this course.

Jul 6th 2026

5-12 Weeks

Programming Python Functions