AWS Data Processing (Coursera)

Offered by Whizlabs

AWS: Data Processing is the second course of the AWS Certified Data Analytics Specialty Specialization. It focuses on data processing solutions and is designed to teach learners the concepts of Amazon EMR and Extract, Transform, and Load (ETL). The course also emphasizes ETL services and data processing solutions in AWS.

The course is divided into three modules, and each module is further segmented into lessons and video lectures. It offers approximately 3.5 to 4 hours of video lectures that provide both theory and hands-on knowledge. Graded and ungraded quizzes are also provided with every module to test learners' understanding.
Module 1: Introduction: Extract, Transform and Load Jobs
Module 2: Introduction: EMR
Module 3: ETL Services and Data Processing Solution in AWS
Learners are recommended to have experience working with AWS services to design, build, secure, and maintain analytics solutions before taking this course. By the end of this course, learners will be able to:
- Analyze Modeling Concepts and train Machine Learning Models
- Examine performance of machine learning models
- Implement automatic model tuning by training a model
Course 3 of 5 in the Exam Prep DAS-C01: AWS Certified Data Analytics Specialty Specialization.

What You Will Learn

  • Analyze Modeling Concepts and train Machine Learning Models
  • Examine performance of machine learning models
  • Implement automatic model tuning by training a model

Syllabus

WEEK 1
Introduction : Extract, Transform and Load Jobs
Welcome to Week 1 of AWS: Data Processing. This week, we will focus on determining the requirements for an appropriate data processing solution and on understanding Extract, Transform, Load (ETL) jobs. We will also gain hands-on experience implementing ETL jobs that move data between different sources and destinations. By the end of the week, we should have a good understanding of how to process data effectively using ETL jobs and of the requirements for doing so.
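
As a rough illustration of the extract, transform, and load stages mentioned above, here is a minimal sketch in plain Python. It is not taken from the course materials: the sample data, the field names, and the function names are all hypothetical, and a real job in this course would read from and write to AWS services rather than in-memory strings.

```python
import csv
import io
import json

# Hypothetical raw input; a real job would read this from a source system.
RAW_CSV = """order_id,amount,country
1,19.99,us
2,,de
3,5.00,US
"""


def extract(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: drop rows with a missing amount, cast types, upper-case country codes."""
    cleaned = []
    for row in rows:
        if not row["amount"]:
            continue  # skip incomplete records
        cleaned.append({
            "order_id": int(row["order_id"]),
            "amount": float(row["amount"]),
            "country": row["country"].upper(),
        })
    return cleaned


def load(rows):
    """Load: serialize to JSON Lines, standing in for a write to a destination store."""
    return "\n".join(json.dumps(r) for r in rows)


rows = transform(extract(RAW_CSV))
print(load(rows))
```

Keeping the three stages as separate functions makes each one easy to test and replace independently, which is the main idea the ETL pattern relies on.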

WEEK 2
Introduction: EMR
Welcome to Week 2 of AWS: Data Processing. This week, we will be introduced to Amazon EMR and its various applications, such as Spark, Hudi, HBase, TensorFlow, Flink, Presto, and Hue. We will also learn how to design a solution for transforming and preparing data for analysis using EMR. Through practical demonstrations, we will gain a solid understanding of how to use EMR effectively to process, transform, and analyze large datasets. By the end of the week, we should have a good understanding of how to leverage EMR for data processing and analytics needs.

WEEK 3
ETL Services and Data Processing Solution in AWS
Welcome to Week 3 of AWS: Data Processing. This week, we will focus on automating and operationalizing a data processing solution using AWS Glue and EMR. We will also compare batch and streaming ETL services to determine the most appropriate solution for our needs. By the end of the week, we should have a good understanding of how to use AWS services to automate and operationalize data processing workflows.
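
To make the batch versus streaming comparison above a little more concrete, here is a small conceptual sketch in plain Python. It is an assumption-laden illustration, not the course's method and not an AWS API: the temperature conversion and all function names are hypothetical. The batch version processes a complete dataset at once, while the streaming version handles records one at a time as they arrive.

```python
from typing import Iterable, Iterator, List


def to_celsius(fahrenheit: float) -> float:
    """Hypothetical per-record transformation shared by both styles."""
    return round((fahrenheit - 32) * 5 / 9, 1)


def batch_etl(readings: List[float]) -> List[float]:
    """Batch: the whole dataset is available up front and processed in one pass."""
    return [to_celsius(r) for r in readings]


def streaming_etl(readings: Iterable[float]) -> Iterator[float]:
    """Streaming: each record is transformed and emitted as soon as it arrives."""
    for r in readings:
        yield to_celsius(r)


# Batch: results are returned together once the job finishes.
print(batch_etl([32.0, 212.0]))

# Streaming: results are consumed incrementally from a (possibly unbounded) source.
for value in streaming_etl(iter([32.0, 212.0])):
    print(value)
```

The transformation logic is the same in both cases; what differs is when the data is available and when results are produced, which is typically what drives the choice between the two.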

Related Courses

Process Mining: Data science in Action (Coursera) Coursera
Eindhoven University of Technology

Process Mining: Data science in Action (Coursera)

Process mining is the missing link between model-based process analysis and data-oriented analysis techniques. Through concrete data sets and easy to use software the course provides data science knowledge that can be applied directly to analyze and improve processes in a variety of domains. Data science is the profession of the future, because organizations that are unable to use (big) data in a smart way will not survive. It is not sufficient to focus on data storage and data analysis. The data scientist also needs to relate data to process analysis.

Jun 1st 2026
5-12 Weeks
Managing Data Analysis (Coursera) Coursera
Johns Hopkins University

Managing Data Analysis (Coursera)

This one-week course describes the process of analyzing data and how to manage that process. We describe the iterative nature of data analysis and the role of stating a sharp question, exploratory data analysis, inference, formal statistical modeling, interpretation, and communication. In addition, we will describe how to direct analytic activities within a team and to drive the data analysis process towards coherent and useful results.

Jun 1st 2026
1 Week
A Crash Course in Data Science (Coursera) Coursera
Johns Hopkins University

A Crash Course in Data Science (Coursera)

By now you have definitely heard about data science and big data. In this one-week class, we will provide a crash course in what these terms mean and how they play a role in successful organizations. This class is for anyone who wants to learn what all the data science action is about, including those who will eventually need to manage data scientists. The goal is to get you up to speed as quickly as possible on data science without all the fluff. We've designed this course to be as convenient as possible without sacrificing any of the essentials.

Jun 1st 2026
1 Week
Process Data from Dirty to Clean (Coursera) Coursera
Google

Process Data from Dirty to Clean (Coursera)

This is the fourth course in the Google Data Analytics Certificate. These courses will equip you with the skills needed to apply to introductory-level data analyst jobs. In this course, you’ll continue to build your understanding of data analytics and the concepts and tools that data analysts use in their work. You’ll learn how to check and clean your data using spreadsheets and SQL as well as how to verify and report your data cleaning results. Current Google data analysts will continue to instruct and provide you with hands-on ways to accomplish common data analyst tasks with the best tools and resources.

Jun 2nd 2026
5-12 Weeks
Introduction to Big Data (Coursera) Coursera
University of California, San Diego

Introduction to Big Data (Coursera)

Interested in increasing your knowledge of the Big Data landscape? This course is for those new to data science and interested in understanding why the Big Data Era has come to be. It is for those who want to become conversant with the terminology and the core concepts behind big data problems, applications, and systems. It is for those who want to start thinking about how Big Data might be useful in their business or career. It provides an introduction to one of the most common frameworks, Hadoop, that has made big data analysis easier and more accessible -- increasing the potential for data to transform our world!

Jun 1st 2026
3 Weeks
Business Metrics for Data-Driven Companies (Coursera) Coursera
Duke University

Business Metrics for Data-Driven Companies (Coursera)

In this course, you will learn best practices for how to use data analytics to make any company more competitive and more profitable. You will be able to recognize the most critical business metrics and distinguish them from mere data. You’ll get a clear picture of the vital but different roles business analysts, business data analysts, and data scientists each play in various types of companies. And you’ll know exactly what skills are required to be hired for, and succeed at, these high-demand jobs.

Jun 1st 2026
4 Weeks
Share Data Through the Art of Visualization (Coursera) Coursera
Google

Share Data Through the Art of Visualization (Coursera)

This is the sixth course in the Google Data Analytics Certificate. These courses will equip you with the skills needed to apply to introductory-level data analyst jobs. You’ll learn how to visualize and present your data findings as you complete the data analysis process. This course will show you how data visualizations, such as visual dashboards, can help bring your data to life. You’ll also explore Tableau, a data visualization platform that will help you create effective visualizations for your presentations.

Jun 2nd 2026
4 Weeks
Data Engineering with Rust (Coursera) Coursera
Duke University

Data Engineering with Rust (Coursera)

Are you a data engineer, software developer, or a tech enthusiast with a basic understanding of Rust, seeking to enhance your skills and dive deep into the realm of data engineering with Rust? Or are you a professional from another programming language background, aiming to explore the efficiency, safety, and concurrency features of Rust for data engineering tasks? If so, this course is designed for you.

Jun 4th 2026
4 Weeks
Machine Learning for Data Analysis (Coursera) Coursera
Wesleyan University

Machine Learning for Data Analysis (Coursera)

Are you interested in predicting future outcomes using your data? This course helps you do just that! Machine learning is the process of developing, testing, and applying predictive algorithms to achieve this goal. Make sure to familiarize yourself with course 3 of this specialization before diving into these machine learning concepts. Building on Course 3, which introduces students to integral supervised machine learning concepts, this course will provide an overview of many additional concepts, techniques, and algorithms in machine learning, from basic classification to decision trees and clustering.

Jun 1st 2026
4 Weeks
Principles of fMRI 2 (Coursera) Coursera
Johns Hopkins University, University of Colorado Boulder

Principles of fMRI 2 (Coursera)

Functional Magnetic Resonance Imaging (fMRI) is the most widely used technique for investigating the living, functioning human brain as people perform tasks and experience mental states. It is a convergence point for multidisciplinary work from many disciplines. Psychologists, statisticians, physicists, computer scientists, neuroscientists, medical researchers, behavioral scientists, engineers, public health researchers, biologists, and others are coming together to advance our understanding of the human mind and brain. This course covers the analysis of Functional Magnetic Resonance Imaging (fMRI) data.

Jun 1st 2026
4 Weeks
Dealing With Missing Data (Coursera) Coursera
University of Maryland, College Park

Dealing With Missing Data (Coursera)

This course will cover the steps used in weighting sample surveys, including methods for adjusting for nonresponse and using data external to the survey for calibration. Among the techniques discussed are adjustments using estimated response propensities, poststratification, raking, and general regression estimation. Alternative techniques for imputing values for missing items will be discussed. For both weighting and imputation, the capabilities of different statistical software packages will be covered, including R®, Stata®, and SAS®.

Jun 1st 2026
4 Weeks
Bioinformatic Methods I (Coursera) Coursera
University of Toronto

Bioinformatic Methods I (Coursera)

Large-scale biology projects such as the sequencing of the human genome and gene expression surveys using RNA-seq, microarrays and other technologies have created a wealth of data for biologists. However, the challenge facing scientists is analyzing and even accessing these data to extract useful information pertaining to the system being studied. This course focuses on employing existing bioinformatic resources – mainly web-based programs and databases – to access the wealth of data to answer questions relevant to the average biologist, and is highly hands-on.

Jun 1st 2026
5-12 Weeks