Data Pipelines

Sort options

Data Engineering Capstone Project (Coursera)

Jul 4th 2022
Data Engineering Capstone Project (Coursera)
Course Auditing
Categories
Effort
Languages
In this course you will apply a variety of data engineering skills and techniques you have learned as part of the previous courses in the IBM Data Engineering Professional Certificate. You will assume the role of a Junior Data Engineer who has recently joined the organization and be presented [...]

ETL and Data Pipelines with Shell, Airflow and Kafka (Coursera)

Jul 4th 2022
ETL and Data Pipelines with Shell, Airflow and Kafka (Coursera)
Course Auditing
Categories
Effort
Languages
After taking this course, you will be able to describe two different approaches to converting raw data into analytics-ready data. One approach is the Extract, Transform, Load (ETL) process. The other contrasting approach is the Extract, Load, and Transform (ELT) process. ETL processes apply to data warehouses and data [...]

Distributed Computing with Spark SQL (Coursera)

This course is for students with SQL experience and now want to take the next step in gaining familiarity with distributed computing using Spark. Students will gain an understanding of when to use Spark and how Spark as an engine uniquely combines Data and AI technologies at scale. The [...]

Big Data Science with the BD2K-LINCS Data Coordination and Integration Center (Coursera)

In this course we briefly introduce the DCIC and the various Centers that collect data for LINCS. We then cover metadata and how metadata is linked to ontologies. We then present data processing and normalization methods to clean and harmonize LINCS data. This follow discussions about how data is [...]

Data Integration with Microsoft Azure Data Factory (Coursera)

In this course, you will learn how to create and manage data pipelines in the cloud using Azure Data Factory. This course is part of a Specialization intended for Data engineers and developers who want to demonstrate their expertise in designing and implementing data solutions that use Microsoft Azure [...]

Healthcare Data Models (Coursera)

Career prospects are bright for those qualified to work in healthcare data analytics. Perhaps you work in data analytics, but are considering a move into healthcare where your work can improve people’s quality of life. If so, this course gives you a glimpse into why this work matters, what [...]

Customising your models with TensorFlow 2 (Coursera)

Welcome to this course on Customising your models with TensorFlow 2! In this course you will deepen your knowledge and skills with TensorFlow, in order to develop fully customised deep learning models and workflows for any application. You will use lower level APIs in TensorFlow to develop complex model [...]

Data Engineer (Dataquest)

Self Paced
Data Engineer (Dataquest)
Free Course
Categories
Effort
Languages
Get all the skills and knowledge you need to become a data engineer. You’ll learn how to work with data architecture, data processing, and data systems. By the end, you’ll be able to build a unique data infrastructure, manage data pipelines and data processing, and maintain data systems.

Machine Learning Operations 2 (MLOps2-AML): Data Pipeline Automation & Optimization using Microsoft Azure Machine Learning (AML) (edX)

Most data science projects fail. There are various reasons why, but one of the primary reasons is the challenge of deployment. One piece to the deployment puzzle is understanding how to automate your pipeline’s functions and continuously optimize its performance, which is why we developed this course, MLOps2: Data [...]