IBM Data Engineering Professional Certificate

This Professional Certificate is for anyone who wants to develop job-ready skills, tools, and a portfolio for an entry-level data engineer position. Throughout the self-paced online courses, you will immerse yourself in the role of a data engineer and acquire the essential skills you need to work with a range of tools and databases to design, deploy, and manage structured and unstructured data.
By the end of this Professional Certificate, you will be able to explain and perform the key tasks required in a data engineering role. You will use the Python programming language and Linux/UNIX shell scripts to extract, transform, and load (ETL) data. You will work with relational databases (RDBMS) and query data using SQL statements. You will work with NoSQL databases and unstructured data. You will be introduced to Big Data and work with Big Data engines like Hadoop and Spark. You will gain experience creating data warehouses and using Business Intelligence tools to analyze data and extract insights.
Each course includes numerous hands-on labs & projects to apply the concepts and skills you learn. The program will culminate in a Capstone Project where you will bring together all of these skills to develop and implement an entire data platform with various data repositories and pipelines to address a real-world inspired data analytics problem.

WHAT YOU WILL LEARN
- RDBMS fundamentals including Design & Creation of Databases, Schemas, Tables; DB Administration, Security & working with MySQL, PostgreSQL & IBM Db2.
- SQL query language: SELECT, INSERT, UPDATE, and DELETE statements, database functions, stored procedures, working with multiple tables, JOINs, & transactions.
- NoSQL & Big Data concepts including practice with MongoDB, Cassandra, IBM Cloudant, Apache Hadoop, Apache Spark, SparkSQL, SparkML, Spark Streaming.
- ETL, Data Pipelines using Python, Shell Scripts, Apache Airflow and Apache Kafka; Building & Populating Data Warehouses, and Querying with BI tools.

Getting Started with Data Warehousing and BI Analytics (Coursera)

Data is one of an organization’s most valuable commodities. But how can organizations best use their data? And how does the organization determine which data is the most recent, accurate, and useful for business decision making at the highest level? After taking this course, you will be able to [...]

Introduction to Big Data with Spark and Hadoop (Coursera)

Apr 29th 2024
Bernard Marr defines Big Data as the digital trace that we are generating in this digital era. In this course, you will learn about the characteristics of Big Data and its application in Big Data Analytics. You will gain an understanding of the features, benefits, limitations, and applications of [...]

Hands-on Introduction to Linux Commands and Shell Scripting (Coursera)

Apr 29th 2024
This mini-course provides a practical introduction to commonly used Linux/UNIX shell commands and teaches you the basics of Bash shell scripting to automate a variety of tasks. The course includes both video-based lectures and hands-on labs to practice and apply what you learn. You will have [...]

Python Project for Data Engineering (Coursera)

Apr 29th 2024
This mini-course is intended to apply foundational Python skills by implementing different techniques to collect and work with data. Assume the role of a Data Engineer and extract data from multiple file formats, transform it into specific datatypes, and then load it into a single source for analysis. Continue [...]
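The extract, transform, and load steps described above can be sketched with the Python standard library alone. This is a minimal illustration, not course material: the sample records, the field names `name` and `height_in`, and the unit conversion are all assumptions made up for the example, and in-memory strings stand in for real source files.

```python
import csv
import io
import json

# Hypothetical sample inputs standing in for files in two formats.
CSV_TEXT = "name,height_in\nalex,65.5\nsam,70.0\n"
JSON_TEXT = '[{"name": "kim", "height_in": "68.2"}]'


def extract_csv(text):
    """Extract: parse CSV text into a list of dict rows."""
    return list(csv.DictReader(io.StringIO(text)))


def extract_json(text):
    """Extract: parse a JSON array of objects into a list of dict rows."""
    return json.loads(text)


def transform(rows):
    """Transform: cast the height from text to float and convert inches to meters."""
    return [
        {"name": row["name"], "height_m": round(float(row["height_in"]) * 0.0254, 2)}
        for row in rows
    ]


def load(rows, target):
    """Load: append the cleaned rows to a single target collection."""
    target.extend(rows)
    return target


def run_etl():
    """Run all three steps and return the combined, cleaned rows."""
    rows = extract_csv(CSV_TEXT) + extract_json(JSON_TEXT)
    return load(transform(rows), [])
```

Calling `run_etl()` returns one list that combines the rows from both inputs, each with a numeric `height_m` field. In a real pipeline the target would more likely be a file or a database table than a Python list.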

Introduction to Data Engineering (Coursera)

Apr 29th 2024
This course introduces you to the core concepts, processes, and tools you need to know to gain a foundational knowledge of data engineering. You will gain an understanding of the modern data ecosystem and the role Data Engineers, Data Scientists, and Data Analysts play in this ecosystem. [...]
42.00 EUR/month

Data Engineering Capstone Project (Coursera)

Apr 22nd 2024
In this course you will apply a variety of data engineering skills and techniques you have learned as part of the previous courses in the IBM Data Engineering Professional Certificate. You will assume the role of a Junior Data Engineer who has recently joined the organization and be presented [...]

Relational Database Administration (DBA) (Coursera)

Ongoing and proactive management is critical to the security and performance of database management systems. Database administration is the function of managing the operational aspects of database systems and maintaining them. Database administrators work to ensure that applications make the most efficient use of databases and that physical resources [...]

ETL and Data Pipelines with Shell, Airflow and Kafka (Coursera)

Apr 22nd 2024
After taking this course, you will be able to describe two different approaches to converting raw data into analytics-ready data. One approach is the Extract, Transform, Load (ETL) process. The other, contrasting approach is the Extract, Load, and Transform (ELT) process. ETL processes apply to data warehouses and data [...]
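The difference between the two approaches can be sketched with an in-memory SQLite database. This is a hedged illustration under stated assumptions: the `RAW` records and the table names `sales_etl`, `raw_sales`, and `sales_elt` are hypothetical, and SQLite stands in for whatever target system a real pipeline would use. In the ETL function the data is cleaned in Python before it is loaded; in the ELT function the raw data is loaded first and then transformed with SQL inside the database.

```python
import sqlite3

# Hypothetical raw records: (id, amount stored as text).
RAW = [("a", "10"), ("b", "25")]


def etl(conn):
    """ETL: transform in application code, then load the cleaned rows."""
    cleaned = [(key, int(amount)) for key, amount in RAW]
    conn.execute("CREATE TABLE sales_etl (id TEXT, amount INTEGER)")
    conn.executemany("INSERT INTO sales_etl VALUES (?, ?)", cleaned)


def elt(conn):
    """ELT: load the raw rows as-is, then transform inside the database."""
    conn.execute("CREATE TABLE raw_sales (id TEXT, amount TEXT)")
    conn.executemany("INSERT INTO raw_sales VALUES (?, ?)", RAW)
    conn.execute(
        "CREATE TABLE sales_elt AS "
        "SELECT id, CAST(amount AS INTEGER) AS amount FROM raw_sales"
    )


def totals():
    """Run both approaches and return the summed amounts from each result table."""
    conn = sqlite3.connect(":memory:")
    etl(conn)
    elt(conn)
    etl_total = conn.execute("SELECT SUM(amount) FROM sales_etl").fetchone()[0]
    elt_total = conn.execute("SELECT SUM(amount) FROM sales_elt").fetchone()[0]
    conn.close()
    return etl_total, elt_total
```

Both paths produce the same cleaned data here; what differs is where the transformation runs and whether the raw records are kept in the target system.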

Introduction to NoSQL Databases (Coursera)

Apr 22nd 2024
This course will provide you with technical hands-on knowledge of NoSQL databases and Database-as-a-Service (DaaS) offerings. With the advent of Big Data and agile development methodologies, NoSQL databases have gained a lot of relevance in the database landscape. Their main advantage is the ability to effectively handle scalability and [...]