Modernizing Data Lakes and Data Warehouses with Google Cloud (edX)

Modernizing Data Lakes and Data Warehouses with Google Cloud (edX)
Course Auditing
Categories
Effort
Certification
Languages
Misc

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Modernizing Data Lakes and Data Warehouses with Google Cloud (edX)
This course is intended for developers who are responsible for: Querying datasets, visualizing query results, and creating reports. Specific job roles include: Data Engineer, Data Analyst, Database Administrators, Big Data Architects.

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

The two key components of any data pipeline are data lakes and warehouses. This course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud in technical detail. Also, this course describes the role of a data engineer, the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment.

This is the first course of the Data Engineering on Google Cloud series. After completing this course, enroll in the Building Batch Data Pipelines on Google Cloud course.

This course is part of the Google Cloud Data Engineer Learning Path Professional Certificate.


What you'll learn

- Differentiate between data lakes and data warehouses.

- Explore use-cases for each type of storage and the available data lake and warehouse solutions on Google Cloud.

- Discuss the role of a data engineer and the benefits of a successful data pipeline to business operations.

- Examine why data engineering should be done in a cloud environment.


Prerequisites:

To benefit from this course, participants should have completed “Google Cloud Big Data and Machine Learning Fundamentals” or have equivalent experience. Participant should also have: • Basic proficiency with a common query language such as SQL. • Experience with data modeling and ETL (extract, transform, load) activities. • Experience with developing applications using a common programming language such as Python. • Familiarity with machine learning and/or statistics


Syllabus


1. Introduction

This module introduces the Data Engineering on Google Cloud source series and this Modernizing Data Lakes and Data Warehouses with Google Cloud course.

2. Introduction to Data Engineering

This module discusses the role of data engineering and motivates the claim why data engineering should be done in the Cloud.

3. Building a Data Lake

In this module, we describe what data lake is and how to use Cloud Storage as your data lake on Google Cloud.

4. Building a Data Warehouse

In this module, we talk about BigQuery as a data warehousing option on Google Cloud.

5. Summary

A summary of the key learning points.

5. Course Resources

Links to PDF versions of each module.



MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Course Auditing
46.00 EUR

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.