Apache Spark

Sort options

Machine Learning with Apache Spark (Coursera)

May 6th 2024
Machine Learning with Apache Spark (Coursera)
Course Auditing
Categories
Effort
Languages
Explore the exciting world of machine learning with this IBM course. Start by learning ML fundamentals before unlocking the power of Apache Spark to build and deploy ML models for data engineering applications. Dive into supervised and unsupervised learning techniques and discover the revolutionary possibilities of Generative AI through [...]
May 6th 2024
Course Auditing
45.00 EUR

Microsoft Azure Databricks for Data Engineering (Coursera)

In this course, you will learn how to harness the power of Apache Spark and powerful clusters running on the Azure Databricks platform to run large data engineering workloads in the cloud. You will discover the capabilities of Azure Databricks and the Apache Spark notebook for processing huge files. [...]

Perform data science with Azure Databricks (Coursera)

May 6th 2024
Perform data science with Azure Databricks (Coursera)
Course Auditing
Categories
Effort
Languages
In this course, you will learn how to harness the power of Apache Spark and powerful clusters running on the Azure Databricks platform to run data science workloads in the cloud. This is the fourth course in a five-course program that prepares you to take the DP-100: Designing and [...]

Data Engineering with MS Azure Synapse Apache Spark Pools (Coursera)

In this course, you will learn how to perform data engineering with Azure Synapse Apache Spark Pools, which enable you to boost the performance of big-data analytic applications by in-memory cluster computing. You will learn how to differentiate between Apache Spark, Azure Databricks, HDInsight, and SQL Pools and understand [...]

AI Workflow: Enterprise Model Deployment (Coursera)

May 6th 2024
AI Workflow: Enterprise Model Deployment (Coursera)
Course Auditing
Categories
Effort
Languages
This is the fifth course in the IBM AI Enterprise Workflow Certification specialization. You are STRONGLY encouraged to complete these courses in order as they are not individual independent courses, but part of a workflow where each course builds on the previous ones. Best practices for using Spark will [...]
May 6th 2024
Course Auditing
68.00 EUR/month

Distributed Computing with Spark SQL (Coursera)

This course is for students with SQL experience and now want to take the next step in gaining familiarity with distributed computing using Spark. Students will gain an understanding of when to use Spark and how Spark as an engine uniquely combines Data and AI technologies at scale. The [...]

Fundamentals of Scalable Data Science (Coursera)

Apache Spark is the de-facto standard for large scale data processing. This is the first course of a series of courses towards the IBM Advanced Data Science Specialization. We strongly believe that is is crucial for success to start learning a scalable data science platform since memory and CPU [...]

Big Data Integration and Processing (Coursera)

At the end of the course, you will be able to: Retrieve data from example database and big data management systems; Describe the connections between data management operations and the big data processing patterns needed to utilize them in large-scale analytical applications; Identify when a big data problem needs [...]

Hadoop Platform and Application Framework (Coursera)

This course is for novice programmers or business people who'd like to understand the core tools used to wrangle and analyze big data. With no prior experience, you'll have the opportunity to walk through hands-on examples with Hadoop and Spark frameworks, two of the most common in the industry. [...]