NoSQL, Big Data, and Spark Fundamentals Professional Certificate

What you will learn

The four categories of NoSQL databases and Database-as-a-Service (DaaS) offerings, and how to work with MongoDB, Cassandra, and IBM Cloudant NoSQL databases.
The characteristics, features, benefits, limitations, and applications of the more popular Big Data processing tools, including Hadoop, HDFS, Hive, and HBase.
How data and machine learning engineers use Spark Structured Streaming, GraphFrames, and Spark ML for regression, classification, and clustering (including the k-means algorithm), and how to perform ETL using Spark.

Data engineers and Big Data professionals are in high demand. NoSQL and Big Data technology skills, such as Apache Spark, are a must-have for modern-day data engineers to enable data-driven decision-making. This three-course Professional Certificate from IBM opens the door to data engineering and Big Data careers.
The program starts with NoSQL Database Basics, which introduces you to NoSQL fundamentals, including the four key non-relational database categories. By the end of the course, you will have hands-on skills working with MongoDB, Cassandra, and IBM Cloudant NoSQL databases.
A crucial aspect of data engineering is Big Data and Big Data Analytics. When you enroll in Big Data, Hadoop, and Spark Basics, you'll discover the characteristics, features, benefits, limitations, and applications of some of the more popular Big Data processing tools. You'll explore the open-source ecosystem of Apache tools, including Apache Hadoop, Apache Hive, and Apache Spark, and discover how to leverage Spark to deliver reliable insights. You'll gain hands-on skills analyzing data using PySpark and Spark SQL, creating a streaming analytics application using Spark Streaming, and more.
Then enroll in Apache Spark for Data Engineering and Machine Learning to discover how data and machine learning engineers use Spark Structured Streaming, GraphFrames, regression, classification, and clustering. Learn how to apply the k-means clustering algorithm using Spark MLlib. ETL is at the heart of data and machine learning engineering, and you'll gain skills using Spark to perform extract, transform, and load (ETL) tasks. The course culminates in a hands-on Spark project.
This Professional Certificate does not require any prior programming or data science skills; however, basic data literacy and SQL skills will prove valuable in completing the program.

Big Data, Hadoop, and Spark Basics (edX)

edX

IBM

Computer Science

Discover the basics of Big Data, Hadoop, and Spark with our beginner-friendly course. Learn how to manage and analyze massive datasets using industry-standard tools and develop essential analytical skills that are in high demand across various sectors.

Self-Paced

Big Data, Hadoop, Apache Spark

Apache Spark for Data Engineering and Machine Learning (edX)

edX

IBM

Computer Science

Dive into the world of big data processing and machine learning with our introductory course on Apache Spark. Whether you're new to data engineering or looking to enhance your ML skills, this course provides a hands-on approach to understanding and applying Spark's powerful capabilities in Structured Streaming, Extract-Transform-Load (ETL) processes for Machine Learning pipelines, and Spark ML techniques.

Self-Paced

ML, Machine Learning, Apache Spark

edX

IBM

NoSQL Database Basics (edX)

Computer Science

Discover the world of NoSQL databases with our beginner-friendly course. Dive into the basics of non-relational data management and learn to work effectively with popular NoSQL systems like MongoDB, Cassandra, and IBM Cloudant. Whether you're new to database technology or looking to expand your skill set, this course will equip you with essential knowledge and practical skills.

Self-Paced

MongoDB, NoSQL, NoSQL Databases
