NoSQL Big Data and Spark Foundations Specialization

Big Data Engineers and professionals with NoSQL skills are highly sought after in the data management industry. This Specialization is designed for those seeking to develop fundamental skills for working with Big Data, Apache Spark, and NoSQL databases. Three information-packed courses cover popular NoSQL databases like MongoDB and Apache Cassandra, the widely used Apache Hadoop ecosystem of Big Data tools, and the Apache Spark analytics engine for large-scale data processing.
You start with an overview of various categories of NoSQL (Not only SQL) data repositories, and then work hands-on with several of them, including IBM Cloudant, MongoDB, and Cassandra. You’ll perform various data management tasks, such as creating and replicating databases and inserting, updating, deleting, querying, indexing, aggregating, and sharding data. Next, you’ll gain fundamental knowledge of Big Data technologies such as Hadoop, MapReduce, HDFS, Hive, and HBase, followed by a more in-depth working knowledge of Apache Spark, Spark DataFrames, Spark SQL, PySpark, the Spark Application UI, and scaling Spark with Kubernetes. In the final course, you will learn to work with Spark Structured Streaming and Spark ML to perform Extract, Transform, and Load (ETL) processing and machine learning tasks.
This specialization is suitable for beginners in the fields of NoSQL and Big Data – whether you are, or are preparing to be, a Data Engineer, Software Developer, IT Architect, Data Scientist, or IT Manager.
WHAT YOU WILL LEARN
- Work with NoSQL databases to insert, update, delete, query, index, aggregate, and shard/partition data.
- Develop hands-on NoSQL experience working with MongoDB, Apache Cassandra, and IBM Cloudant.
- Develop foundational knowledge of Big Data and gain hands-on lab experience using Apache Hadoop, MapReduce, Apache Spark, Spark SQL, and Kubernetes.
- Perform Extract, Transform and Load (ETL) processing and Machine Learning model training and deployment with Apache Spark.

Introduction to NoSQL Databases (Coursera)

Apr 22nd 2024
This course will provide you with technical hands-on knowledge of NoSQL databases and Database-as-a-Service (DaaS) offerings. With the advent of Big Data and agile development methodologies, NoSQL databases have gained a lot of relevance in the database landscape. Their main advantage is the ability to effectively handle scalability and [...]

Introduction to Big Data with Spark and Hadoop (Coursera)

Apr 15th 2024
Bernard Marr defines Big Data as the digital trace that we are generating in this digital era. In this course, you will learn about the characteristics of Big Data and its application in Big Data Analytics. You will gain an understanding of the features, benefits, limitations, and applications of [...]

Data Engineering and Machine Learning using Spark (Coursera)

Jan 29th 2024
Organizations need skilled, forward-thinking Big Data practitioners who can apply their business and technical skills to unstructured data such as tweets, posts, pictures, audio files, videos, sensor data, satellite imagery, and more, in order to identify the behaviors and preferences of prospects, clients, competitors, and others. In this short course [...]