What you will learn:
- How to use Spark and its libraries to solve big data problems
- How to approach large-scale data science and engineering problems
- Spark's APIs, architecture, and many internal details
- The trade-offs between communication and computation in a distributed environment
- Use cases for Spark
Learn the underlying principles required to develop scalable machine learning pipelines, and gain hands-on experience with Apache Spark. Machine learning aims to extract knowledge from data, drawing on fundamental concepts from computer science, statistics, probability, and optimization.