EdX

Big Data Computing with Spark (edX)

Big Data Computing with Spark (edX)

Learn the theory and gain hands-on experience of big data systems, using Spark as the exemplary platform. Big data systems such as Hadoop and Spark emerge as enabling technologies in managing massive amounts of data across hundreds or even thousands of computing nodes. Meanwhile, cloud computing platforms have made these technologies easily accessible to individuals as well as large enterprises.

Class Deals by MOOC List - Click here and see EdX's Active Discounts, Deals, and Promo Codes.

This course exposes students to both the theory and hands-on experience of big data systems, using Spark as the exemplary platform.

What you'll learn

  • Spark programming using both RDD and DataFrame APIs
  • Useful packages including ML, GraphX/GraphFrames, and SparkStreaming
  • Spark internals and performance optimizations
  • Algorithm design for big data systems

Syllabus

Week 1: Overview, MapReduce, and Hadoop
Week 2-3: Spark Basics and RDD
Week 4: SparkSQL and MLib
Week 5: Spark internals
Week 6: Algorithm design for big data
Week 7: GraphX/GraphFrames
Week 8: Spark Streaming

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Big Data, Hadoop, and Spark Basics (edX) EdX
IBM

Big Data, Hadoop, and Spark Basics (edX)

This course provides foundational big data practitioner knowledge and analytical skills using popular big data tools, including Hadoop and Spark. Learn and practice your big data skills hands-on. Organizations need skilled, forward-thinking Big Data practitioners who can apply their business and technical skills to unstructured data such as tweets, posts, pictures, audio files, videos, sensor data, and satellite imagery, and more, to identify behaviors and preferences of prospects, clients, competitors, and others. ****

Self Paced
Self-Paced
Big Data sin misterios (edX) EdX
Inter-American Development Bank - IDB,IDBx

Big Data sin misterios (edX)

Transforma los datos en valor. Aprende el potencial de Big Data para aumentar la eficiencia y eficacia de tu organización en este MOOC desarrollado con Telefónica. 51.5% de los estudiantes que participaron a la encuesta dicen que el curso les servirá para mejorar la capacidad para formular, implementar, y/o evaluar políticas públicas.

Self Paced
Self-Paced
Wiretaps to Big Data: Privacy and Surveillance in the Age of Interconnection (edX) EdX
Cornell University

Wiretaps to Big Data: Privacy and Surveillance in the Age of Interconnection (edX)

Explore the privacy issues of an interconnected world. How does cellular technology enable massive surveillance? Do users have rights against surveillance? How does surveillance affect how we use cellular and other technologies? How does it affect our democratic institutions? Do you know that the metadata collected by a cellular network speaks volumes about its users? In this course you will explore all of these questions while investigating related issues in WiFi and Internet surveillance.

No sessions available
5-12 Weeks
Big Data and Education (edX) EdX
University of Pennsylvania,PennX

Big Data and Education (edX)

Learn the methods and strategies for using large-scale educational data to improve education and make discoveries about learning. Online and software-based learning tools have been used increasingly in education. This movement has resulted in an explosion of data, which can now be used to improve educational effectiveness and support basic research on learning.

Self Paced
Self-Paced
Analytics in Python (edX) EdX
Columbia University,ColumbiaX

Analytics in Python (edX)

Learn the fundamental of programming in Python and develop the ability to analyze data and make data-driven decisions. Data is the lifeblood of an organization. Competency in programming is an essential skill for successfully extracting information and knowledge from data. The goal of this course is to introduce learners to the basics of programming in Python and to give a working knowledge of how to use programs to deal with data.

This course is archived
5-12 Weeks
Excel avanzado: importación y análisis de datos (edX) EdX
Universitat Politècnica de València,UPValenciaX

Excel avanzado: importación y análisis de datos (edX)

Conoce técnicas y estrategias avanzadas para importar, consolidar y visualizar con Excel datos provenientes de cualquier fuente. En este curso de análisis e interpretación de datos te presentaremos técnicas avanzadas de importación de datos y estrategias diversas para consolidarlos y prepararlos una vez importados de forma que puedas extraer las conclusiones que necesitas (basadas en nuestra experiencia en el uso de Microsoft Excel y demostradas con casos reales).

Self Paced
Self-Paced
UX Data Analysis (edX) EdX
HECMontrealX,HEC Montréal

UX Data Analysis (edX)

Become a UX data scientist! From qualitative data analysis to big data Web analytics, you will be able to leverage insights from data to make empirically-based recommendations. Do big data and UX speak to you? This MOOC will give you the methods and tools to analyze the whole spectrum of data we handle in UX, from qualitative user research and quantitative user testing data analysis to big data Web analytics.

Self Paced
Self-Paced
Introducción a los Sistemas de Información Gerencial (MIS): Una guía de supervivencia (edX) EdX
Universidad Carlos III de Madrid - UC3M,UC3Mx

Introducción a los Sistemas de Información Gerencial (MIS): Una guía de supervivencia (edX)

Obtén las habilidades y el conocimiento necesarios para tener éxito en un mundo corporativo dominado por los sistemas de información gerencial (SIG o MIS). Los omnipresentes Sistemas de Información Gerencial (SIG) o Management Information Systems (MIS) juegan un papel crítico en el actual panorama profesional. Desde los sistemas de gestión de las relaciones con los clientes, que gestionan las interacciones diarias con los clientes actuales y potenciales, hasta los sistemas gerenciales y financieros que emiten y pagan facturas, el día a día de la vida laboral está cada vez más controlado por estos sistemas de gestión, que dictan qué hacer y cómo hacerlo.

Self Paced
Self-Paced
Minería de Datos: Análisis de la Canasta de Compra (edX) EdX
Universidad Anáhuac,AnahuacX

Minería de Datos: Análisis de la Canasta de Compra (edX)

¿Conoces realmente qué productos de la canasta de mercado compran tus clientes o te dejas llevar por lo que aparenta a simple vista? En este curso aprenderás a construir modelos basados en técnicas de data mining o minería de datos, que te permitirán conocer información relevante de tus clientes y descubrir patrones de comportamiento para definir estrategias de marketing de acuerdo a la compra de productos.

Self Paced
Self-Paced
Big Data Capstone Project (edX) EdX
University of Adelaide,AdelaideX

Big Data Capstone Project (edX)

Further develop your knowledge of big data by applying the skills you have learned to a real-world data science project. This project will give you the opportunity to deepen your learning by giving you valuable experience in evaluating, selecting and applying relevant data science techniques, principles and theory to a data science problem. This project will see you plan and execute a reasonably substantial project and demonstrate autonomy, initiative and accountability.

Self Paced
Self-Paced