DataFrames

Sort options

Introduction to Big Data with Spark and Hadoop (Coursera)

May 13th 2024
Introduction to Big Data with Spark and Hadoop (Coursera)
Course Auditing
Categories
Effort
Languages
Bernard Marr defines Big Data as the digital trace that we are generating in this digital era. In this course, you will learn about the characteristics of Big Data and its application in Big Data Analytics. You will gain an understanding about the features, benefits, limitations, and applications of [...]

Serverless Data Processing with Dataflow: Develop Pipelines (Coursera)

In this second installment of the Dataflow course series, we are going to be diving deeper on developing pipelines using the Beam SDK. We start with a review of Apache Beam concepts. Next, we discuss processing streaming data using windows, watermarks and triggers. We then cover options for sources [...]

Big Data Analysis with Scala and Spark (Scala 2 version) (Coursera)

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written [...]

Big Data Analysis with Scala and Spark (Coursera)

Manipulating big data distributed over a cluster using functional concepts is rampant in industry, and is arguably one of the first widespread industrial uses of functional ideas. This is evidenced by the popularity of MapReduce and Hadoop, and most recently Apache Spark, a fast, in-memory distributed collections framework written [...]

Julia Scientific Programming (Coursera)

This four-module course introduces users to Julia as a first language. Julia is a high-level, high-performance dynamic programming language developed specifically for scientific computing. This language will be particularly useful for applications in physics, chemistry, astronomy, engineering, data science, bioinformatics and many more. [...]

Applications in Engineering Mechanics (Coursera)

This course applies principles learned in my course “Introduction to Engineering Mechanics” to analyze real world engineering structures. You will need to have mastered the engineering fundamentals from that class in order to be successful in this course offering. This course addresses the modeling and analysis of static equilibrium [...]

Big Data, Hadoop, and Spark Basics (edX)

Self Paced
Big Data, Hadoop, and Spark Basics (edX)
Course Auditing
Categories
Effort
Languages
This course provides foundational big data practitioner knowledge and analytical skills using popular big data tools, including Hadoop and Spark. Learn and practice your big data skills hands-on. Organizations need skilled, forward-thinking Big Data practitioners who can apply their business and technical skills to unstructured data such as tweets, [...]
Self Paced
Course Auditing
88.00 EUR

Introduction to R (DataCamp)

Self Paced
 Introduction to R (DataCamp)
Free Course
Categories
Effort
Languages
Master the basics of data analysis by manipulating common data structures such as vectors, matrices, and data frames. In this introduction to R, you will master the basics of this beautiful open source language, including factors, lists and data frames. With the knowledge gained in this course, you will [...]

Big Data Analysis: Hive, Spark SQL, DataFrames and GraphFrames (Coursera)

No doubt working with huge data volumes is hard, but to move a mountain, you have to deal with a lot of small stones. But why strain yourself? Using Mapreduce and Spark you tackle the issue partially, thus leaving some space for high-level tools. Stop struggling to [...]

Programming for Everyone : Working with Data (Coursera)

In this 3rd Guided Project in the "Programming for Everyone" series, we will learn how to work with data. We will learn about the dplyr package and how to use it to process data. We will subsequently update the app that we created in the 2nd Guided Project in [...]