Managing Big Data with R and Hadoop (FutureLearn)

Managing Big Data with R and Hadoop (FutureLearn)

Learn how to manage and analyse big data using the R programming language and Hadoop programming framework. This course will give you access to a virtual environment with installations of Hadoop, R and Rstudio to get hands-on experience with big data management. Several unique examples from statistical learning and related R code for map-reduce operations will be available for testing and learning.

Class Deals by MOOC List - Click here and see FutureLearn's Active Discounts, Deals, and Promo Codes.

Those with basic knowledge in statistical learning and R will better understand the methods behind and how to run them in parallel using map-reduce functions and Hadoop data storage. At the end of the course you will get access to RHadoop on a supercomputer at University of Ljubljana.

Syllabus

Week 1: Welcome to BIG DATA
Week 2: Working with Hadoop
Week 3: First steps in R and RHadoop
Week 4: Statistical learning with RHadoop: clustering
Week 5: Statistical learning with RHadoop: regression and classification

By the end of the course, you will:

  • Explore basic functionality of Apache Hadoop and of RHadoop
  • Experiment how to achieve performance of modern supercomputing
  • Experiment regression, clustering and classification with RHadoop
  • Investigate basic functionality of Bash terminal window
  • Knowledge about statistical learning to instances of data provided by edcators
  • How to do big data management with RHadoop on real supercomputer provided by Universiy of Ljubljana

Who is the course for?
This course is designed for people interested in data science, computational statistics and machine learning and have basic experiences with them. It will be also useful for advanced undergraduate students and first year PhD students in data analysis, statistics or bioinformatics, who wish to understand how to manage big data with Hadoop using R programming language.
We expect that the learners will also have basic experiences with linux and bash and working experiences with R and matrix operations. They should be also capable to download and run virtual machine.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Business Analytics Using Forecasting (FutureLearn) FutureLearn
National Tsing Hua University

Business Analytics Using Forecasting (FutureLearn)

Discover how business can harness the power of big data to make better predictive analysis. Learn how to use data to create powerful business forecasts. Organisations currently collect a vast quantity of data about suppliers, clients, employees, citizens, transactions, and much more. However, many are unaware of the predictive power this ‘big data’ has if anaylsed correctly. On this course, you’ll learn about forecasting using big data, exploring how it’s used by business as an important component of decision making.

Jul 15th 2024
5-12 Weeks
Predictive Analytics: Gaining Insights from Big Data (FutureLearn) FutureLearn
Queensland University of Technology

Predictive Analytics: Gaining Insights from Big Data (FutureLearn)

Learn to use predictive analytics tools and HPE Vertica Analytics to gain insights from big data, with this free online course. Collecting big data is just the first step; once you have it, how do you make sense of it? This free online course will show you how predictive analytics tools can help you gain information, knowledge and insights from big data.

No sessions available
4 Weeks
Genomic Medicine: Harnessing the Power of the Human Genome (FutureLearn) FutureLearn
University of Glasgow

Genomic Medicine: Harnessing the Power of the Human Genome (FutureLearn)

Using the latest genomics research, discover how genomic technologies are changing how we understand and treat medical conditions. Explore cutting-edge genomics data analysis tools and technology. This course will advance your understanding of the rapidly growing use of genomics in the research, diagnosis, and treatment of clinical conditions.

Available now
4 Weeks
Predictive Analytics: Solving Business Problems Using Machine Learning and Big Data (FutureLearn) FutureLearn
Sungkyunkwan University - SKKU

Predictive Analytics: Solving Business Problems Using Machine Learning and Big Data (FutureLearn)

Explore how predictive models can help businesses use data to identify risks and discover new opportunities. Discover how predictive analytics could transform your business As businesses accrue more and more data about their customers – from their behavioural history to their transactions – being able to use ‘Big Data’ is becoming increasingly key to low-term business success.

Aug 16th 2021
4 Weeks
Data Tells a Story: Reading Data in the Social Sciences and Humanities (FutureLearn) FutureLearn
Loughborough University

Data Tells a Story: Reading Data in the Social Sciences and Humanities (FutureLearn)

Learn about the role of data in a range of disciplines and about some fundamental tools for extracting knowledge from data. How can we answer questions about the world around us? How can we make decisions about what to do? Over the past years, more and more people have turned to data for help. Huge amounts of data are collected every day from millions of sources. This data has a lot to tell us! But data by itself is mute—it can only help us if we learn to make it speak and tell its story.

No sessions available
2 Weeks