Exploratory Data Analysis (Coursera)

Offered by University of Leeds,
Exploratory Data Analysis (Coursera)

Statistical analysis is an indispensable aspect of data analysis because it allows us to collect, review and analyse data to draw valuable conclusions in various industries. This is why the market for statisticians is projected to grow in the future. If you want to build your statistics and probability expertise and learn about data visualisation, this short course is a great introduction to statistics as the art of learning from data.

Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

With real-life examples, you will explore the differences between data and information to discover the need for statistical models to gain objective and reliable inferences. You will consider what "unbiased" data collection means and explore various examples of data misrepresentation, misconception or incompleteness which will help you to develop statistical intuition and good practice skills.
Data visualisation is a sought-after skill. To create graphical and numerical summaries, you’ll learn and practice R software skills working in RStudio for exploratory data analysis. You will develop an intuitive concept of probability by completing probability experiments and computer simulations of binomial trails e.g., tossing a coin or rolling a die.
By the end of the course, you will be able to understand the role of statistical models in data analysis, develop numerical and graphical summaries using RStudio, and perform probability experiments in computer simulations.
No matter your current mathematics skill level, you will find something of interest in the course that offers many practical and real-life examples of statistics in action.
This course is a taster of the Online MSc in Data Science (Statistics) and it can also be completed by learners who want to understand the fundamentals of exploratory data analysis and data visualisation.

What you'll learn

  • Explain the different data types and apply data preparation methods to clean data.
  • Explore ways to visualise data using the software R.
  • Understand how visualisation of data can inform statistical model selection.

Syllabus

Getting to know your data for graphical summaries
This first week introduces you to data types (categorical, discrete, and continuous) and representing data via graphical summaries (or data visualisation). You will go through the steps you need to take to prepare data for analysis and data cleaning, by identifying missing data and outliers. You learn about and practice common graphical summaries such as box plots, histograms, and kernel density estimation (KDE).

Apply your knowledge: graphical summaries
This second week gives you the opportunity to apply your knowledge of graphical summaries from Week 1 in greater depth, with tasks in RStudio to complete such as preparing data for analysis and data cleaning, by identifying missing data and outliers.

Apply your knowledge: make your own graphical summaries and peer review
In this final week, you have the opportunity to build on your experiences of RStudio and data analysis using graphical summaries in Week 2. In Week 3, you complete a substantive task in RStudio to complete and there is a graded peer review where you share your output from the RStudio lab with a fellow student.

Go to Class
MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Related Courses

Framework for Data Collection and Analysis (Coursera) Coursera
University of Maryland, College Park

Framework for Data Collection and Analysis (Coursera)

This course will provide you with an overview over existing data products and a good understanding of the data collection landscape. With the help of various examples you will learn how to identify which data sources likely matches your research question, how to turn your research question into measurable pieces, and how to think about an analysis plan.

Jun 8th 2026
4 Weeks
Big Data Modeling and Management Systems (Coursera) Coursera
University of California, San Diego

Big Data Modeling and Management Systems (Coursera)

Once you’ve identified a big data issue to analyze, how do you collect, store and organize your data using Big Data solutions? In this course, you will experience various data genres and management tools appropriate for each. You will be able to describe the reasons behind the evolving plethora of new big data platforms from the perspective of big data management systems and analytical tools.

Jun 8th 2026
5-12 Weeks
Exploratory Data Analysis (Coursera) Coursera
Johns Hopkins University

Exploratory Data Analysis (Coursera)

This course covers the essential exploratory techniques for summarizing data. These techniques are typically applied before formal modeling commences and can help inform the development of more complex statistical models. Exploratory techniques are also important for eliminating or sharpening potential hypotheses about the world that can be addressed by the data.

Jun 8th 2026
4 Weeks
Reproducible Research (Coursera) Coursera
Johns Hopkins University

Reproducible Research (Coursera)

This course focuses on the concepts and tools behind reporting modern data analyses in a reproducible manner. Reproducible research is the idea that data analyses, and more generally, scientific claims, are published with their data and software code so that others may verify the findings and build upon them. The need for reproducibility is increasing dramatically as data analyses become more complex, involving larger datasets and more sophisticated computations.

Jun 8th 2026
4 Weeks
Data Visualization (Coursera) Coursera
Ball State University

Data Visualization (Coursera)

In the era of big data, acquiring the ability to analyze and visually represent “Big Data” in a compelling manner is crucial. Therefore, it is essential for data scientists to develop the skills in producing and critically interpreting digital maps, charts, and graphs. Data visualization is an increasingly important topic in our globalized and digital society. It involves graphically representing data or information, enabling decision-makers across various industries to comprehend complex concepts and processes that may otherwise be challenging to grasp.

Jun 9th 2026
5-12 Weeks
Marketing Analytics (Coursera) Coursera
University of Virginia

Marketing Analytics (Coursera)

Organizations large and small are inundated with data about consumer choices. But that wealth of information does not always translate into better decisions. Knowing how to interpret data is the challenge -- and marketers in particular are increasingly expected to use analytics to inform and justify their decisions. Marketing analytics enables marketers to measure, manage and analyze marketing performance to maximize its effectiveness and optimize return on investment (ROI). Beyond the obvious sales and lead generation applications, marketing analytics can offer profound insights into customer preferences and trends, which can be further utilized for future marketing and business decisions.

Jun 8th 2026
5-12 Weeks
Data Manipulation at Scale: Systems and Algorithms (Coursera) Coursera
University of Washington

Data Manipulation at Scale: Systems and Algorithms (Coursera)

Data analysis has replaced data acquisition as the bottleneck to evidence-based decision making --- we are drowning in it. Extracting knowledge from large, heterogeneous, and noisy datasets requires not only powerful computing resources, but the programming abstractions to use them effectively. The abstractions that emerged in the last decade blend ideas from parallel databases, distributed systems, and programming languages to create a new class of scalable data analytics platforms that form the foundation for data science at realistic scales.

Jun 8th 2026
4 Weeks
Infonomics II: Business Information Management and Measurement (Coursera) Coursera
University of Illinois at Urbana-Champaign

Infonomics II: Business Information Management and Measurement (Coursera)

Even decades into the Information Age, accounting practices yet fail to recognize the financial value of information. Moreover, traditional asset management practices fail to recognize information as an asset to be managed with earnest discipline. This has led to a business culture of complacence, and the inability for most organizations to fully leverage available information assets. This second course in the two-part Infonomics series explores how and why to adapt well-honed asset management principles and practices to information, and how to apply accepted and new valuation models to gauge information’s potential and realized economic benefits.

Jun 10th 2026
4 Weeks
Exploring and Producing Data for Business Decision Making (Coursera) Coursera
University of Illinois at Urbana-Champaign

Exploring and Producing Data for Business Decision Making (Coursera)

This course provides an analytical framework to help you evaluate key problems in a structured fashion and will equip you with tools to better manage the uncertainties that pervade and complicate business processes. Specifically, you will be introduced to statistics and how to summarize data and learn concepts of frequency, normal distribution, statistical studies, sampling, and confidence intervals.

Jun 8th 2026
4 Weeks
Foundations of strategic business analytics (Coursera) Coursera
ESSEC Business School

Foundations of strategic business analytics (Coursera)

Who is this course for? This course is designed for students, business analysts, and data scientists who want to apply statistical knowledge and techniques to business contexts. For example, it may be suited to experienced statisticians, analysts, engineers who want to move more into a business role. You will find this course exciting and rewarding if you already have a background in statistics, can use R or another programming language and are familiar with databases and data analysis techniques such as regression, classification, and clustering.

Jun 8th 2026
4 Weeks
Introduction to Data Science in Python (Coursera) Coursera
University of Michigan

Introduction to Data Science in Python (Coursera)

This course will introduce the learner to the basics of the python programming environment, including fundamental python programming techniques such as lambdas, reading and manipulating csv files, and the numpy library. The course will introduce data manipulation and cleaning techniques using the popular python pandas data science library and introduce the abstraction of the Series and DataFrame as the central data structures for data analysis, along with tutorials on how to use functions such as groupby, merge, and pivot tables effectively. By the end of this course, students will be able to take tabular data, clean it, manipulate it, and run basic inferential statistical analyses.

Jun 8th 2026
4 Weeks