Introduction to Statistics for Data Science using Python (edX)

Offered by IBM

This Statistics for Data Science course introduces the basic principles of statistical methods and procedures used for data analysis. After completing this course, you will have practical knowledge of crucial topics in statistics, including data gathering, summarizing data using descriptive statistics, displaying and visualizing data, examining relationships between variables, probability distributions, expected values, hypothesis testing, an introduction to ANOVA (analysis of variance), and regression and correlation analysis.

You will take a hands-on approach to statistical analysis using Python and Jupyter Notebooks, the tools of choice for Data Scientists and Data Analysts.

At the end of the course, you will complete a project that applies concepts from the course to a Data Science problem based on a real-life inspired scenario, demonstrating your understanding of foundational statistical thinking and reasoning. The focus is on understanding which approaches suit which data types, building intuition, assessing the proposed methods appropriately, using Python to analyze your data, and interpreting the output accurately.

This course is suitable for a variety of professionals and students intending to start their journey in data and statistics-driven roles such as Data Scientists, Data Analysts, Business Analysts, Statisticians, and Researchers. It does not require any computer science or statistics background. We strongly recommend taking the Python for Data Science course before starting this course to get familiar with the Python programming language, Jupyter notebooks, and libraries. An optional refresher on Python is also provided.

What you'll learn
After completing this course, a learner will be able to:

  • Calculate and apply measures of central tendency and measures of dispersion to grouped and ungrouped data.
  • Summarize, present, and visualize data in a way that is clear, concise, and provides practical insight for non-statisticians who need the results.
  • Identify appropriate hypothesis tests to use for common data sets.
  • Conduct hypothesis tests, correlation tests, and regression analysis.
  • Demonstrate proficiency in statistical analysis using Python and Jupyter Notebooks.
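
As a rough illustration of the first outcome above, the sketch below computes a few measures of central tendency and dispersion for ungrouped data using Python's standard `statistics` module. The `scores` list is made-up sample data, and the choice of module is an assumption for illustration only; it is not taken from the course materials.

```python
import statistics

# Hypothetical ungrouped sample data (not from the course)
scores = [2, 4, 4, 4, 5, 5, 7, 9]

# Measures of central tendency
mean = statistics.mean(scores)      # arithmetic average
median = statistics.median(scores)  # middle value of the sorted data
mode = statistics.mode(scores)      # most frequent value

# Measures of dispersion
pop_sd = statistics.pstdev(scores)        # population standard deviation
sample_var = statistics.variance(scores)  # sample variance (n - 1 denominator)

print(f"mean={mean}, median={median}, mode={mode}")
print(f"population sd={pop_sd}, sample variance={sample_var:.3f}")
```

The population and sample formulas differ only in the denominator (n versus n - 1), which is why `pstdev` and `variance` are shown side by side.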

Related Courses

Introduction to Bioconductor (edX)
HarvardX, Harvard University

The structure, annotation, normalization, and interpretation of genome scale assays. We begin with an introduction to the biology, explaining what we measure and why. Then we focus on the two main measurement technologies: next generation sequencing and microarrays. We then move on to describing how raw data and experimental information are imported into R and how we use Bioconductor classes to organize these data, whether generated locally, or harvested from public repositories or institutional archives.

Self-Paced

Statistics for Business - II (edX)
Indian Institute of Management, Bangalore, IIMBx

Examine data drawn from allied fields of business such as Finance and HR, and learn how to simulate data to follow a specified distribution. Statistics is a versatile discipline that has revolutionized the fields of business, engineering, medicine and pure sciences. This course is Part 2 of a 4-part series on Business Statistics, and is ideal for learners who wish to enroll in business programs. The first two parts cover topics in Descriptive Statistics, whereas the next two focus on Inferential Statistics.

No sessions available
5-12 Weeks

CS For All: Introduction to Computer Science and Python Programming (edX)
Harvey Mudd College, HarveyMuddX

A fun, fast-paced introduction to solving interesting problems with computer science through Python programming. Looking to get started with computer science while learning to program in Python? This computer science course provides an introduction to computer science that’s both challenging and fun.

No sessions available
13-24 Weeks

Successfully Evaluating Predictive Modelling (edX)
University of Edinburgh, EdinburghX

Gain an in-depth understanding of evaluation and sampling approaches for effective predictive modelling using Python. A predictive exercise is not finished when a model is built. This course will equip you with essential skills for understanding performance evaluation metrics, using Python, to determine whether a model is performing adequately.

Oct 26th 2021
5-12 Weeks

Fundamentals of Statistics (edX)
MIT, MITx

Develop a deep understanding of the principles that underpin statistical inference: estimation, hypothesis testing and prediction. Statistics is the science of turning data into insights and ultimately decisions. Behind recent advances in machine learning, data science and artificial intelligence are fundamental statistical principles. The purpose of this class is to develop and understand these core ideas on firm mathematical grounds starting from the construction of estimators and tests, as well as an analysis of their asymptotic performance.

Jan 29th 2024
13-24 Weeks

Designing and Running Randomized Evaluations (edX)
MIT, MITx

Learn how to both design randomized evaluations and implement them in the field to measure the impact of social programs. A randomized evaluation, also known as a randomized controlled trial (RCT), field experiment or field trial, is a type of impact evaluation that uses random assignment to allocate resources, run programs, or apply policies as part of the study design.

Sep 7th 2021
5-12 Weeks

Data, Analytics and Learning (edX)
University of Texas at Arlington, UTArlingtonX

An introduction to the logic and methods of analysis of data to improve teaching and learning. Capturing and analyzing data has changed how decisions are made and resources are allocated in businesses, journalism, government, and military and intelligence fields. Through better use of data, leaders are able to plan and enact strategies with greater clarity and confidence.

No sessions available
4 Weeks

Computer Applications of Artificial Intelligence and e-Construction (edX)
Purdue University, PurdueX

Learn the fundamentals of artificial intelligence, machine learning, and natural language processing, and their applications in e-Construction. This course is the third in a sequence of interrelated courses on current computer applications in the construction industry. It emphasizes advanced computational tools, including artificial intelligence, machine learning, and natural language processing, and their applications in e-Construction.

Mar 28th 2022
5-12 Weeks

Platform-Based Analytics (edX)
Indiana University, IUx

Gain hands-on experience extracting, preparing, exploring, and analyzing data statistically and visually using features and tools native to Microsoft Excel. In an ever-growing digital world, the need for strong data analysis skills is at the forefront of every business function, along with the ability to accurately describe and interpret analytical findings.

Nov 7th 2023
5-12 Weeks

Statistical Predictive Modelling and Applications (edX)
University of Edinburgh, EdinburghX

Learn how to apply statistical modelling techniques to real-world business scenarios using Python. In this course, you will learn three predictive modelling techniques (linear regression, logistic regression, and naive Bayes) and their applications in real-world scenarios. The first half of the course focuses on linear regression. This technique allows you to model a continuous outcome variable using both continuous and categorical predictors, for example to predict product sales based on several customer variables.

Jan 18th 2022
5-12 Weeks

Big Data Analytics Using Spark (edX)
University of California, San Diego, UC San DiegoX

Learn how to analyze large datasets using Jupyter notebooks, MapReduce and Spark as a platform. In data science, data is called “big” if it cannot fit into the memory of a single standard laptop or workstation. The analysis of big datasets requires using a cluster of tens, hundreds or thousands of computers. Effectively using such clusters requires the use of distributed file systems, such as the Hadoop Distributed File System (HDFS), and corresponding computational models, such as Hadoop, MapReduce and Spark.

Dec 5th 2023
5-12 Weeks

CS50's Introduction to Computer Science (edX)
HarvardX, Harvard University

An introduction to the intellectual enterprises of computer science and the art of programming. This is CS50, Harvard University's introduction to the intellectual enterprises of computer science and the art of programming for majors and non-majors alike, with or without prior programming experience. An entry-level course taught by David J. Malan, CS50 teaches students how to think algorithmically and solve problems efficiently.

Self-Paced