Statistics & Data Analysis

Sort options

Serverless Data Processing with Dataflow: Operations (Coursera)

Apr 29th 2024
Serverless Data Processing with Dataflow: Operations (Coursera)
Course Auditing
Categories
Effort
Languages
In the last installment of the Dataflow course series, we will introduce the components of the Dataflow operational model. We will examine tools and techniques for troubleshooting and optimizing pipeline performance. We will then review testing, deployment, and reliability best practices for Dataflow pipelines. [...]

Probability Theory: Foundation for Data Science (Coursera)

Understand the foundations of probability and its relationship to statistics and data science. We’ll learn what it means to calculate a probability, independent and dependent outcomes, and conditional events. We’ll study discrete and continuous random variables and see how this fits with data collection. We’ll end [...]

Moneyball and Beyond (Coursera)

The book Moneyball triggered a revolution in the analysis of performance statistics in professional sports, by showing that data analytics could be used to increase team winning percentage. This course shows how to program data using Python to test the claims that lie behind the Moneyball story, and to [...]

Foundations of Sports Analytics: Data, Representation, and Models in Sports (Coursera)

This course provides an introduction to using Python to analyze team performance in sports. Learners will discover a variety of techniques that can be used to represent sports data and how to extract narratives based on these analytical techniques. The main focus of the introduction will be on the [...]

Prediction Models with Sports Data (Coursera)

In this course the learner will be shown how to generate forecasts of game results in professional sports using Python. The main emphasis of the course is on teaching the method of logistic regression as a way of modeling game results, using data on team expenditures. The learner is [...]

Tools for Exploratory Data Analysis in Business (Coursera)

This course introduces several tools for processing business data to obtain actionable insight. The most important tool is the mind of the data analyst. Accordingly, in this course, you will explore what it means to have an analytic mindset. You will also practice identifying business problems that can be [...]

Data Science as a Field (Coursera)

This course provides a general introduction to the field of Data Science. It has been designed for aspiring data scientists, content experts who work with data scientists, or anyone interested in learning about what Data Science is and what it’s used for. Weekly topics include an overview of the [...]

Modern Regression Analysis in R (Coursera)

This course will provide a set of foundational statistical modeling tools for data science. In particular, students will be introduced to methods, theory, and applications of linear statistical models, covering the topics of parameter estimation, residual diagnostics, goodness of fit, and various strategies for variable selection and model comparison. [...]

Data Mining Pipeline (Coursera)

This course introduces the key steps involved in the data mining pipeline, including data understanding, data preprocessing, data warehousing, data modeling, interpretation and evaluation, and real-world applications. Data Mining Pipeline can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree [...]

ANOVA and Experimental Design (Coursera)

This second course in statistical modeling will introduce students to the study of the analysis of variance (ANOVA), analysis of covariance (ANCOVA), and experimental design. ANOVA and ANCOVA, presented as a type of linear regression model, will provide the mathematical basis for designing experiments for data science applications. Emphasis [...]