Statistical Modeling for Data Science Applications Specialization
Statistical modeling lies at the heart of data science. Well crafted statistical models allow data scientists to draw conclusions about the world from the limited information present in their data. In this three credit sequence, learners will add some intermediate and advanced statistical modeling techniques to their data science toolkit. In particular, learners will become proficient in the theory and application of linear regression analysis; ANOVA and experimental design; and generalized linear and additive models. Emphasis will be placed on analyzing real data using the R programming language.
This specialization can be taken for academic credit as part of CU Boulder’s Master of Science in Data Science (MS-DS) degree offered on the Coursera platform. The MS-DS is an interdisciplinary degree that brings together faculty from CU Boulder’s departments of Applied Mathematics, Computer Science, Information Science, and others.
WHAT YOU WILL LEARN
- Correctly analyze and apply tools of regression analysis to model relationship between variables and make predictions given a set of input variables.
- Successfully conduct experiments based on best practices in experimental design.
- Use advanced statistical modeling techniques, such as generalized linear and additive models, to model wide range of real-world relationships.