# Linear Regression in R for Public Health (Coursera)

##### Start Date
Sep 27th 2021 Course Auditing
Categories
Effort
Certification
41.00 EUR/month
Languages
You should know the basics of types of variables, distributions, hypothesis testing, p values and confidence intervals using R, though I'll recap.
Misc

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

Public Health has been defined as “the art and science of preventing disease, prolonging life and promoting health through the organized efforts of society”. Knowing what causes disease and what makes it worse are clearly vital parts of this. This requires the development of statistical models that describe how patient and environmental factors affect our chances of getting ill. This course will show you how to create such models from scratch, beginning with introducing you to the concept of correlation and linear regression before walking you through importing and examining your data, and then showing you how to fit models.

#### Class Deals by MOOC List - Click here and see Coursera's Active Discounts, Deals, and Promo Codes.

Using the example of respiratory disease, these models will describe how patient and other factors affect outcomes such as lung function.

Linear regression is one of a family of regression models, and the other courses in this series will cover two further members. Regression models have many things in common with each other, though the mathematical details differ.

This course will show you how to prepare the data, assess how well the model fits the data, and test its underlying assumptions – vital tasks with any type of regression.

You will use the free and versatile software package R, used by statisticians and data scientists in academia, governments and industry worldwide.

Course 2 of 4 in the Statistical Analysis with R for Public Health Specialization.

What You Will Learn

- Describe when a linear regression model is appropriate to use

- Read in and check a data set's variables using the software R prior to undertaking a model analysis

- Fit a multiple linear regression model with interactions, check model assumptions and interpret the output

### Syllabus

WEEK 1

INTRODUCTION TO LINEAR REGRESSION

Before jumping ahead to run a regression model, you need to understand a related concept: correlation. This week you’ll learn what it means and how to generate Pearson’s and Spearman’s correlation coefficients in R to assess the strength of the association between a risk factor or predictor and the patient outcome. Then you’ll be introduced to linear regression and the concept of model assumptions, a key idea underpinning so much of statistical analysis.

WEEK 2

Linear Regression in R

You’ll be introduced to the COPD data set that you’ll use throughout the course and will run basic descriptive analyses. You’ll also practise running correlations in R. Next, you’ll see how to run a linear regression model, firstly with one and then with several predictors, and examine whether model assumptions hold.

WEEK 3

Multiple Regression and Interaction

Now you’ll see how to extend the linear regression model to include binary and categorical variables as predictors and learn how to check the correlation between predictors. Then you’ll see how predictors can interact with each other and how to incorporate the necessary interaction terms into the model and interpret them. Different kinds of interactions exist and can be challenging to interpret, so we will take it slowly with worked examples and opportunities to practise.

WEEK 4

MODEL BUILDING

The last part of the course looks at how to build a regression model when you have a choice of what predictors to include in it. It describes commonly used automated procedures for model building and shows you why they are so problematic. Lastly, you’ll have the chance to fit some models using a more defensible and robust approach.

MOOC List is learner-supported. When you buy through links on our site, we may earn an affiliate commission.