MAP 716 Lecture 5 Multiple Regression

Multiple regression is an extension of simple linear regression that involves one response variable and two or more explanatory variables. It allows modeling of more complex relationships between variables by accounting for the effects of multiple explanatory variables simultaneously. The multiple regression model expresses the response variable as a linear combination of the predictor variables. Regression coefficients represent the effect of each predictor after controlling for all other variables in the model.


5/23/2023

MAP 716: BIOSTATISTICS II AND COMPUTING


Multiple Regression

Lecture 5: Multiple Regression
Dr Alice Lakati, PhD
Senior Lecturer
Amref International University

• Simple linear regression describes the linear relation between a response variable Y and a single explanatory variable X
• Multiple regression is an extension to the case of one response variable and two or more explanatory variables
• In multiple linear regression a linear model is fitted for the response variable, which is expressed as a linear combination of the predictors
• Y' = b0 + b1X1 + b2X2 + … + bkXk
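The model above can be fitted by least squares. A minimal sketch, using made-up data generated from known coefficients (b0=3.0, b1=1.5, b2=-2.0, chosen here purely for illustration) so the fitted values can be checked against the truth:

```python
import numpy as np

# Generate illustrative data from known coefficients (no noise, so the
# least squares fit recovers them exactly up to floating-point error)
rng = np.random.default_rng(42)
n = 200
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
y = 3.0 + 1.5 * x1 - 2.0 * x2

# Design matrix with a leading column of ones for the intercept b0
X = np.column_stack([np.ones(n), x1, x2])

# Solve for b = (b0, b1, b2) by least squares
b, *_ = np.linalg.lstsq(X, y, rcond=None)
print(b)  # recovers b0=3.0, b1=1.5, b2=-2.0
```

With real data the response contains error, so the recovered coefficients are estimates rather than exact values.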

MR….

• b0 is the value of Y when all of the independent variables (X1 through Xk) are equal to zero, and b1 through bk are the estimated regression coefficients.
• Each regression coefficient represents the change in Y relative to a one unit change in the respective independent variable.
• In the multiple regression situation, b1, for example, is the change in Y relative to a one unit change in X1, holding all other independent variables constant (i.e., when the remaining independent variables are held at the same value or are fixed).
• Statistical tests can be performed to assess whether each regression coefficient is significantly different from zero.

Multiple Regression…..

• Multiple regression is performed for several reasons:
• The need to control or adjust for possible effects of "nuisance" explanatory variables; it can be used to adjust for confounding variables
• More explanatory variables may have a meaningful relationship with the response variable, and these more complex relationships need to be investigated
• It is almost always better to perform one comprehensive analysis including all the relevant variables than a series of two-way comparisons
• To analyze the simultaneous effect of a number of categorical variables; it is an alternative technique to analysis of variance
• To predict a value of the outcome variable

Multiple Regression….

• Multiple regression models can take various forms
• Multiple linear regression: predictors all continuous and linearly related to the outcome variable
• Polynomial regression: quadratic or higher order terms fitted
• Analysis of covariance: both continuous and categorical variables are included in the model
• Analysis of variance: predictors all categorical

Importance of predictors

• The regression coefficient bi represents the effect of that explanatory variable after controlling for all the other predictors in the model
• The importance of each individual predictor is tested by the t-test, as for SLR
• A confidence interval will give further information for the regression parameter
• An ANOVA table can be obtained and the significance may be assessed via the F-test


MR, Explanatory variables

• A first order model means linear in both X and β
• The parameter β0 is the intercept of the response surface plane
• The parameter β1 is the change in µ per unit increase in X1 while X2 is held constant. It is the effect of X1 on the mean response.
• When the slope of a variable (say X1) does not depend on the level of the other variable, the model is additive or non-interactive
• The parameters β1 and β2 are called partial regression coefficients because they reflect the effect of one explanatory variable when the other is held constant

Example: Framingham Offspring Study

• Suppose we want to assess the association between BMI and systolic blood pressure using data collected in the seventh examination. A total of n=3,539 participants attended the exam, and their mean systolic blood pressure was 127.3 with a standard deviation of 19.0.
• The mean BMI in the sample was 28.2 with a standard deviation of 5.3.

Regression coefficients - SLR

Independent variable   Regression coefficient   T       P value
Intercept              108.28                   62.61   0.0001
BMI                    0.67                     11.06   0.0001

The regression coefficient associated with BMI is 0.67, suggesting that each one unit increase in BMI is associated with a 0.67 unit increase in systolic blood pressure.
The association between BMI and systolic blood pressure is also statistically significant (p=0.0001).

MR

Suppose we now want to assess whether age (a continuous variable, measured in years), male gender (yes/no), and treatment for hypertension (yes/no) are potential confounders, and if so, appropriately account for these using multiple regression (ANCOVA) analysis. For analytic purposes, treatment for hypertension is coded as 1=yes and 0=no. Gender is coded as 1=male and 0=female (indicator variables).

Independent variable         Regression coefficient   T value   P value
Intercept                    68.15                    26.33     0.0001
BMI                          0.58                     10.30     0.0001
Age                          0.65                     20.22     0.0001
Male gender                  0.94                     1.58      0.1133
Treatment for hypertension   6.44                     9.74      0.0001
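Since each T value is the coefficient divided by its standard error (t = b/SE), the standard errors implied by the table above can be backed out as SE = b/t. A small sketch of that check, using the reported numbers:

```python
# (coefficient, t value) pairs taken from the multiple regression table
coefficients = {
    "BMI": (0.58, 10.30),
    "Age": (0.65, 20.22),
    "Male gender": (0.94, 1.58),
    "Treatment for hypertension": (6.44, 9.74),
}

# Implied standard error for each predictor: SE = b / t
for name, (b, t) in coefficients.items():
    print(name, round(b / t, 3))
```

The large implied SE for male gender (roughly 0.6) relative to its coefficient (0.94) is why that predictor is not significant (p=0.1133).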

Indicator variables

• Independent variables can be qualitative, e.g. gender (M/F), income group.
• Indicator or dummy variables are used to quantify the effects of the levels or classes of a qualitative variable
• If a qualitative variable has k levels, then k-1 indicator variables will be created to represent that variable
• An indicator variable has the form 1 if the characteristic occurs and 0 otherwise

MR

The multiple regression model is:
SBP = 68.15 + 0.58 (BMI) + 0.65 (Age) + 0.94 (Male gender) + 6.44 (Treatment for hypertension)

• Take note that the association between BMI and systolic blood pressure is smaller (0.58 versus 0.67) after adjustment for age, gender and treatment for hypertension.
• BMI remains statistically significantly associated with systolic blood pressure (p=0.0001), but the magnitude of the association is lower after adjustment.
• The regression coefficient decreases by 13%.
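The k-1 coding rule above can be sketched in a few lines. Here a hypothetical qualitative variable with k=3 levels ("low", "medium", "high" income) is coded into 2 indicator columns, with the first level as the reference:

```python
def dummy_code(values, levels):
    """Code a qualitative variable with k levels into k-1 indicators.

    The first entry of `levels` is the reference class and gets all
    zeros; every other level gets a 1 in its own column.
    """
    reference, *kept = levels
    return [[1 if v == level else 0 for level in kept] for v in values]

groups = ["low", "medium", "high", "medium"]
coded = dummy_code(groups, ["low", "medium", "high"])
print(coded)  # [[0, 0], [1, 0], [0, 1], [1, 0]]
```

Each regression coefficient on an indicator column is then interpreted as the difference in mean response between that level and the reference level.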


MR

• In this case the true "beginning value" was 0.58, and confounding caused it to appear to be 0.67, so the actual % change = 0.09/0.58 = 15.5%.
• Using the rule (i.e., a change in the coefficient in either direction by 10% or more), we meet the criteria for confounding.
• Thus, part of the association between BMI and systolic blood pressure is explained by age, gender and treatment for hypertension.
• It is important to keep gender in the model even though it is not significant.

Interpretation of regression coefficients

• A one unit increase in BMI is associated with a 0.58 unit increase in systolic blood pressure, holding age, gender and treatment for hypertension constant.
• Each additional year of age is associated with a 0.65 unit increase in systolic blood pressure, holding BMI, gender and treatment for hypertension constant.
• Men have higher systolic blood pressures, by approximately 0.94 units, holding BMI, age and treatment for hypertension constant, and persons on treatment for hypertension have higher systolic blood pressures, by approximately 6.44 units, holding BMI, age and gender constant.

Interpretations

• The multiple regression equation can be used to estimate systolic blood pressures as a function of a participant's BMI, age, gender and treatment for hypertension status.
• We can estimate the blood pressure of a 50 year old male, with a BMI of 25, who is not on treatment for hypertension.
• We can also estimate the blood pressure of a 50 year old female, with a BMI of 25, who is on treatment for hypertension.
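The worked arithmetic on the original slides was shown as images that did not survive extraction; the two estimates can be recomputed directly from the reported coefficients:

```python
def predict_sbp(bmi, age, male, treated):
    """Predicted systolic blood pressure from the fitted model
    SBP = 68.15 + 0.58*BMI + 0.65*Age + 0.94*Male + 6.44*Treatment."""
    return 68.15 + 0.58 * bmi + 0.65 * age + 0.94 * male + 6.44 * treated

# 50 year old male, BMI 25, not on treatment for hypertension
print(round(predict_sbp(25, 50, male=1, treated=0), 2))  # 116.09
# 50 year old female, BMI 25, on treatment for hypertension
print(round(predict_sbp(25, 50, male=0, treated=1), 2))  # 121.59
```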

ANOVA table for MR

Analysis of Variance Table

Source of Variation   Sum of Squares   Degrees of Freedom   Mean Square       F Ratio          p
Regression            Reg SS           k                    Reg SS/k          Reg MS/Res MS
Residual              Res SS           n-k-1                Res SS/(n-k-1)
Total                 Total SS         n-1


Fit of the Model

• R2 measures the usefulness or predictive value of the model
• R2 is interpreted as the proportion of the total variability explained by the model
• But it increases in value as each additional variable is added to the model
• Adjusted R2 (the preferred measure) takes into account the number of explanatory variables included in the model
• The overall F-test from the ANOVA table tests whether the proportion of variation explained by the model is a significant portion compared to the unexplained variation

Tutorial 1

• A researcher recruited 100 participants to perform a maximum VO2max test, and also recorded their "age", "weight", "heart rate" and "gender".
• Heart rate is the average of the last 5 minutes of a 20 minute, much easier, lower workload cycling test.
• The researcher's goal is to be able to predict VO2max based on these four attributes: age, weight, heart rate and gender.
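The two fit measures above can be sketched as short formulas. The numbers here (R2 = 0.80, Reg SS = 80, Res SS = 20, n = 100, k = 4) are illustrative values chosen to match the tutorial's sample size and predictor count, not results from real data:

```python
def adjusted_r2(r2, n, k):
    """Adjusted R2 penalises R2 for the number of predictors k."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)

def f_ratio(reg_ss, res_ss, n, k):
    """Overall F statistic: (Reg SS / k) / (Res SS / (n - k - 1))."""
    return (reg_ss / k) / (res_ss / (n - k - 1))

print(round(adjusted_r2(0.80, 100, 4), 4))  # 0.7916
print(round(f_ratio(80, 20, 100, 4), 1))    # 95.0
```

Note that adjusted R2 is always at most R2, and the gap widens as more predictors are added for a fixed sample size.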

Multiple Regression Outputs using SPSS

Questions
• 1. Interpret the results from the ANOVA table (3 marks)
• 2. Explain the fitness of the model (3 marks)
• 3. Interpret the importance of each coefficient or predictor (8 marks)
• 4. State the regression equation (4 marks)
• 5. Write a summary of the results (6 marks)

Assumptions

• Random sampling
• Observations must be independent
• The relation between each of the explanatory variables and the outcome variable should be linear
• The values of the response variable Y should have a normal distribution for specified values of the explanatory variables
• The variability of Y should be the same for any set of values of the explanatory variables - Homoscedasticity

Assumptions cont'

• Such assumptions are tested by:
• Assessing normality
• Obtaining scatter plots of Y or the residuals
• Obtaining a plot of the standardized residuals against the fitted values to assess the constant variance


Interactions

• Interactions exist between 2 explanatory variables if the relationship between the mean response and one explanatory variable is dependent on the value of the other explanatory variable

Interactions: Example

• In a chemical process, the additive effects of 2 drugs are not an accurate reflection of their combined effect, since catalytic effects are often present.
• A hospital administrator used data from 15 patients to examine the relationship between the length of stay in hospital Y (in days), the age of patient X1 and previous admissions X2
• It would be appropriate to use the model Y = β0 + β1X1 + β2X2 + e

Interaction…

• To describe the LOS data, the equation E(Y) = (β0 + β1X1) + β2X2 assumes that for a fixed value of X1, the straight line relating E(Y) to X2 has a slope β2 that is independent of the fixed value of X1, i.e. for 2 different values of X1, the slopes of the straight lines relating E(Y) to X2 would be the same.
• Similarly for E(Y) = (β0 + β2X2) + β1X1
• Such a model assumes no interaction exists between X1 and X2

Interaction…

Consider the model
• Y = β0 + β1X1 + β2X2 + β3X1X2 + e
• This uses a cross product or interaction term β3X1X2
• The equation E(Y) = (β0 + β1X1) + (β2 + β3X1)X2 = (β0 + β2X2) + (β1 + β3X2)X1
• Assumes that for a fixed value of X1, the straight line relating E(Y) to X2 has a slope β2 + β3X1 that is dependent upon the fixed value of X1.
• That is, for two different fixed values of X1, the slopes of the straight lines relating E(Y) to X2 would be different.

Interactions

• Conversely, the equation assumes that for a fixed value of X2, the straight line relating E(Y) to X1 has a slope β1 + β3X2 that is dependent upon the fixed value of X2.
• That is, for 2 different values of X2, the slopes of the straight lines relating E(Y) to X1 would be different.
• To fit the interaction model, define an extra column X1X2 = age * previous admissions

Interactions example…

• The least squares estimates are β0 = -15.88, β1 = 1.734, β2 = 7.911, β3 = -0.245.
• Suppose someone is 40 years old and has had two previous admissions; determine the LOS.


Interactions example…

• The least squares estimates are β0 = -15.88, β1 = 1.734, β2 = 7.911, β3 = -0.245.
• Suppose someone is 40 years old and has had two previous admissions; determine the LOS:
• 49.702 days
• 53.09 days
• 69.057 days
• 50.103 days

Advantages of fitting MR

1. Since both types of patients assume equal slopes and the same error variance, the common slope β1 can best be estimated by pooling all the patients together
2. Comparing different levels of qualitative variables can be done by tests on the regression coefficient β2
3. Inferences on β0 and β2 can be made more precisely, since more degrees of freedom will be associated with the MSE
• Interactions can be introduced into the model
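The multiple-choice LOS question above can be checked by plugging the reported least squares estimates into the interaction model Y = β0 + β1X1 + β2X2 + β3X1X2:

```python
def predict_los(age, prev_admissions):
    """Predicted length of stay (days) from the fitted interaction model."""
    b0, b1, b2, b3 = -15.88, 1.734, 7.911, -0.245
    return b0 + b1 * age + b2 * prev_admissions + b3 * age * prev_admissions

# 40 years old with two previous admissions
print(round(predict_los(40, 2), 3))  # 49.702
```

So the first option, 49.702 days, is the correct answer. Note the negative β3: each previous admission reduces the slope of LOS on age by 0.245 days per year.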
