Lecture5 Mar22 2024
Lecture5 Mar22 2024
(PBH 711)
Y = 0 + 1X +
Constant or Slope
intercept parameter Independent or
parameter “explanatory”
variable
Simple Linear Regression Model
β0 = Intercept
20
β1 = Slope
19
age at marriage
1817
16
0 5 10 15
education in single years
Example: Simple Linear Regression Model
in STATA
• Bring the dataset provided in the last class (BDHS
IR file 2017-18)
•Multiple regression
is similar in principle to simple
regression. However, it is more complex conceptually
and computationally.
Multiple Regression Models
A Description of the Model and its Coefficients
• In multiple regressionanalysis, the dependent variable,
y, is sometimes referred to as the response variable.
ANOVA
df SS MS F p
Regression 2 8189.723 4094.86 28.63 .000
Residual (Error) 20 2861.017 143.1
Total 22 11050.74
Significance Test of the Regression
Coefficients for the Real Estate Example
H 0: 1
=0
H :
a
1
0 t.025,20 = 2.086
H 0: 2
=0
tCal = 5.63 > 2.086, reject H0.
H :
a
2
0
90 15
Residual
Percent
50 0
10 -15
1 -30
-30 -15 0 15 30 60 80 100 120 140
Residual Fitted Value
4.5 15
Frequency
Residual
3.0 0
1.5 -15
0.0 -30
-24 -12 0 12 24 2 4 6 8 10 12 14 16 18 20 22
Residual Observation Order
Definition and Interpretation of Adjusted R2
SSE
SSE
Se = n − k −1
2861
=
23 − 2 − 1
= 1196
.
where: n = number of observations
k = number of independent variables
SSE and Standard Error of the Estimate
SSYY
SSE SSR
ANOVA
df SS MS F p
Regression 2 8189.7 4094.89 28.63 .000
Residual (Error) 20 2861.0 143.1
Total 22 11050.7
SSR 8189.723
= = = .741
2
R SS yy 11050.74
SSE 2861.017
= 1− = 1− = .741
2
R SS yy 11050.74
Interpretation of
Coefficient of Multiple Determination
• The coefficientof multiple determination (R 2) is analogous to r2
in simple regression, but it requires a very different interpretation
and computation.
• R2 does not tell you which and how a given independent variable
is contributing to explaining the total variation in Y. It considers
them altogether.
Interpretation of
Coefficient of Multiple Determination