
Chapter 2.

Multiple Regression
(Course: Econometrics)

Phuong Le

Faculty of Economic Mathematics


University of Economics and Law
Vietnam National University, Ho Chi Minh City
Content

1 Multiple regression model


Introduction
Least squares method
Multiple coefficient of determination

2 Hypothesis testing and interval estimation


Testing for significance
Interval estimation

3 Model selection
Information criteria
Wald test
Introduction

Multiple regression model


The equation that describes how the dependent variable y is related
to p independent variables x1, x2, . . . , xp and an error term ε is:

y = β0 + β1 x1 + β2 x2 + ... + βp xp + ε,

where
• β0, β1, . . . , βp are the parameters (there are k = p + 1 parameters),
• ε is a random variable called the error term.

Multiple Regression Equation


The equation that describes how the mean value of y is related to
x1 , x2 , . . . , xp is:

E(y) = β0 + β1 x1 + β2 x2 + ... + βp xp .
Introduction
Matrix representation

Y = X β + ε,
where

$$
X = \begin{pmatrix}
1 & x_{11} & x_{21} & \cdots & x_{p1} \\
1 & x_{12} & x_{22} & \cdots & x_{p2} \\
\vdots & \vdots & \vdots & & \vdots \\
1 & x_{1n} & x_{2n} & \cdots & x_{pn}
\end{pmatrix},
\qquad
Y = \begin{pmatrix} y_1 \\ y_2 \\ \vdots \\ y_n \end{pmatrix},
$$

$$
\beta = \begin{pmatrix} \beta_0 \\ \beta_1 \\ \vdots \\ \beta_p \end{pmatrix},
\qquad
\varepsilon = \begin{pmatrix} \varepsilon_1 \\ \varepsilon_2 \\ \vdots \\ \varepsilon_n \end{pmatrix}.
$$

Note: (x1j , x2j , . . . , xpj , yj ) is the j-th observation for j = 1, 2, . . . , n.


Introduction
Estimated Multiple Regression Equation

ŷ = β̂0 + β̂1 x1 + β̂2 x2 + ... + β̂p xp .


A simple random sample is used to compute sample statistics
β̂0 , β̂1 , . . . , β̂p that are used as the point estimators of the parameters
β0 , β1 , . . . , βp .

Matrix representation

Y = X β̂ + e,

where

$$
\hat\beta = \begin{pmatrix} \hat\beta_0 \\ \hat\beta_1 \\ \vdots \\ \hat\beta_p \end{pmatrix},
\qquad
e = \begin{pmatrix} e_1 \\ e_2 \\ \vdots \\ e_n \end{pmatrix}.
$$
Some multiple regression functions

Cobb-Douglas production functions

Cobb-Douglas production functions are represented as

$$y_i = \beta_0\, x_{1i}^{\beta_1}\, x_{2i}^{\beta_2}\, e^{\varepsilon_i},$$

where yi: production, x1i: capital, x2i: labor, εi: error.

Taking logarithms, the Cobb-Douglas production function transforms to

$$\ln y_i = \ln\beta_0 + \beta_1 \ln x_{1i} + \beta_2 \ln x_{2i} + \varepsilon_i.$$

With ln β0 treated as the intercept, this is a multiple regression model for ln y, ln x1 and ln x2 (see the sketch below).

Quadratic regression function

$$y_i = \beta_0 + \beta_1 x_i + \beta_2 x_i^2 + \varepsilon_i.$$

This is a multiple regression model for y, x and x².
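To make the log transformation concrete, here is a minimal Python sketch (the course itself works in STATA; this is purely illustrative). It simulates Cobb-Douglas data with assumed parameters β0 = 2.0, β1 = 0.3, β2 = 0.6 and recovers them by an OLS regression of ln y on ln x1 and ln x2; all variable names are invented for the example.

```python
import numpy as np

# Simulated Cobb-Douglas data; the parameter values are assumptions
rng = np.random.default_rng(0)
n = 200
capital = rng.uniform(1.0, 10.0, n)        # x1: capital
labor = rng.uniform(1.0, 10.0, n)          # x2: labor
eps = rng.normal(0.0, 0.1, n)              # error term
y = 2.0 * capital**0.3 * labor**0.6 * np.exp(eps)

# After taking logs, the model is linear in ln x1 and ln x2
X = np.column_stack([np.ones(n), np.log(capital), np.log(labor)])
coef, *_ = np.linalg.lstsq(X, np.log(y), rcond=None)
print(coef)  # approximately [ln 2.0, 0.3, 0.6]
```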
Least squares method
Least squares criterion

$$\sum e_i^2 = \sum (y_i - \hat y_i)^2 = \sum \big( y_i - \hat\beta_0 - \hat\beta_1 x_{1i} - \hat\beta_2 x_{2i} - \dots - \hat\beta_p x_{pi} \big)^2 \to \min.$$

Solving this, we get the OLS formula

$$\hat\beta = (X^T X)^{-1} X^T Y.$$

Computation of coefficient values


• The formulas for the regression coefficients β̂0 , β̂1 , . . . , β̂p involve
the use of matrix algebra. We will rely on computer software
packages to perform the calculations.
• The emphasis will be on how to interpret the computer output
rather than on how to make the multiple regression
computations.
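As an illustration of the formula above, the following numpy sketch builds the design matrix X and computes β̂ = (XᵀX)⁻¹XᵀY on simulated data. The variable names and true coefficients mimic Example 1 below but are assumptions, not the firm's actual data.

```python
import numpy as np

# Simulated stand-in for the salary data (names and values are illustrative)
rng = np.random.default_rng(1)
n = 20
experience = rng.uniform(1.0, 10.0, n)
test_score = rng.uniform(60.0, 100.0, n)
salary = 3.0 + 1.4 * experience + 0.25 * test_score + rng.normal(0.0, 2.0, n)

# Design matrix: a leading column of ones carries the intercept beta0
X = np.column_stack([np.ones(n), experience, test_score])
Y = salary

# OLS: beta_hat = (X'X)^(-1) X'Y; solving the normal equations is
# numerically safer than forming the inverse explicitly
beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)
print(beta_hat)  # estimates of (beta0, beta1, beta2)
```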
Least squares method

Example 1. A software firm collected data for a sample of 20


computer programmers. A suggestion was made that regression
analysis could be used to determine if salary was related to the years
of experience and the score on the firm’s programmer aptitude test.

The years of experience, score on the aptitude test, and


corresponding annual salary ($1000s) for a sample of 20
programmers is shown on the next slide.
Least squares method

(The data table for the 20 programmers is not reproduced here.)

Suppose we believe that salary (Salary) is related to the years of


experience (Experience) and the score on the programmer aptitude
test (TestScore) by the following regression model

Salary = β0 + β1 Experience + β2 TestScore + ε,

where
• Salary: annual salary ($1000s),
• Experience: years of experience,
• TestScore: score on programmer aptitude test.
Least squares method
STATA code: regress Salary Experience TestScore
Result: STATA regression output (not reproduced here). Annotations on the output:
• In the ANOVA table, MSR = RSS/p and MSE = ESS/(n − p − 1); Root MSE is the square root of MSE.
• p is the total number of independent variables, so the residual degrees of freedom are n − p − 1 and the total degrees of freedom are n − 1.
• Salary is the dependent variable; Experience and TestScore are the independent variables.

Estimated Regression Equation

Predicted Salary = 3.174 + 1.404 · Experience + 0.251 · TestScore.

(Note: predicted salary will be in thousands of dollars.)


Least squares method
Interpretation of parameters
In multiple regression analysis, we interpret each regression coefficient as follows: β̂i is an estimate of the change in y corresponding to a one-unit increase in xi, holding all other independent variables constant.

Example 2. Interpretation of the parameters in Example 1:

Predicted Salary = 3.174 + 1.404 · Experience + 0.251 · TestScore.

• Salary is expected to increase by $1,404 for each additional year of experience (holding the programmer aptitude test score constant).
• Salary is expected to increase by $251 for each additional point scored on the programmer aptitude test (holding years of experience constant).
Multiple coefficient of determination
• Total Sum of Squares:

$$TSS = \sum (y_i - \bar y)^2 = \sum y_i^2 - n\bar y^2 = Y^T Y - n\bar y^2.$$

• Sum of Squares due to Regression:

$$RSS = \sum (\hat y_i - \bar y)^2 = \hat\beta^T X^T Y - n\bar y^2.$$

• Sum of Squares due to Errors:

$$ESS = \sum (y_i - \hat y_i)^2 = e^T e = Y^T Y - \hat\beta^T X^T Y.$$

Relationship among TSS, RSS and ESS: TSS = RSS + ESS.

Multiple Coefficient of Determination:

$$R^2 = \frac{RSS}{TSS}.$$

These values can be found in the ANOVA part of the regression output.
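A short sketch computing the three sums of squares directly from their definitions and checking TSS = RSS + ESS; it reuses the simulated setup from the earlier OLS sketch, so the data are again illustrative.

```python
import numpy as np

rng = np.random.default_rng(1)
n, p = 20, 2
X = np.column_stack([np.ones(n), rng.uniform(1, 10, n), rng.uniform(60, 100, n)])
Y = X @ np.array([3.0, 1.4, 0.25]) + rng.normal(0.0, 2.0, n)
beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)
Y_hat = X @ beta_hat

TSS = np.sum((Y - Y.mean()) ** 2)      # total sum of squares
RSS = np.sum((Y_hat - Y.mean()) ** 2)  # regression sum of squares
ESS = np.sum((Y - Y_hat) ** 2)         # error sum of squares
assert np.isclose(TSS, RSS + ESS)      # decomposition holds with an intercept

R2 = RSS / TSS
print(R2)
```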


Multiple coefficient of determination
Adjusted Multiple Coefficient of Determination
• Adding independent variables, even ones that are not statistically significant, causes the prediction errors to become smaller, thus reducing ESS.
• Because RSS = TSS − ESS, when ESS becomes smaller, RSS becomes larger, causing R² = RSS/TSS to increase.
• The adjusted multiple coefficient of determination compensates for the number of independent variables in the model:

$$\bar R^2 := 1 - (1 - R^2)\,\frac{n-1}{n-p-1}.$$

Example 3. From the result of Example 1:

$$R^2 = \frac{RSS}{TSS} = \frac{500.32853}{599.7855} = 0.8342,$$

$$\bar R^2 = 1 - (1 - R^2)\,\frac{n-1}{n-p-1} = 1 - (1 - 0.8342)\,\frac{20-1}{20-2-1} = 0.8147.$$
Testing for significance

Assumptions About the Error Term ε


• The error ε is a random variable with mean zero.
• The variance of ε, denoted by σ², is the same for all values of the independent variables.
• The values of ε are independent.
• The error ε is a normally distributed random variable reflecting
the deviation between the y value and the expected value of y
given by β0 + β1 x1 + β2 x2 + ... + βp xp .
Testing for significance
• In simple linear regression, the F and t tests provide the same
conclusion.
• In multiple regression, the F and t tests have different purposes.
Testing for Significance: F Test
• The F test is used to determine whether a significant relationship
exists between the dependent variable and the set of all the
independent variables.
• The F test is referred to as the test for overall significance.

Testing for Significance: t Test


• If the F test shows an overall significance, the t test is used to
determine whether each of the individual independent variables
is significant.
• A separate t test is conducted for each of the independent
variables in the model.
• We refer to each of these t tests as a test for individual
significance.
Testing for significance

F Test for Overall Significance


1 Hypotheses:
H0 : β1 = β2 = · · · = βp = 0,
Ha : One or more of the parameters is not equal to zero.
2 Test statistic:

$$F = \frac{RSS/p}{ESS/(n-p-1)} = \frac{(n-p-1)\,R^2}{p\,(1-R^2)}.$$

3 Rejection rule: Reject H0 if p-value < α or if F ≥ Fα(p, n − p − 1).

Notes: We usually denote MSR = RSS/p and MSE = ESS/(n − p − 1). Hence F = MSR/MSE (see the sketch below).
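The F statistic of the next example can be reproduced from the sums of squares reported on the slides; only the critical value and p-value below come from scipy's F distribution.

```python
from scipy import stats

# Sums of squares taken from Example 1 on the slides
n, p = 20, 2
RSS, ESS = 500.32853, 99.4569697

MSR = RSS / p                               # mean square due to regression
MSE = ESS / (n - p - 1)                     # mean square error
F = MSR / MSE                               # about 42.76
p_value = stats.f.sf(F, p, n - p - 1)       # upper-tail p-value
F_crit = stats.f.ppf(0.95, p, n - p - 1)    # F_0.05(2, 17), about 3.59
print(F, p_value, F > F_crit)               # reject H0 when F >= F_crit
```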
Testing for significance

Example 4. From the result of Example 1, we want to test for the overall significance of the model at the significance level 0.05.
• Hypotheses:
H0 : β1 = β2 = 0,
Ha : One or more of the parameters is not equal to zero.
• Test statistic:

$$F = \frac{RSS/p}{ESS/(n-p-1)} = \frac{500.32853/2}{99.4569697/17} = 42.76.$$

• Fα (p, n − p − 1) = F0.05 (2, 17) = 3.592.


Since F > Fα (p, n − p − 1), we reject H0 .
Testing for significance

Estimation of standard errors for coefficients

The covariance matrix of β̂:

$$\mathrm{Var}(\hat\beta) = \sigma^2 (X^T X)^{-1}.$$

Let c_ii be the (i, i) entry of the matrix (XᵀX)⁻¹. Then

$$\sigma_{\hat\beta_i}^2 = \sigma^2 c_{ii} \approx s^2 c_{ii},$$

where s² = MSE = ESS/(n − p − 1). Hence we can estimate σ_β̂i by

$$se(\hat\beta_i) = s\sqrt{c_{ii}}.$$
Testing for significance
t Test for Significance of Individual Parameters
For a given number β∗:
1 Hypotheses:
H0: βi = β∗,
Ha: βi ≠ β∗.
2 Test statistic:

$$t = \frac{\hat\beta_i - \beta^*}{se(\hat\beta_i)}.$$

3 Rejection rule: Reject H0 if p-value < α or if |t| > tα/2(n − p − 1).

Notes:
• The t statistics reported by STATA and other statistical software correspond to the case β∗ = 0 (see the sketch below).
• For Ha: βi > β∗, we reject H0 if t > tα(n − p − 1).
• For Ha: βi < β∗, we reject H0 if t < −tα(n − p − 1).
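A sketch that computes se(β̂i) = s·√cii from the diagonal of s²(XᵀX)⁻¹ and then the t statistics for H0: βi = 0, on the same simulated data as before (illustrative, not the programmer data).

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n, p = 20, 2
X = np.column_stack([np.ones(n), rng.uniform(1, 10, n), rng.uniform(60, 100, n)])
Y = X @ np.array([3.0, 1.4, 0.25]) + rng.normal(0.0, 2.0, n)

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y
s2 = np.sum((Y - X @ beta_hat) ** 2) / (n - p - 1)   # s^2 = MSE
se = np.sqrt(s2 * np.diag(XtX_inv))                  # se(beta_i) = s * sqrt(c_ii)

t = beta_hat / se                                    # t statistics for H0: beta_i = 0
p_values = 2 * stats.t.sf(np.abs(t), n - p - 1)      # two-sided p-values
print(np.column_stack([beta_hat, se, t, p_values]))
```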
Testing for significance

Example 5. From the result of Example 1, we want to test whether TestScore is significant, using the significance level 5%.
• Hypotheses:
H0: β2 = 0,
Ha: β2 ≠ 0.
• Test statistic:

$$t = \frac{\hat\beta_2}{se(\hat\beta_2)} = \frac{0.2508854}{0.0773541} = 3.24$$

(this t statistic has already been computed by STATA).
• tα/2(n − p − 1) = t0.025(17) = 2.11.
Since |t| > tα/2(n − p − 1), we reject H0.
Alternatively, we can compare the corresponding p-value computed by STATA (0.005) to α = 5% to conclude that H0 should be rejected.
Testing for significance

Example 6. From the result of Example 1, we want to test whether one more year of experience produces an extra $1,000 in salary, using the significance level 5%.
• Hypotheses:
H0: β1 = 1,
Ha: β1 ≠ 1.
• Test statistic:

$$t = \frac{\hat\beta_1 - 1}{se(\hat\beta_1)} = \frac{1.403902 - 1}{0.1985669} = 2.03.$$

• tα/2 (n − p − 1) = t0.025 (17) = 2.11.


Since |t| < tα/2 (n − p − 1), we cannot reject H0 .
Testing for significance
t Test for a linear combination of parameters
The same procedure can be used to test for a linear combination of
parameters
r T β := r0 β0 + r1 β1 + · · · + rp βp .
For given (r0 , r1 , . . . , rp ) and β ∗ .
1 Hypotheses:
H0 : r T β = β ∗ ,
Ha : r T β ̸= β ∗ .
2 Test statistic:

$$t = \frac{r^T\hat\beta - \beta^*}{se(r^T\hat\beta)}.$$

3 Rejection rule: Reject H0 if p-value < α or if |t| > tα/2(n − p − 1).

Notes: se(rᵀβ̂) can be computed using the variance rule

$$se(r^T\hat\beta) = \sqrt{\mathrm{Var}\Big(\sum_i r_i\hat\beta_i\Big)} = \sqrt{\sum_i r_i^2\,\mathrm{Var}(\hat\beta_i) + 2\sum_{i<j} r_i r_j\,\mathrm{Cov}(\hat\beta_i, \hat\beta_j)}.$$
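A sketch of this test for the illustrative choice r = (0, 1, 1) and β∗ = 0, i.e. H0: β1 + β2 = 0; se(rᵀβ̂) is computed as √(rᵀ·Cov(β̂)·r) with Cov(β̂) estimated by s²(XᵀX)⁻¹, which is the variance rule above in matrix form.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n, p = 20, 2
X = np.column_stack([np.ones(n), rng.uniform(1, 10, n), rng.uniform(60, 100, n)])
Y = X @ np.array([3.0, 1.4, 0.25]) + rng.normal(0.0, 2.0, n)

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y
s2 = np.sum((Y - X @ beta_hat) ** 2) / (n - p - 1)
cov_beta = s2 * XtX_inv                 # estimated covariance matrix of beta_hat

r = np.array([0.0, 1.0, 1.0])           # (r0, r1, r2): test H0: beta1 + beta2 = 0
beta_star = 0.0
se_r = np.sqrt(r @ cov_beta @ r)        # se(r'beta_hat) via the variance rule
t = (r @ beta_hat - beta_star) / se_r
p_value = 2 * stats.t.sf(abs(t), n - p - 1)
print(t, p_value)
```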
Interval estimation
Confidence interval for βi
The interval estimate for βi with confidence level 1 − α is

$$\big(\hat\beta_i - t_{\alpha/2}(n-p-1)\,se(\hat\beta_i),\ \hat\beta_i + t_{\alpha/2}(n-p-1)\,se(\hat\beta_i)\big).$$

Example 7. From the result of Example 1, we want to find the 95% confidence interval for β1 (the coefficient of Experience).
The margin of error is

$$t_{\alpha/2}(n-p-1)\,se(\hat\beta_1) = t_{0.025}(17)\,se(\hat\beta_1) = 2.11 \cdot 0.1985669 = 0.419.$$

The 95% confidence interval for β1 is

$$(1.4039 - 0.419,\ 1.4039 + 0.419) = (0.9849,\ 1.8229).$$

Note: this confidence interval has already been computed by STATA.
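The interval of Example 7 can be re-derived in a few lines; the only inputs are β̂1 and se(β̂1) as reported on the slides, with the critical value taken from scipy's t distribution.

```python
from scipy import stats

# beta1_hat and its standard error, from Example 7 on the slides
n, p = 20, 2
beta1_hat, se_beta1 = 1.403902, 0.1985669

t_crit = stats.t.ppf(0.975, n - p - 1)   # t_0.025(17), about 2.11
margin = t_crit * se_beta1               # about 0.419
ci = (beta1_hat - margin, beta1_hat + margin)
print(ci)                                # about (0.985, 1.823)
```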


Prediction
Let

$$X_0 = \begin{pmatrix} 1 \\ x_{10} \\ x_{20} \\ \vdots \\ x_{p0} \end{pmatrix};$$

we want to predict the value y0 of y at X0.
• Point estimate of y0:

ŷ0 = β̂0 + β̂1 x10 + β̂2 x20 + · · · + β̂p xp0.

• Interval estimate of y0:

$$\big(\hat y_0 - t_{\alpha/2}(n-p-1)\,se(\hat y_0),\ \hat y_0 + t_{\alpha/2}(n-p-1)\,se(\hat y_0)\big),$$

where $se(\hat y_0) = \sqrt{\sigma^2_{\hat y_0}}$ and $\sigma^2_{\hat y_0} \approx s^2\, X_0^T (X^T X)^{-1} X_0$.
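A sketch of the point and interval estimates at an assumed new point X0 = (1, 5, 80)ᵀ, using the slide's formula for se(ŷ0); the data and X0 are illustrative.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n, p = 20, 2
X = np.column_stack([np.ones(n), rng.uniform(1, 10, n), rng.uniform(60, 100, n)])
Y = X @ np.array([3.0, 1.4, 0.25]) + rng.normal(0.0, 2.0, n)

XtX_inv = np.linalg.inv(X.T @ X)
beta_hat = XtX_inv @ X.T @ Y
s2 = np.sum((Y - X @ beta_hat) ** 2) / (n - p - 1)

x0 = np.array([1.0, 5.0, 80.0])              # (1, x10, x20), an assumed new point
y0_hat = x0 @ beta_hat                       # point estimate of y0
se_y0 = np.sqrt(s2 * (x0 @ XtX_inv @ x0))    # se(y0_hat) from the slide's formula
t_crit = stats.t.ppf(0.975, n - p - 1)
interval = (y0_hat - t_crit * se_y0, y0_hat + t_crit * se_y0)
print(y0_hat, interval)
```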
Information criteria
• The adjusted multiple coefficient of determination

$$\bar R^2 := 1 - (1 - R^2)\,\frac{n-1}{n-p-1} = 1 - \frac{ESS/(n-p-1)}{TSS/(n-1)}$$

(higher is better).
• Akaike information criterion

$$AIC = \frac{ESS}{n}\, e^{2(p+1)/n}$$

(smaller is better).
• Schwarz information criterion (BIC/SC)

$$BIC = \frac{ESS}{n}\, n^{(p+1)/n}$$

(smaller is better).
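A small sketch of AIC and BIC in exactly the multiplicative form given above (note that STATA's estat ic reports log-likelihood-based versions instead, so the numbers differ); the ESS values below are hypothetical.

```python
import math

def aic(ess: float, n: int, p: int) -> float:
    # AIC = (ESS / n) * e^(2(p+1)/n), as on the slide
    return (ess / n) * math.exp(2 * (p + 1) / n)

def bic(ess: float, n: int, p: int) -> float:
    # BIC = (ESS / n) * n^((p+1)/n), as on the slide
    return (ess / n) * n ** ((p + 1) / n)

# Hypothetical ESS values for two nested models fit to the same n = 20 points:
# the smaller model wins if its AIC/BIC are lower despite the larger ESS
print(aic(99.46, n=20, p=2), bic(99.46, n=20, p=2))
print(aic(105.30, n=20, p=1), bic(105.30, n=20, p=1))
```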
Information criteria
Example 8. A real estate company investigates the prices of apartments for young families. They use the following regression model:

PRICE = β0 + β1 SQFT + β2 BEDRMS + β3 BATHS + ε,

where
• PRICE: price of the apartment (in thousands of dollars),
• SQFT: area (in square feet),
• BEDRMS: number of bedrooms,
• BATHS: number of bathrooms.
Find the best linear model.
Information criteria

(Regression outputs for the candidate models, compared on their information criteria, are not reproduced here.)

Conclusion: we should use model 3 (PRICE = β0 + β1 SQFT ).


Wald test

The Wald test is an extension of the F test, used to test the significance of a group of j independent variables (2 ≤ j ≤ p).
1 Hypotheses:
H0: βp−j+1 = βp−j+2 = · · · = βp = 0,
Ha: One or more of βp−j+1, βp−j+2, . . . , βp are not equal to zero.
2 Test statistic:

$$F = \frac{(ESS_R - ESS_U)/j}{ESS_U/(n-p-1)} = \frac{(R_U^2 - R_R^2)/j}{(1 - R_U^2)/(n-p-1)},$$

where R is the restricted model (βp−j+1 = βp−j+2 = · · · = βp = 0)


and U is the unrestricted model.
3 Rejection Rule: Reject H0 if p-value < α or if F ≥ Fα (j, n − p − 1).
Note: If we choose all j = p independent variables, we obtain the
original F test.
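A sketch of the Wald F statistic computed from the restricted and unrestricted error sums of squares; all numbers are hypothetical, not the output of the PRICE example.

```python
from scipy import stats

def wald_F(ess_r: float, ess_u: float, j: int, n: int, p: int) -> float:
    """F statistic for H0 that j coefficients are jointly zero;
    p counts the regressors of the unrestricted model."""
    return ((ess_r - ess_u) / j) / (ess_u / (n - p - 1))

# Hypothetical sums of squares: dropping j = 2 regressors from a p = 3 model
n, p, j = 20, 3, 2
F = wald_F(ess_r=130.0, ess_u=99.5, j=j, n=n, p=p)
p_value = stats.f.sf(F, j, n - p - 1)
F_crit = stats.f.ppf(0.95, j, n - p - 1)
print(F, p_value, F > F_crit)   # reject H0 when F >= F_crit
```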
Wald test

Example 9. After fitting the model

PRICE = β0 + β1 SQFT + β2 BEDRMS + β3 BATHS + ε,

we want to test for the significance of the group BEDRMS, BATHS.

We cannot reject the null hypothesis H0: β2 = β3 = 0 at the 5% significance level. This confirms that we should use the model PRICE = β0 + β1 SQFT + ε.
Wald test
We can also test a linear combination of several independent variables.

Example 10. After fitting the model

PRICE = β0 + β1 SQFT + β2 BEDRMS + β3 BATHS + ε,

we want to test the hypotheses

H0: β2 + β3 = 0,
Ha: β2 + β3 ≠ 0.

We cannot reject the null hypothesis at the 5% significance level.
