
Chapter 2.

Multiple Regression
(Course: Econometrics)

Phuong Le

Faculty of Economic Mathematics


University of Economics and Law
Vietnam National University, Ho Chi Minh City
Content

1 Multiple regression model


Introduction
Least squares method
Multiple coefficient of determination

2 Hypothesis testing and interval estimation


Testing for significance
Interval estimation

3 Model selection
Information criteria
Wald test
Introduction
Multiple regression model
The equation that describes how the dependent variable y is related
to p independent variables x1 , x2 , . . . , xp and an error term ε is:

y = β0 + β1 x1 + β2 x2 + ... + βp xp + ε,

where
• β0, β1, . . . , βp are the parameters (there are k = p + 1 parameters in total),
• ε is a random variable called the error term.
The equation for i-th observation of the population:
yi = β0 + β1 x1i + β2 x2i + ... + βp xpi + εi , i = 1, 2, . . . , N.

Multiple Regression Equation


The equation that describes how the mean value of y is related to
x1 , x2 , . . . , xp is:

E(y) = β0 + β1 x1 + β2 x2 + ... + βp xp .
Introduction
Matrix representation

Y = Xβ + ε,

where X is the N × (p + 1) matrix whose j-th row is (1, x1j, x2j, . . . , xpj), and

Y = (y1, y2, . . . , yN)ᵀ,  β = (β0, β1, . . . , βp)ᵀ,  ε = (ε1, ε2, . . . , εN)ᵀ.

Note: (x1j, x2j, . . . , xpj, yj) is the j-th observation for j = 1, 2, . . . , N (N is the population size).
Introduction
Estimated Multiple Regression Equation

ŷ = β̂0 + β̂1 x1 + β̂2 x2 + ... + β̂p xp .


A simple random sample is used to compute sample statistics
β̂0 , β̂1 , . . . , β̂p that are used as the point estimators of the parameters
β0 , β1 , . . . , βp .

Matrix representation

Y = X β̂ + e,

where β̂ = (β̂0, β̂1, . . . , β̂p)ᵀ and e = (e1, e2, . . . , en)ᵀ, with n being the sample size and ei = yi − ŷi.
Some multiple regression functions

Cobb-Douglas production functions

Cobb-Douglas production functions are represented as

yi = β0 · x1i^β1 · x2i^β2 · e^(εi),

where yi: production, x1i: capital, x2i: labor, εi: error.

Taking logarithms, the Cobb-Douglas production function can be transformed to

ln yi = ln β0 + β1 ln x1i + β2 ln x2i + εi.

This is a multiple regression model for ln y, ln x1 and ln x2 (with intercept ln β0).

Quadratic regression function

yi = β0 + β1 xi + β2 xi² + εi.

This is a multiple regression model for y, x and x².
Least squares method
Least squares criterion

Σ ei² = Σ (yi − ŷi)² = Σ (yi − β̂0 − β̂1 x1i − β̂2 x2i − · · · − β̂p xpi)² → min.

Solving this minimization problem, we get the OLS formula

β̂ = (XᵀX)⁻¹ XᵀY.

Computation of coefficient values


• The formulas for the regression coefficients β̂0 , β̂1 , . . . , β̂p involve
the use of matrix algebra. We will rely on computer software
packages to perform the calculations.
• The emphasis will be on how to interpret the computer output
rather than on how to make the multiple regression
computations.
Least squares method

Example 1. A software firm collected data for a sample of 20


computer programmers. A suggestion was made that regression
analysis could be used to determine if salary was related to the years
of experience and the score on the firm’s programmer aptitude test.

The years of experience, score on the aptitude test, and


corresponding annual salary ($1000s) for a sample of 20
programmers is shown on the next slide.
Least squares method

Exper.  Test   Salary     Exper.  Test   Salary
(Yrs.)  Score  ($1000s)   (Yrs.)  Score  ($1000s)
4 78 24.0 9 88 38.0
7 100 43.0 2 73 26.6
1 86 23.7 10 75 36.2
5 82 34.3 5 81 31.6
8 86 35.8 6 74 29.0
10 84 38.0 8 87 34.0
0 75 22.2 4 79 30.1
1 80 23.1 6 94 33.9
6 83 30.0 3 70 28.2
6 91 33.0 3 89 30.0
The data table on years of experience, test scores, and salary.
Least squares method

Suppose we believe that salary (Salary) is related to the years of


experience (Experience) and the score on the programmer aptitude
test (TestScore) by the following regression model

Salary = β0 + β1 Experience + β2 TestScore + ε,

where
• Salary: annual salary ($1000s),
• Experience: years of experience,
• TestScore: score on programmer aptitude test.
Least squares method
STATA code: regress Salary Experience TestScore

Result:

Estimated Regression Equation

Salary^ = 3.174 + 1.404 · Experience + 0.251 · TestScore.

(Note: Predicted salary will be in thousands of dollars.)
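For readers who want to reproduce this output, a minimal Stata session is sketched below; the data values are taken from the table above, and the variable names match the regress command already shown.

    clear
    input Experience TestScore Salary
     4  78 24.0
     7 100 43.0
     1  86 23.7
     5  82 34.3
     8  86 35.8
    10  84 38.0
     0  75 22.2
     1  80 23.1
     6  83 30.0
     6  91 33.0
     9  88 38.0
     2  73 26.6
    10  75 36.2
     5  81 31.6
     6  74 29.0
     8  87 34.0
     4  79 30.1
     6  94 33.9
     3  70 28.2
     3  89 30.0
    end
    regress Salary Experience TestScore   // reproduces the estimated equation above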


Least squares method
Interpretation of parameters
In multiple regression analysis, we interpret each regression coefficient as follows: β̂i represents an estimate of the change in y corresponding to a one-unit increase in xi when all other independent variables are held constant.

Example 2. Interpretation of parameters in example 1:

Salary^ = 3.174 + 1.404 · Experience + 0.251 · TestScore.

• Salary is expected to increase by $1,404 for each additional year of experience (when the score on the programmer aptitude test is held constant).
• Salary is expected to increase by $251 for each additional point
scored on the programmer aptitude test (when the variable years
of experience is held constant).
Multiple coefficient of determination
• Total Sum of Squares:

TSS = Σ (yi − ȳ)² = Σ yi² − n ȳ² = YᵀY − n ȳ².

• Sum of Squares due to Regression:

RSS = Σ (ŷi − ȳ)² = β̂ᵀXᵀY − n ȳ².

• Sum of Squares due to Errors:

ESS = Σ (ŷi − yi)² = YᵀY − β̂ᵀXᵀY.

Relationship among TSS, RSS and ESS: TSS = RSS + ESS.

Multiple Coefficient of Determination:

R² = RSS / TSS.

These values can be found in the ANOVA output of the regression result.


Multiple coefficient of determination
Adjusted Multiple Coefficient of Determination
• Adding independent variables, even ones that are not statistically significant, causes the prediction errors to become smaller, thus reducing ESS.
• Because RSS = TSS − ESS, when ESS becomes smaller, RSS becomes larger, causing R² = RSS/TSS to increase.
• The adjusted multiple coefficient of determination compensates for the number of independent variables in the model:

R̄² = 1 − (1 − R²) · (n − 1)/(n − p − 1).

Example 3. From the result of example 1:

R² = RSS/TSS = 500.32853/599.7855 = 0.8342,

R̄² = 1 − (1 − R²) · (n − 1)/(n − p − 1) = 1 − (1 − 0.8342) · (20 − 1)/(20 − 2 − 1) = 0.8147.
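After regress, R² and the adjusted R² are stored by Stata and can be checked directly; a quick sketch, assuming the regression of example 1 is the most recent estimation:

    display e(r2)                                // R-squared, 0.8342
    display e(r2_a)                              // adjusted R-squared, 0.8147
    display 1 - (1 - e(r2))*(e(N)-1)/(e(N)-3)    // manual check, with n - p - 1 = 20 - 2 - 1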
Testing for significance

Assumptions About the Error Term ε


• Assumption 1: The relationship between y and x1 , x2 , . . . , xp is
linear.
• Assumption 2: The errors εi are random variables with zero mean:

E(εi | x1i, . . . , xpi) = 0.

• Assumption 3: The covariance matrix of ε has the form

Var(ε | x1, . . . , xp) = σ² In,

where In is the identity matrix of size n. Equivalently, Var(εi | x1i, . . . , xpi) = σ² and Cov(εi, εj | x1i, . . . , xpi, x1j, . . . , xpj) = 0 for i ≠ j.
• Assumption 4: rank(X) = p + 1. Equivalently, XᵀX is invertible.
Testing for significance

Gauss – Markov Theorem


When Assumptions 1 – 4 are satisfied, the estimates obtained using
the OLS method are the best linear unbiased estimates of the
population regression function (OLS estimates are BLUE).

Assumptions about the errors ε


To perform hypothesis testing and interval estimation in the following
sections, we need an additional assumption (Assumption 5) as
follows:
• Assumption 5: The errors ε follow a multivariate normal
distribution
ε ∼ N(0, σ 2 In ).
In particular, εi ∼ N(0, σ 2 ).
Testing for significance
• In simple linear regression, the F and t tests provide the same
conclusion.
• In multiple regression, the F and t tests have different purposes.
Testing for Significance: F Test
• The F test is used to determine whether a significant relationship
exists between the dependent variable and the set of all the
independent variables.
• The F test is referred to as the test for overall significance.

Testing for Significance: t Test


• If the F test shows overall significance, the t test is used to determine whether each individual independent variable is significant.
• A separate t test is conducted for each of the independent
variables in the model.
• We refer to each of these t tests as a test for individual
significance.
Testing for significance

F Test for Overall Significance


1 Hypotheses:
H0: β1 = β2 = · · · = βp = 0 (R² = 0),
Ha: β1² + β2² + · · · + βp² > 0 (R² > 0).
2 Test statistic:

F = (RSS/p) / (ESS/(n − p − 1)) = ((n − p − 1) R²) / (p (1 − R²)).

3 Rejection rule: Reject H0 if p-value < α or if F ≥ Fα(p, n − p − 1).

Notes: We usually denote MSR = RSS/p and MSE = ESS/(n − p − 1). Hence F = MSR/MSE.
Testing for significance

Example 4. From the result of example 1, we want to test for overall significance of the model at the significance level 0.05.
• Hypotheses:
H0: β1 = β2 = 0,
Ha: β1 ≠ 0 or β2 ≠ 0.
• Test statistic:

F = (RSS/p) / (ESS/(n − p − 1)) = (500.32853/2) / (99.4569697/17) = 42.76.

• Fα(p, n − p − 1) = F0.05(2, 17) = 3.592.
Since F > Fα(p, n − p − 1), we reject H0.
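The critical value and p-value can be obtained with Stata's built-in F distribution functions (a quick check using the numbers above):

    display invFtail(2, 17, 0.05)   // critical value F0.05(2,17) = 3.592
    display Ftail(2, 17, 42.76)     // p-value of the test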
Testing for significance

Estimation of standard errors for coefficients

The covariance matrix of β̂ is

Var(β̂) = σ² (XᵀX)⁻¹.

Let cii be the entry in cell (i, i) of the matrix (XᵀX)⁻¹. Then

Var(β̂i) = σ² cii ≈ s² cii,

where s² = MSE = ESS/(n − p − 1). Hence we can estimate the standard deviation of β̂i by

se(β̂i) = s √cii.

Note: To obtain the covariance matrix of β̂ from STATA, after fitting


the model, type the command:
matrix list e(V)
Testing for significance

t Test for Significance of Individual Parameters


Let b be a given real number.
1 Hypotheses:
H0: βi = b,  Ha: βi ≠ b.
2 Test statistic:

t = (β̂i − b) / se(β̂i).

3 Rejection rule: Reject H0 if p-value < α or if |t| > tα/2(n − p − 1).

Notes:
• The t statistics reported by STATA and other statistical software correspond to the case b = 0.
• For Ha: βi > b, we reject H0 if t > tα(n − p − 1).
• For Ha: βi < b, we reject H0 if t < −tα(n − p − 1).
Testing for significance

Example 5. From the result of example 1, we want to test if TestScore is significant, using the significance level 5%.
• Hypotheses:
H0: β2 = 0,  Ha: β2 ≠ 0.
• Test statistic:

t = β̂2 / se(β̂2) = 0.2508854 / 0.0773541 = 3.24

(this t statistic has already been computed by STATA).
• tα/2(n − p − 1) = t0.025(17) = 2.11.
Since |t| > tα/2(n − p − 1), we reject H0.
Alternatively, we can compare the corresponding p-value computed by STATA (0.005) to α = 5% to conclude that H0 should be rejected.
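Both numbers can be reproduced with Stata's t distribution functions (a quick check, using the 17 residual degrees of freedom):

    display invttail(17, 0.025)   // critical value t0.025(17) = 2.11
    display 2*ttail(17, 3.24)     // two-sided p-value, about 0.005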
Testing for significance

Example 6. From the result of example 1, we want to test if one more year of experience produces $1,000 more in annual salary, using the significance level 5%.
• Hypotheses:
H0: β1 = 1,  Ha: β1 ≠ 1.
• Test statistic:

t = (β̂1 − 1) / se(β̂1) = (1.403902 − 1) / 0.1985669 = 2.03.

• tα/2(n − p − 1) = t0.025(17) = 2.11.
Since |t| < tα/2(n − p − 1), we do not reject H0.
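Stata can run this test in one line after the regression; the reported F statistic is the square of the t statistic above:

    test Experience = 1   // Wald test of H0: coefficient on Experience equals 1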
Testing for significance
t Test for a linear combination of parameters
The same procedure can be used to test a linear combination of parameters

rᵀβ := r0 β0 + r1 β1 + · · · + rp βp

for given (r0, r1, . . . , rp) ∈ R^(p+1) and b ∈ R.
1 Hypotheses:
H0: rᵀβ = b,  Ha: rᵀβ ≠ b.
2 Test statistic:

t = (rᵀβ̂ − b) / se(rᵀβ̂).

3 Rejection rule: Reject H0 if p-value < α or if |t| > tα/2(n − p − 1).

Notes: se(rᵀβ̂) can be computed using the variance rule

se(rᵀβ̂) = √Var(Σi ri β̂i) = √( Σi ri² Var(β̂i) + 2 Σ(i<j) ri rj Cov(β̂i, β̂j) ).
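In Stata, such tests can be run without applying the variance rule by hand, using test or lincom after the regression; the combination below is purely illustrative, not one from example 1:

    lincom Experience - 2*TestScore     // point estimate, se and CI for the combination
    test Experience - 2*TestScore = 0   // the corresponding Wald test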
Interval estimation
Confidence interval for βi
An interval estimate for βi with confidence level 1 − α is

( β̂i − tα/2(n − p − 1) se(β̂i),  β̂i + tα/2(n − p − 1) se(β̂i) ).

Example 7. From the result of example 1, we want to find the 95% confidence interval for β1 (the coefficient of Experience).
The margin of error is

tα/2(n − p − 1) se(β̂1) = t0.025(17) se(β̂1) = 2.11 · 0.1985669 = 0.419.

The 95% confidence interval for β1 is

(β̂1 − 0.419, β̂1 + 0.419) = (1.4039 − 0.419, 1.4039 + 0.419) = (0.9849, 1.8229).

Note: this confidence interval has already been computed by STATA.
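The endpoints can also be recomputed from Stata's stored coefficient and standard error (a minimal sketch, assuming the regression of example 1 is the most recent estimation):

    display _b[Experience] - invttail(e(df_r), 0.025)*_se[Experience]   // lower bound
    display _b[Experience] + invttail(e(df_r), 0.025)*_se[Experience]   // upper bound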


Prediction

Given (x10, . . . , xp0), we want to predict the value y0 of y when (x1, . . . , xp) = (x10, . . . , xp0). Let

X0 = (1, x10, x20, . . . , xp0)ᵀ.

• Point estimate for y0: ŷ0 = β̂0 + β̂1 x10 + β̂2 x20 + · · · + β̂p xp0.
• Interval estimate for the mean of y0:

( ŷ0 − tα/2(n − p − 1) se(ŷ0),  ŷ0 + tα/2(n − p − 1) se(ŷ0) ),

where se(ŷ0) = s √( X0ᵀ (XᵀX)⁻¹ X0 ).
• Interval estimate for the individual value of y0 (prediction interval):

( ŷ0 − tα/2(n − p − 1) se(y0 − ŷ0),  ŷ0 + tα/2(n − p − 1) se(y0 − ŷ0) ),

where se(y0 − ŷ0) = s √( 1 + X0ᵀ (XᵀX)⁻¹ X0 ) = √( s² + (se(ŷ0))² ).
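In Stata, both standard errors are available through predict after the regression (a sketch; stdp is the standard error of the mean prediction, stdf the standard error of the forecast):

    predict yhat, xb        // point predictions
    predict se_mean, stdp   // se(yhat0), for the confidence interval of the mean
    predict se_fcst, stdf   // se(y0 - yhat0), for the prediction interval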
Information criteria
• The adjusted multiple coefficient of determination

R̄² = 1 − (1 − R²) · (n − 1)/(n − p − 1) = 1 − (ESS/(n − p − 1)) / (TSS/(n − 1))

(higher is better).
• Akaike information criterion

AIC = (ESS/n) · e^(2(p+1)/n)

(smaller is better).
• Schwarz information criterion (BIC/SC)

BIC = (ESS/n) · n^((p+1)/n)

(smaller is better).
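Stata reports log-likelihood-based versions of AIC and BIC via estat ic after a regression; they differ from the formulas above by a monotone transformation involving only n, so on a fixed sample both versions rank models the same way:

    estat ic   // reports AIC = -2lnL + 2k and BIC = -2lnL + k*ln(n)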
Information criteria
Example 8. A real estate company investigates the prices of
apartments for young families. They use the following regression
model:

PRICE = β0 + β1 SQFT + β2 BEDRMS + β3 BATHS + ε,

where
• PRICE: price of the apartment (in thousands of dollars),
• SQFT: area (in square feet),
• BEDRMS: number of bedrooms,
• BATHS: number of bathrooms.
Find the best linear model.
Information criteria

[Regression output and information criteria for the candidate models not reproduced here.]

Conclusion: we should use model 3 (PRICE = β0 + β1 SQFT + ε).


Wald test

The Wald test is an extension of the F test used to test for the significance of a group of j independent variables (2 ≤ j ≤ p).
1 Hypotheses:
H0: βp−j+1 = βp−j+2 = · · · = βp = 0,
Ha: One or more of βp−j+1, βp−j+2, . . . , βp are not equal to zero.
2 Test statistic:

F = ((ESS_R − ESS_U)/j) / (ESS_U/(n − p − 1)) = ((R²_U − R²_R)/j) / ((1 − R²_U)/(n − p − 1)),

where R denotes the restricted model (with βp−j+1 = βp−j+2 = · · · = βp = 0) and U denotes the unrestricted model.
3 Rejection rule: Reject H0 if p-value < α or if F ≥ Fα(j, n − p − 1).
Note: If we choose all j = p independent variables, we obtain the original F test.
Wald test

Example 9. After fitting the model

PRICE = β0 + β1 SQFT + β2 BEDRMS + β3 BATHS + ε,

we want to test for the significance of the group BEDRMS, BATHS.

We cannot reject the null hypothesis H0: β2 = β3 = 0 at the 5% significance level. This confirms that we should use the model PRICE = β0 + β1 SQFT + ε.
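In Stata, this joint test takes one line after fitting the full model (a sketch, assuming the variables are named as in the model):

    regress PRICE SQFT BEDRMS BATHS
    test BEDRMS BATHS   // Wald test of H0: beta2 = beta3 = 0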
Wald test
We can also test a hypothesis about a linear combination of the coefficients of a group of independent variables.

Example 10. After fitting the model

PRICE = β0 + β1 SQFT + β2 BEDRMS + β3 BATHS + ε,

we want to test for the hypothesis


H0 : β2 + β3 = 0,
Ha : β2 + β3 ̸= 0.

We cannot reject the null hypothesis at the 5% significance level.
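The corresponding Stata command after fitting the full model (a sketch):

    test BEDRMS + BATHS = 0   // Wald test of H0: beta2 + beta3 = 0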
