Chapter 5: Regression: 5.1 Meaning and Purpose

This document discusses regression analysis and determining the best-fit regression line for correlated data. It introduces regression lines and using them to predict dependent variable (y) values from independent variable (x) values. The procedure is to determine the least squares regression line, where the average of the squared residuals between observed y-values and predicted y-values is minimized. Examples are provided to demonstrate calculating the regression line coefficients and using the line to predict values.


CHAPTER 5: REGRESSION

5.1 Meaning and purpose


If two quantitative variables (x and y) are correlated, the points in their scattergram
will tend to lie about a straight line; such a line is called a regression line. One is often
interested in determining what the line is, because it can be used to make reasonably reliable
predictions if the variables are correlated fairly strongly (say, r < -0.5 or r > 0.5).
One could, of course, fit a line to the data by eye, i.e. draw on the scattergram the line
which best seems to fit the data, but it is better to determine a model for the relationship in
the form of an equation for the line. The general form for the equation of a straight line may
be written as y = a + bx, where “a” and “b” are constants.
In order to distinguish observed values of y from the y-values of points on the
regression line, we write the regression line as ŷ = a + bx.

When the regression line has been determined, it can be used for predicting y-values
corresponding to given x-values, e.g. if x = x₀, what value of y (i.e. ŷ₀) would be expected?
When the line is expressed in the form y = a + bx, x is called the independent
variable, y is called the dependent variable and the line is called the regression of y on x.
The choice of which variable to regard as independent is left to the user of the model,
depending on which variable he or she wants to predict.

5.2 Procedure for determining a regression line


How can the best-fitting line be determined?
For every x-value in the dataset, define the residual as y − ŷ, i.e. the difference
between the corresponding y-value in the dataset and the y-value given by the regression
line. For any given line, the smaller the residuals are, the better will be the fit of the line
to the data points.

Notice that some residuals are positive and some are negative; in fact, their sum is
usually close to zero. In order to measure how well a line fits the data points, we can
calculate the average of the squares of the residuals (i.e. square each residual, add the squares
and divide the total by the number of residuals). If we then choose the line for which this
average is smallest, we will have chosen the least squares regression line.
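This comparison of candidate lines can be sketched in a short program. The data points and the two candidate lines below are invented for illustration; they do not come from the text.

```python
# Mean of squared residuals for a candidate line yhat = a + b*x.
# The data points and candidate lines are made-up illustrations.

def mean_squared_residual(a, b, xs, ys):
    """Average of (y - yhat)^2 over all data points."""
    residuals = [y - (a + b * x) for x, y in zip(xs, ys)]
    return sum(r * r for r in residuals) / len(residuals)

xs = [1, 2, 3, 4]
ys = [2.1, 3.9, 6.2, 7.8]

# The line with the smaller average squared residual fits better.
print(mean_squared_residual(0.0, 2.0, xs, ys))  # close to y = 2x: about 0.025
print(mean_squared_residual(1.0, 1.0, xs, ys))  # a poorer fit: about 3.375
```

The least squares regression line is, by definition, the line that makes this average as small as any line can.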
Example 5.2.1: Which of these three lines better fits the data points than either of the
others?

Clearly Line 3 is a better fit than the other two, because the residuals are generally smaller
for Line 3.
It can be shown mathematically that the values of “a” and “b” for the least squares
regression line are:

n  xy -  x  y 
b  and a  y - bx
n  x 2   x 
2

Example 5.2.2: Calculate the least squares regression line for the data in
Example 4.2.1.

Cable   No. of strands   Breaking strength    xy    x²
        in cable (x)     (tonne) (y)
A       4                15                   60    16
B       3                10                   30     9
C       2                 8                   16     4
D       5                17                   85    25
E       5                16                   80    25
Total   19               66                  271    79

5271  1966
b  2.97
579  1919 and a = 66/5 – 2.97(19/5) = 1.91

so the regression line is ŷ  1.9  3.0x
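The hand calculation can be checked with a short program that applies the formulas for b and a to the cable data. The function name `least_squares_line` is my own choice, not from the text.

```python
# Least squares coefficients via
#   b = (n*Sxy - Sx*Sy) / (n*Sxx - Sx^2)  and  a = ybar - b*xbar.
# Function name is hypothetical; the data are from Example 5.2.2.

def least_squares_line(xs, ys):
    """Return (a, b) for the least squares regression line yhat = a + b*x."""
    n = len(xs)
    sx, sy = sum(xs), sum(ys)
    sxy = sum(x * y for x, y in zip(xs, ys))
    sxx = sum(x * x for x in xs)
    b = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    a = sy / n - b * sx / n
    return a, b

xs = [4, 3, 2, 5, 5]      # number of strands in cable
ys = [15, 10, 8, 17, 16]  # breaking strength (tonne)

a, b = least_squares_line(xs, ys)
print(round(a, 2), round(b, 2))  # 1.91 2.97
```

The intermediate sums the function computes (Σx = 19, Σy = 66, Σxy = 271, Σx² = 79) match the table totals above.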

5.3 Plotting a regression line


The regression line is ŷ = a + bx.
When x = 0, ŷ = a; when x = x̄, ŷ = a + b x̄ = (ȳ − b x̄) + b x̄ = ȳ.
Thus the least squares regression line passes through the points (0, a) and (x̄, ȳ).
Although one can readily plot the second of these points, which is in the middle of the scatter
of points, it is often not convenient to plot the first one. Since one need not show the origin
in a scattergram, it is not necessary to show the y-intercept (i.e. a) either.
Example 5.3.1: Draw the least squares regression line derived in Example 5.2.2.
The regression line is ŷ = 1.9 + 3.0x, so the y-intercept is 1.9.
Also x̄ = 19/5 = 3.8 and ȳ = 66/5 = 13.2.

In practice, one may plot any convenient points in order to draw the line; it is best to
plot at least three points, which should all be in a straight line, as a check.
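The three-point check can be sketched as follows, using the fitted line from Example 5.2.2; the particular x-values chosen here are arbitrary illustrations.

```python
# Points on the fitted line yhat = 1.9 + 3.0*x from Example 5.2.2.
# The x-values (0, 2, 4) are an arbitrary convenient choice.
a, b = 1.9, 3.0

points = [(x, a + b * x) for x in (0, 2, 4)]
for x, y in points:
    print(x, y)

# Collinearity check: the slope between consecutive points should equal b.
(x0, y0), (x1, y1), (x2, y2) = points
assert abs((y1 - y0) / (x1 - x0) - b) < 1e-9
assert abs((y2 - y1) / (x2 - x1) - b) < 1e-9
```

If the three plotted points do not line up, an arithmetic slip has been made somewhere, which is exactly the check the text recommends.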

5.4 Use of a regression line for prediction


Given a particular x-value, one simply has to substitute it into the regression equation
in order to determine the expected y-value. Alternatively one may read the predicted value
off the graph of the regression line.
Example 5.4.1: Using the least squares regression line derived in Example 5.2.2,
predict the breaking strength of a 6-stranded cable.
The regression line is ŷ = 1.9 + 3.0x, so when x = 6, ŷ = 1.9 + 3.0 × 6 = 19.9.
Thus we predict the breaking strength of a 6-stranded cable will be 20 tonnes.
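The substitution is trivial to code. A minimal sketch using the fitted coefficients from Example 5.2.2 (the function name `predict` is my own):

```python
# Prediction with the fitted line yhat = 1.9 + 3.0*x from Example 5.2.2.
a, b = 1.9, 3.0

def predict(x):
    """Expected y-value for a given x under the fitted line."""
    return a + b * x

print(round(predict(6), 1))  # 19.9, i.e. a breaking strength of about 20 tonnes
```

Note that such predictions are only reasonable for x-values near the range of the data used to fit the line (here, 2 to 5 strands); 6 strands is a modest extrapolation.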
