
BDBA NOTES

Homoscedasticity and heteroscedasticity are terms used in regression analysis to describe the
variance of errors in a regression model.

Homoscedasticity:

In a regression model, homoscedasticity means that the variance of the errors (or residuals) is
constant across all levels of the predictor variables. In simpler terms, it implies that the spread
of the residuals is consistent as you move along the regression line.

Visually, when you plot the residuals against the predicted values, a pattern of equally spread
points around the horizontal line (zero) suggests homoscedasticity. It indicates that the
variability of the residuals doesn’t change significantly across the range of predicted values.

For example, consider modelling the time taken for an ice cube to melt as a function of temperature. Here, temperature is the independent variable and melting time is the dependent variable; if the scatter of melting times around the fitted line is roughly the same at every temperature, the errors are homoscedastic.

Heteroscedasticity:

Heteroscedasticity, on the other hand, occurs when the variance of the residuals is not constant
across different levels of the predictor variables. This means that the spread of the residuals
varies along the range of predicted values.

Visually, in a plot of residuals against predicted values, heteroscedasticity often appears as a funnel-like shape, where the spread of the residuals systematically widens or narrows across the range of predicted values. It implies that the model predicts some ranges of values more precisely than others, leading to unequal variability in the residuals.
Heteroscedasticity violates one of the assumptions of classical linear regression, which
assumes constant variance of errors across all levels of the predictor variables.
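
A quick way to check this in practice is to plot a fitted model's residuals against its fitted values. The R sketch below assumes a data frame named melt_data with columns temperature and melt_time (hypothetical names echoing the ice-cube example above); the Breusch-Pagan test from the lmtest package is included as one common formal check, although the notes above only describe the visual one.

# Fit a simple linear regression; melt_data, temperature and melt_time are assumed names
model <- lm(melt_time ~ temperature, data = melt_data)

# Residuals vs fitted values: an even band around zero suggests homoscedasticity,
# a funnel shape suggests heteroscedasticity
plot(fitted(model), resid(model),
     xlab = "Fitted values", ylab = "Residuals")
abline(h = 0, lty = 2)

# Optional formal check (requires the lmtest package): a small p-value from the
# Breusch-Pagan test is evidence of heteroscedasticity
library(lmtest)
bptest(model)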

What is R Software?
R is a programming language and free software environment developed by Ross Ihaka and Robert Gentleman in 1993. R offers an extensive catalog of statistical and graphical methods, including machine learning algorithms, linear regression, time series analysis, and statistical inference, to name a few. Most R libraries are written in R itself, but for computationally heavy tasks, C, C++, and Fortran code is preferred.

R is trusted not only by academia; many large companies also use the R programming language, including Uber, Google, Airbnb, and Facebook.

Data analysis with R proceeds in a series of steps: programming, transforming, discovering, modeling, and communicating the results (a small sketch of this workflow follows the list below).

 Program: R is a clear and accessible programming tool
 Transform: R is made up of a collection of libraries designed specifically for data science
 Discover: Investigate the data, refine your hypotheses, and analyze them
 Model: R provides a wide array of tools to capture the right model for your data
 Communicate: Integrate code, graphs, and outputs into a report with R Markdown, or build Shiny apps to share with the world
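
A minimal sketch of that workflow in R, using the built-in mtcars dataset purely for illustration (the notes themselves do not prescribe a dataset):

# Discover: load a built-in dataset and inspect it
data(mtcars)
summary(mtcars)

# Model: regress fuel efficiency on weight and horsepower
fit <- lm(mpg ~ wt + hp, data = mtcars)
summary(fit)

# Communicate: a diagnostic plot that could be embedded in an R Markdown report
plot(fit, which = 1)   # residuals vs fitted values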
Q) How can we handle the impact of multicollinearity? Write any two methods.

Multicollinearity refers to high correlation between independent variables in a regression model, which can make the model coefficients difficult to interpret and can reduce the reliability of the model's predictions. Here are two common methods to handle multicollinearity:

Feature Selection or Dimensionality Reduction:

Remove highly correlated variables: Identify and remove one of the variables in a pair or set
of variables that are highly correlated. This helps in reducing multicollinearity by eliminating
redundant information.
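
A rough sketch of this in R; X (a data frame of predictors), y (the response), the 0.9 cutoff, and the variable name "redundant_variable" are all illustrative assumptions, and vif() from the car package is just one common way to quantify multicollinearity:

# Pairwise correlations among the predictors
cor_matrix <- cor(X)
print(round(cor_matrix, 2))

# Variance inflation factors (car package); values above roughly 5-10 are often
# read as a sign of problematic multicollinearity
library(car)
fit <- lm(y ~ ., data = data.frame(y = y, X))
vif(fit)

# Drop one member of a highly correlated pair (e.g. |r| > 0.9)
X_reduced <- X[, setdiff(names(X), "redundant_variable")]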

Principal Component Analysis (PCA): PCA is a technique used to transform the original
variables into a smaller set of uncorrelated variables called principal components. It helps in
reducing the dimensionality of the data while preserving most of the variability present in the
dataset.
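
A minimal PCA sketch with R's built-in prcomp function; X again stands for a data frame of numeric predictors (an assumed name, not something defined in these notes):

# PCA on standardised predictors
pca <- prcomp(X, center = TRUE, scale. = TRUE)

# Proportion of variance explained by each principal component
summary(pca)

# Keep, say, the first two components as new, uncorrelated predictors
X_pca <- pca$x[, 1:2]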

Regularization Techniques:
Ridge Regression: Ridge regression adds a penalty term to the regression equation that shrinks
the coefficients of correlated variables towards zero. This helps in reducing the impact of
multicollinearity by stabilizing the coefficients.
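
One common way to fit ridge regression in R is the glmnet package (an assumption here, since the notes do not name a package); glmnet expects a numeric predictor matrix x and a response vector y, and alpha = 0 selects the ridge penalty:

library(glmnet)

x <- as.matrix(X)                        # numeric predictor matrix (X assumed as above)
cv_ridge <- cv.glmnet(x, y, alpha = 0)   # alpha = 0 gives the ridge penalty
coef(cv_ridge, s = "lambda.min")         # coefficients at the best cross-validated lambda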

Lasso Regression: Similar to Ridge, Lasso adds a penalty term, but Lasso has the additional
property of performing variable selection by forcing some of the coefficients to be exactly zero.
It can automatically eliminate some variables, effectively reducing multicollinearity.
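
With the same (assumed) glmnet setup, lasso only changes alpha to 1; coefficients shrunk exactly to zero correspond to variables the model drops:

cv_lasso <- cv.glmnet(x, y, alpha = 1)   # alpha = 1 gives the lasso penalty
coef(cv_lasso, s = "lambda.min")         # zero coefficients = eliminated variables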
