Logistic Regression

Logistic regression is used when the dependent variable is binary. A binary outcome violates the assumptions of linear regression because the error term can then take on only two values. Weight of evidence (WoE) and information value (IV) are used to measure the strength of predictor variables, with higher values indicating a stronger relationship. Dummy variables are used to code categorical predictors.


Logistic Regression

Why do we ever need Logistic Regression?

A binary dependent variable violates the assumptions of Linear Regression!

One assumption says that the residuals should be normally distributed. The error term can take on only two values, so it is impossible for it to have a normal distribution.

It also violates the assumption of Homoscedasticity!

Homoscedasticity describes a situation in which the variance of the error term is the same across all values of the independent variables; with a binary outcome the error variance equals p(1-p), so it changes with the predicted probability.
Logistic Regression – Odds
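As a quick reference, the textbook definitions of the odds and the logistic (logit) model are:

\[ \mathrm{odds} = \frac{p}{1-p}, \qquad \operatorname{logit}(p) = \ln\!\left(\frac{p}{1-p}\right) = \beta_0 + \beta_1 x_1 + \dots + \beta_k x_k, \qquad p = \frac{1}{1 + e^{-(\beta_0 + \beta_1 x_1 + \dots + \beta_k x_k)}} \]

For example, odds of 3 mean the event is three times as likely to occur as not to occur, i.e. p = 0.75.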
Weight of Evidence (WoE) and Information Value (IV)

Weight of Evidence
The Weight of Evidence or WoE value is a widely used measure of the “strength” of a
grouping for separating good and bad risk (default). It is computed from the basic
odds ratio:
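A commonly used formulation (the group index i and the percentage notation are this sketch's own) is:

\[ \mathrm{WoE}_i = \ln\!\left(\frac{\%\,\mathrm{Goods}_i}{\%\,\mathrm{Bads}_i}\right) \]

where %Goods_i and %Bads_i are the shares of all Goods and all Bads that fall into group i.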

Information Value (IV)


The Information Value (IV) of a predictor sums the WoE over all groups, with each group's WoE weighted by the difference between its share of Goods and its share of Bads (see below).
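In the same notation as the WoE sketch above, a standard way to write the IV is:

\[ \mathrm{IV} = \sum_i \left(\%\,\mathrm{Goods}_i - \%\,\mathrm{Bads}_i\right) \times \mathrm{WoE}_i \]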
Weight of Evidence (WoE) and Information Value (IV)

According to Siddiqi (2006), by convention the values of the IV statistic can be interpreted as follows. If the IV statistic is:
• Less than 0.02, then the predictor is not useful for modeling (separating the Goods from the Bads)
• 0.02 to 0.1, then the predictor has only a weak relationship to the Goods/Bads odds ratio
• 0.1 to 0.3, then the predictor has a medium strength relationship to the Goods/Bads odds ratio
• 0.3 or higher, then the predictor has a strong relationship to the Goods/Bads odds ratio.

An IV between 0.02 and 0.1, for example, indicates only a weak relationship to the binary dependent variable. A code sketch of the WoE/IV computation follows.
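A minimal Python sketch of this WoE/IV computation, assuming a data frame with a grouping column and a 0/1 bad flag (the column names and the small example data are assumptions of the sketch, not taken from the slides):

import numpy as np
import pandas as pd

def woe_iv(df, group_col="group", bad_col="bad"):
    # per-group counts of Bads (bad_col == 1) and totals
    stats = df.groupby(group_col)[bad_col].agg(bads="sum", total="count")
    stats["goods"] = stats["total"] - stats["bads"]
    # share of all Goods / all Bads that falls into each group
    stats["pct_goods"] = stats["goods"] / stats["goods"].sum()
    stats["pct_bads"] = stats["bads"] / stats["bads"].sum()
    stats["woe"] = np.log(stats["pct_goods"] / stats["pct_bads"])
    stats["iv"] = (stats["pct_goods"] - stats["pct_bads"]) * stats["woe"]
    return stats, stats["iv"].sum()

# illustrative data: three groups of a predictor with different default rates
data = pd.DataFrame({
    "group": ["A"] * 50 + ["B"] * 30 + ["C"] * 20,
    "bad":   [0] * 45 + [1] * 5 + [0] * 24 + [1] * 6 + [0] * 12 + [1] * 8,
})
table, iv = woe_iv(data)
print(table[["woe", "iv"]])
print("IV =", round(iv, 3))   # compare against the Siddiqi thresholds above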


What are Dummy Variables, Design Variables, Boolean Indicators and Proxies?

These are all synonyms for dummy variables.

Categorical variables – Male/Female, High/Low bank balance, etc.

They are coded with 1 and 0.


Class Class_Dummy1 Class_Dummy2
1 1 0
1 1 0
1 1 0
2 0 1
2 0 1
2 0 1
3 0 0
3 0 0
3 0 0
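A minimal pandas sketch that reproduces this coding, with Class 3 as the reference level (the dummy column names are assumptions chosen to mirror the table above):

import pandas as pd

# reproduce the table above: Class 3 is the reference category (all dummies = 0)
df = pd.DataFrame({"Class": [1, 1, 1, 2, 2, 2, 3, 3, 3]})
dummies = pd.get_dummies(df["Class"], prefix="Class_Dummy").astype(int)
dummies = dummies[["Class_Dummy_1", "Class_Dummy_2"]]   # drop the Class 3 column
print(pd.concat([df, dummies], axis=1))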
Results and Interpretation

Interpreting the p-values of the independent variables – predictors with a p-value less than 0.05 (alpha) should be retained in the model; otherwise remove them from the model!

Analysis of Maximum Likelihood Estimates

Parameter   DF   Estimate   Standard Error   Wald Chi-Square   Pr > ChiSq
Intercept    1   -2.6516    0.6748           15.4424           <.0001
blackd       1    0.5952    0.3939            2.2827           0.1308
whitvic      1    0.2565    0.4002            0.4107           0.5216
serious      1    0.1871    0.0612            9.3342           0.0022
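This layout resembles SAS PROC LOGISTIC output. A minimal Python sketch that produces an analogous table with statsmodels follows; the outcome name 'death' and the synthetic data are assumptions of the sketch, and only the predictor names (blackd, whitvic, serious) come from the table above.

import numpy as np
import pandas as pd
import statsmodels.api as sm

# synthetic stand-in data; the real data behind the table above is not available here
rng = np.random.default_rng(0)
n = 147
df = pd.DataFrame({
    "blackd": rng.integers(0, 2, n),    # binary predictor (1/0)
    "whitvic": rng.integers(0, 2, n),   # binary predictor (1/0)
    "serious": rng.uniform(1, 15, n),   # continuous seriousness score
})
# 'death' is a placeholder name for the binary dependent variable
logit_p = -2.65 + 0.60 * df["blackd"] + 0.26 * df["whitvic"] + 0.19 * df["serious"]
df["death"] = rng.binomial(1, 1 / (1 + np.exp(-logit_p)))

X = sm.add_constant(df[["blackd", "whitvic", "serious"]])
result = sm.Logit(df["death"], X).fit(disp=False)
print(result.summary())   # estimates, standard errors, z (Wald) statistics, p-values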
Baseline, R Square, Max-rescaled R Square and C

What is R square?
What is the R square of Logistic Regression?

It measures how much the goodness of fit improves over the baseline (intercept-only) model!!

C statistic – based on the receiver operating characteristic (ROC) curve
Ranges from 0.5 to 1; the closer to 1, the better the model
Gini = 2 × C statistic − 1
Ranges from 0 to 1; the closer to 1, the better the model
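A minimal Python sketch of the C statistic and Gini computation (the short y_true / y_prob vectors are illustrative assumptions; in practice use the model's predicted probabilities):

from sklearn.metrics import roc_auc_score

y_true = [0, 0, 1, 0, 1, 1, 0, 1]                  # observed events (1) / non-events (0)
y_prob = [0.1, 0.3, 0.6, 0.2, 0.8, 0.7, 0.4, 0.9]  # predicted probabilities

c_stat = roc_auc_score(y_true, y_prob)   # C statistic = area under the ROC curve (0.5 to 1)
gini = 2 * c_stat - 1                    # Gini = 2 * C statistic - 1 (0 to 1)
print(f"C statistic = {c_stat:.3f}, Gini = {gini:.3f}")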
Check Multicollinearity!!

Check the VIF / Tolerance to detect multicollinearity!!
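A minimal Python sketch using statsmodels' variance_inflation_factor (the synthetic predictors x1-x3 are assumptions; x3 is deliberately made collinear with x1 so the check has something to flag):

import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.stats.outliers_influence import variance_inflation_factor

rng = np.random.default_rng(1)
X = pd.DataFrame({"x1": rng.normal(size=200), "x2": rng.normal(size=200)})
X["x3"] = 0.9 * X["x1"] + 0.1 * rng.normal(size=200)   # nearly a copy of x1

X_const = sm.add_constant(X)   # include the intercept, as in the fitted model
for i, col in enumerate(X_const.columns):
    if col == "const":
        continue
    vif = variance_inflation_factor(X_const.values, i)
    print(f"{col}: VIF = {vif:.2f}  Tolerance = {1 / vif:.3f}")   # VIF > 10 (Tolerance < 0.1) flags a problem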
Results and Interpretation – Classification Table

Prob     Correct            Incorrect          Percentages
Level    Event  Non-Event   Event  Non-Event   Correct  Sensitivity  Specificity  False POS  False NEG
0.05      30      47         23       0          77       100          67.1         43.4       0
0.1       30      53         17       0          83       100          75.7         36.2       0
0.15      30      55         15       0          85       100          78.6         33.3       0
0.2       30      60         10       0          90       100          85.7         25         0
0.25      29      61          9       1          90        96.7        87.1         23.7       1.6
0.3       25      62          8       5          87        83.3        88.6         24.2       7.5
0.35      23      62          8       7          85        76.7        88.6         25.8      10.1
0.4       23      63          7       7          86        76.7        90           23.3      10
0.45      23      63          7       7          86        76.7        90           23.3      10
0.5       23      63          7       7          86        76.7        90           23.3      10

Higher sensitivity and specificity indicate a better fit.
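A minimal Python sketch of how such a classification table can be built across probability cut-offs (the short y_true / y_prob vectors are illustrative assumptions):

import numpy as np
from sklearn.metrics import confusion_matrix

y_true = np.array([0, 0, 1, 0, 1, 1, 0, 1, 0, 1])                                # observed outcomes
y_prob = np.array([0.10, 0.35, 0.60, 0.20, 0.80, 0.55, 0.40, 0.90, 0.15, 0.45])  # model probabilities

for cutoff in np.arange(0.05, 0.55, 0.05):
    y_hat = (y_prob >= cutoff).astype(int)                # classify as event above the cut-off
    tn, fp, fn, tp = confusion_matrix(y_true, y_hat, labels=[0, 1]).ravel()
    sensitivity = tp / (tp + fn)                          # share of events classified correctly
    specificity = tn / (tn + fp)                          # share of non-events classified correctly
    print(f"{cutoff:.2f}  sensitivity={sensitivity:.3f}  specificity={specificity:.3f}")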


Results and Interpretation – Predicted Probability

Obs  CURED  INTERVENTION  DURATION  _LEVEL_  pred
1    0      0             7         1        0.42812
2    0      0             7         1        0.42812
3    0      0             6         1        0.43004
4    1      0             8         1        0.42621
5    1      1             7         1        0.71991
6    1      0             6         1        0.43004
Logistic Regression – KS Stat
KS lies between 0 and 1
The closer to 1, the better the model
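A minimal Python sketch of the KS statistic on model probabilities (the short y_true / y_prob vectors are illustrative assumptions):

import numpy as np
from scipy.stats import ks_2samp

y_true = np.array([0, 0, 1, 0, 1, 1, 0, 1, 0, 1])                                # observed outcomes
y_prob = np.array([0.10, 0.35, 0.60, 0.20, 0.80, 0.55, 0.40, 0.90, 0.15, 0.45])  # model probabilities

# KS = maximum gap between the score distributions of events and non-events
ks_stat, _ = ks_2samp(y_prob[y_true == 1], y_prob[y_true == 0])
print(f"KS = {ks_stat:.3f}")   # between 0 and 1; the closer to 1, the better the separation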
