Performance Parameters
Performance metrics
TEACHER: Mirko Mazzoleni
PLACE: University of Bergamo
Outline
1. Metrics
Metrics
It is extremely important to use quantitative metrics for evaluating a machine learning model.
• Until now, we relied on the cost function value for regression and classification
• Other metrics can be used to better evaluate and understand the model
• For classification: Accuracy/Precision/Recall/F1-score, ROC curves, …
• For regression: Normalized RMSE, Normalized Mean Absolute Error (NMAE), …
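As a concrete (hedged) illustration of the regression metrics above, here is a minimal NumPy sketch of normalized RMSE and NMAE with made-up arrays; normalizing by the range of the targets is one common convention, other variants divide by the mean.

```python
import numpy as np

# Hypothetical targets and predictions, just for illustration
y_true = np.array([3.0, 5.0, 2.5, 7.0, 4.5])
y_pred = np.array([2.8, 5.3, 2.9, 6.4, 4.4])

rmse = np.sqrt(np.mean((y_true - y_pred) ** 2))   # root mean squared error
mae  = np.mean(np.abs(y_true - y_pred))           # mean absolute error

# One common normalization: divide by the range of the observed targets
y_range = y_true.max() - y_true.min()
nrmse = rmse / y_range
nmae  = mae / y_range

print(f"NRMSE = {nrmse:.3f}, NMAE = {nmae:.3f}")
```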
Classification case: metrics for skewed classes
Binary (dichotomic) disease classification example
• Suppose you find that you get 1% error on the test set (99% correct diagnoses)
• However, if only 0.5% of the patients actually have the disease, a predictor that always predicts the 0 class reaches 99.5% accuracy!
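A minimal sketch of this pitfall, assuming a synthetic test set with roughly 0.5% positives: the trivial "always predict 0" rule reaches about 99.5% accuracy while never detecting a single case.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 10_000
# Synthetic skewed labels: roughly 0.5% of patients have the disease (class 1)
y_true = (rng.random(n) < 0.005).astype(int)

# Trivial predictor: always predict the majority class 0
y_pred = np.zeros(n, dtype=int)

accuracy = np.mean(y_pred == y_true)
print(f"Accuracy of 'always 0': {accuracy:.3%}")  # ~99.5%, yet no disease case is ever detected
```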
Precision and recall
Suppose that 𝑦 = 1 in the presence of a rare class that we want to detect.

Confusion matrix (rows: predicted class, columns: actual class):

                        Actual 1 (p)           Actual 0 (n)
Predicted 1 (Y)         True positive (TP)     False positive (FP)
Predicted 0 (N)         False negative (FN)    True negative (TN)

Precision (how trustworthy a positive prediction is)
Of all patients we predicted as having the disease, what fraction actually has the disease?

Precision = True Positives / # Predicted Positive = TP / (TP + FP)

Recall (how good we are at detecting the class)
Of all patients that actually have the disease, what fraction did we correctly detect as having the disease?

Recall = True Positives / # Actual Positive = TP / (TP + FN)
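A minimal sketch of the two definitions, computed directly from true labels and predictions (plain NumPy, hypothetical arrays):

```python
import numpy as np

y_true = np.array([1, 0, 1, 1, 0, 0, 1, 0])   # actual classes (1 = rare class)
y_pred = np.array([1, 0, 0, 1, 1, 0, 1, 0])   # predicted classes

tp = np.sum((y_pred == 1) & (y_true == 1))    # true positives
fp = np.sum((y_pred == 1) & (y_true == 0))    # false positives
fn = np.sum((y_pred == 0) & (y_true == 1))    # false negatives

precision = tp / (tp + fp)   # of the predicted positives, how many are truly positive
recall    = tp / (tp + fn)   # of the actual positives, how many did we detect

print(f"precision = {precision:.2f}, recall = {recall:.2f}")
```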
Trading off precision and recall
Logistic regression: 0 ≤ ℎ(𝒙) ≤ 1
• Predict 1 if ℎ(𝒙) ≥ 0.5
• Predict 0 if ℎ(𝒙) < 0.5
The threshold does not have to be 0.5, and different thresholds correspond to different confusion matrices!
Suppose we want to avoid missing too many cases of the disease (avoid false negatives):
• Decrease the threshold → higher recall, lower precision
Conversely, suppose we want to predict the disease only when we are very confident (avoid false positives):
• Increase the threshold → higher precision, lower recall
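A minimal sketch of the trade-off, assuming we already have predicted probabilities ℎ(𝒙) for a small hypothetical test set: lowering the threshold flags more cases, which raises recall and typically lowers precision.

```python
import numpy as np

y_true = np.array([1, 1, 1, 0, 0, 1, 0, 0, 0, 0])        # actual classes
h      = np.array([0.95, 0.80, 0.55, 0.50, 0.45,          # predicted probabilities h(x)
                   0.40, 0.35, 0.20, 0.10, 0.05])

def precision_recall(threshold):
    y_pred = (h >= threshold).astype(int)
    tp = np.sum((y_pred == 1) & (y_true == 1))
    fp = np.sum((y_pred == 1) & (y_true == 0))
    fn = np.sum((y_pred == 0) & (y_true == 1))
    return tp / (tp + fp), tp / (tp + fn)

for t in (0.7, 0.5, 0.3):
    p, r = precision_recall(t)
    print(f"threshold {t:.1f}: precision = {p:.2f}, recall = {r:.2f}")
```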
F1-score
It is usually better to compare models by means of one number only. The F1 score can be used to combine precision and recall:

Average = (P + R) / 2
F1 score = 2 · P · R / (P + R)

• P = 0 or R = 0 ⇒ F1 score = 0
• P = 1 and R = 1 ⇒ F1 score = 1
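A minimal sketch comparing the plain average with the F1 score (the harmonic mean of P and R): unlike the average, F1 collapses towards zero when either precision or recall is near zero.

```python
def f1_score(p, r):
    # Harmonic mean of precision and recall; 0 if either one is 0
    return 0.0 if p + r == 0 else 2 * p * r / (p + r)

for p, r in [(0.5, 0.4), (1.0, 0.02), (0.0, 1.0)]:
    print(f"P = {p:.2f}, R = {r:.2f}: average = {(p + r) / 2:.2f}, F1 = {f1_score(p, r):.2f}")
```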
Summaries of the confusion matrix
Different metrics can be computed from the confusion matrix, depending on the class of
interest (https://en.wikipedia.org/wiki/Precision_and_recall)
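As a sketch of how several of those summaries follow from the same four counts (names as on the Wikipedia page linked above; the counts below are hypothetical):

```python
def confusion_summaries(tp, fp, fn, tn):
    # A few common summaries derived from the four confusion-matrix counts
    return {
        "precision (PPV)":       tp / (tp + fp),
        "recall / sensitivity":  tp / (tp + fn),
        "specificity (TNR)":     tn / (tn + fp),
        "false positive rate":   fp / (fp + tn),
        "accuracy":              (tp + tn) / (tp + fp + fn + tn),
    }

print(confusion_summaries(tp=30, fp=10, fn=20, tn=940))
```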
Ranking instead of classifying
Classifiers such as logistic regression can output a probability of belonging to a class (or something similar).
• We can use this to rank the different instances and take actions on the cases at the top of the list
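A minimal sketch of the idea, with hypothetical instance identifiers and scores: sort by the model's score and act only on the top of the list.

```python
# Hypothetical (instance_id, predicted score) pairs from a classifier
scored = [("case-07", 0.99), ("case-03", 0.42), ("case-11", 0.98),
          ("case-05", 0.96), ("case-02", 0.90), ("case-09", 0.15)]

# Rank instances by score, highest first
ranked = sorted(scored, key=lambda pair: pair[1], reverse=True)

# Act only on the k cases at the top of the list (e.g. call them back for screening)
k = 3
top_k = ranked[:k]
print(top_k)   # [('case-07', 0.99), ('case-11', 0.98), ('case-05', 0.96)]
```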
[Figure: instances ranked by decreasing classifier score (0.99, 0.98, 0.96, 0.90, …) together with their true classes, for a test set with 100 positive and 100 negative instances; cutting the ranked list at a different point yields a different confusion matrix, e.g. TP = 1, FP = 0, FN = 99, TN = 100 after the top-ranked instance, TP = 2, FP = 0, FN = 98, TN = 100 after the top two.]
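A minimal sketch of what the figure above illustrates, with a small hypothetical ranked list: walking down the list and cutting after each instance produces a different confusion matrix at every cut-off.

```python
# Hypothetical (true class, score) pairs, already ranked by decreasing score
ranked = [(1, 0.99), (1, 0.98), (0, 0.96), (0, 0.90), (1, 0.70), (0, 0.40)]

n_pos = sum(label for label, _ in ranked)   # total actual positives
n_neg = len(ranked) - n_pos                 # total actual negatives

tp = fp = 0
for i, (label, score) in enumerate(ranked, start=1):
    tp += label                             # cumulative true positives above the cut-off
    fp += 1 - label                         # cumulative false positives above the cut-off
    fn, tn = n_pos - tp, n_neg - fp         # everything below the cut-off is predicted negative
    print(f"cut after top {i}: TP={tp} FP={fp} FN={fn} TN={tn}")
```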
ROC curves
ROC curves are a very general way to represent and compare the performance of different models (on a binary classification task).

[Figure: ROC curve, true positive rate vs. false positive rate; the top-left corner ("perfection") is the ideal classifier, the diagonal corresponds to random guessing.]

Observations
• (0, 0): always predict negative
• (1, 1): always predict positive
• Diagonal line: random classifier
• Below the diagonal line: worse than a random classifier
• Different classifiers can be compared
• Area Under the Curve (AUC): probability that a randomly chosen positive instance will be ranked ahead of a randomly chosen negative instance
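A minimal sketch with plain NumPy and hypothetical scores: ROC points are obtained by sweeping a threshold over the scores, and the AUC is estimated directly from its probabilistic definition (libraries such as scikit-learn offer ready-made functions for the same purpose).

```python
import numpy as np

y_true = np.array([1, 1, 0, 1, 0, 0, 1, 0, 0, 0])       # actual classes
scores = np.array([0.95, 0.85, 0.80, 0.60, 0.55,
                   0.45, 0.40, 0.30, 0.20, 0.10])        # classifier scores

# ROC points: sweep a threshold over every distinct score value
roc_points = []
for t in np.unique(scores)[::-1]:
    y_pred = scores >= t
    tpr = np.sum(y_pred & (y_true == 1)) / np.sum(y_true == 1)   # true positive rate
    fpr = np.sum(y_pred & (y_true == 0)) / np.sum(y_true == 0)   # false positive rate
    roc_points.append((fpr, tpr))
print(roc_points)   # moves from near (0, 0) towards (1, 1)

# AUC via its probabilistic definition:
# P(score of a randomly chosen positive > score of a randomly chosen negative)
pos, neg = scores[y_true == 1], scores[y_true == 0]
auc = np.mean(pos[:, None] > neg[None, :])
print(f"AUC = {auc:.2f}")
```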