3 - Machine Learning Cheat Sheet _ DataCamp
3 - Machine Learning Cheat Sheet _ DataCamp
EN
C H E AT S H E E T S category
Richie Cotton
Webinar & podcast host, course and book author, spends all day chit-chatting about data
TO P I C S
Machine Learning
Data Science
When working with machine learning, it's easy to try them all out without understanding
what each model does, and when to use them. In this cheat sheet, you'll find a handy guide
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 1/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
describing the most widely used machine learning models, their advantages, disadvantages,
and some key use-cases.
Get started on your way to becoming a machine learnng expert by starting our Machine
Learning Scientist with Python track today.
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 2/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
Download PDF
Supervised Learning
Supervised learning models are models that map inputs to outputs, and attempt to
extrapolate patterns learned in past data on unseen data. Supervised learning models can
be either regression models, where we try to predict a continuous variable, like stock prices
—or classification models, where we try to predict a binary or multi-class variable, like
whether a customer will churn or not. In the section below, we'll explain two popular types of
supervised learning models: linear models, and tree-based models.
Become an ML Scientist
Upskill in Python to become a machine learning scientist.
Linear Models
In a nutshell, linear models create a best-fit line to predict unseen data. Linear models imply
that outputs are a linear combination of features. In this section, we'll specify commonly
used linear models in machine learning, their advantages, and disadvantages.
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 3/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Explainabl 1. Assumes
e method linearity
1. Stock
between
Price 2. Interpreta
inputs and
Prediction ble results
output
by its
A simple algorithm 2. Predicting
output 2. Sensitive
that models a linear housing
Linear relationship between coefficient to outliers
prices
Regression inputs and a
3. Faster to 3. Can
continuous numerical 3. Predicting
train than underfit
output variable customer
other with small,
lifetime
machine high-
value
learning dimension
models al data
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 4/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Interpreta
ble and
explainabl
e 1. Assumes
linearity
2. Less prone between
1. Predicting to inputs and
A simple algorithm
credit risk overfitting outputs
that models a linear score when
Logistic relationship between using 2. Can
2. Customer
Regression inputs and a overfit
churn regularizat
categorical output (1 with small,
prediction ion
or 0)
high-
3. Applicable dimension
for multi- al data
class
prediction
s
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 5/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Less prone
to
overfitting
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 6/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Less prone
1. Predicting to
housing overfitting 1. Can lead
Part of the regression
prices to poor
family — it penalizes 2. Can
interpreta
features that have low
2. Predicting handle
predictive outcomes
bility as it
Lasso clinical high-
by shrinking their can keep
Regression outcomes dimension
coefficients to zero. highly
based on al data
Can be used for correlated
classification or health 3. No need variables
regression data for feature
selection
Tree-based models
In a nutshell, tree-based models use a series of "if-then" rules to predict from decision trees.
In this section, we'll specify commonly used linear models in machine learning, their
advantages, and disadvantages.
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 7/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Customer 1. Explaina
churn ble and
prediction interpret 1. Prone to
Decision Tree models
make decision rules on 2. Credit able overfitting
Decision the features to produce score 2. Can 2. Sensitive
Tree predictions. It can be
modeling handle to outliers
used for classification or
regression 3. Disease missing
prediction values
1. Reduces
overfittin 1. Training
1. Credit
g complexit
score
y can be
An ensemble learning modeling 2. Higher
high
Random method that combines accuracy
2. Predicting
Forests the output of multiple compare 2. Not very
decision trees
housing
d to interpreta
prices
other ble
models
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 8/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Better
accuracy
compare 1. Sensitive
d to to outliers
other and can
1. Predicting
regressio therefore
car
n models cause
Gradient Boosting emissions
overfitting
Regression employs 2. It can
Gradient 2. Predicting
boosting to make handle 2. Computat
Boosting ride-
predictive models from multicolli ionally
Regression hailing
an ensemble of weak nearity expensive
predictive learners fare
It can and has
amount
handle high
non- complexit
linear y
relations
hips
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 9/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Hyperpar
1. Provides ameter
1. Churn accurate tuning
prediction results can be
Gradient Boosting
complex
algorithm that is efficient 2. Claims 2. Captures
XGBoost & flexible. Can be used processing non- 2. Does not
for both classification and in linear perform
regression tasks
insurance relations well on
hips sparse
datasets
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 10/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Can
handle 1. Can
large overfit
1. Predicting amounts due to
flight time of data leaf-wise
for airlines
2. Computa splitting
A gradient boosting
2. Predicting tional and high
framework that is
LightGBM cholesterol efficient sensitivity
designed to be more
Regressor levels
efficient than other & fast 2. Hyperpar
implementations based on training ameter
health speed tuning
data
3. Low can be
memory complex
usage
Unsupervised Learning
Unsupervised learning is about discovering general patterns in data. The most popular
example is clustering or segmenting customers and users. This type of segmentation is
generalizable and can be applied broadly, such as to documents, companies, and genes.
Unsupervised learning consists of clustering models, that learn how to group similar data
points together, or association algorithms, that group different data points based on pre-
defined rules.
Clustering models
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 11/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Requires
the
1. Scales to expected
large number of
1. Customer datasets clusters
segmentati 2. Simple to from the
K-Means is the most on beginning
implemen
widely used clustering
K-Means approach—it determines K 2. Recommen t and 2. Has
clusters based on dation interpret troubles
euclidean distances systems with
3. Results in
tight varying
clusters cluster
sizes and
densities
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 12/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. There is 1. Doesn’t
no need always
to specify result in
1. Fraud the the best
A "bottom-up" approach
detection number clustering
where each data point is of clusters
2. Document 2. Not
Hierarchical treated as its own cluster
clustering 2. The suitable
Clustering —and then the closest
two clusters are merged
based on resulting for large
together iteratively similarity dendrogr datasets
am is due to
informativ high
e complexity
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 13/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Computes
a
probabilit
y for an
observati
on 1. Requires
belonging complex
to a tuning
1. Customer
segmentati cluster 2. Requires
A probabilistic model for on 2. Can setting the
Gaussian
modeling normally identify number of
Mixture 2. Recommen
distributed clusters within
overlappi expected
Models dation
a dataset mixture
systems ng
clusters componen
ts or
3. More clusters
accurate
results
compared
to K-
means
Association
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 14/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
1. Results
are 1. Generates
intuitive many
1. Product and uninteresti
placements Interpreta ng
ble itemsets
Rule based approach that 2. Recommen
identifies the most dation 2. Exhaustive 2. Computati
Apriori frequent itemset in a given engines approach onally and
Algorithm dataset where prior as it finds memory
knowledge of frequent 3. Promotion all rules intensive.
itemset properties is used optimizatio based on Results in
n the many
confidenc overlappin
e and g item sets
support
TO P I C S
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 15/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
For Business
Related
B LO G
B LO G
A Beginner's Guide to The
Machine Learning Workflow
C H E AT- S H E E T
Supervised Machine Learning
Cheat Sheet
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 16/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
See More
LEARN
Learn Python
Learn AI
Learn Power BI
Assessments
Career Tracks
Skill Tracks
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 17/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
Courses
DATA C O U R S E S
Python Courses
R Courses
SQL Courses
Power BI Courses
Tableau Courses
Alteryx Courses
Azure Courses
AWS Courses
Excel Courses
AI Courses
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 18/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
DATA L A B
Get Started
Pricing
Security
Documentation
C E R T I F I C AT I O N
Certifications
Data Scientist
Data Analyst
Data Engineer
SQL Associate
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 19/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
Azure Fundamentals
AI Fundamentals
RESOURCES
Resource Center
Upcoming Events
Blog
Code-Alongs
Tutorials
Docs
Open Source
RDocumentation
Data Portfolio
PLANS
Pricing
For Students
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 20/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
For Business
For Universities
DataCamp Donates
FO R B U S I N E S S
Business Pricing
Teams Plan
Customer Stories
Partner Program
ABOUT
About Us
Learner Stories
Careers
Become an Instructor
Press
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 21/22
5/12/25, 11:48 AM Machine Learning Cheat Sheet | DataCamp
Leadership
Contact Us
DataCamp Español
DataCamp Português
DataCamp Deutsch
DataCamp Français
S U P PO R T
Help Center
Become an Affiliate
Privacy Policy Cookie Notice Do Not Sell My Personal Information Accessibility Security Terms of Use
https://www.datacamp.com/cheat-sheet/machine-learning-cheat-sheet?utm_source=chatgpt.com 22/22