AI & ML Unit 4 Notes

Unit – IV

Combining Multiple Learners:

Ensemble Methods:

• It is a machine learning technique that combines several base models in order to produce one optimal predictive model.
• A decision tree is a good way to illustrate the idea behind ensemble methods.
• A decision tree determines the predicted value based on a series of questions and conditions.
• For example, consider a simple decision tree that decides whether an individual should play outside or not.
• The tree takes several weather factors into account, and for each factor it either makes a decision or asks another question.
• In this example, whenever it is overcast, the decision is to play.
• However, if it is raining, the tree also needs to ask whether it is windy or not. If it is windy, do not play.
• But if there is no wind, the individual is ready to go outside and play.
Simple Ensemble Techniques:

• Max Voting
• Averaging
• Weighted Averaging

Max-Voting:

• The max-voting method is generally used for classification problems.


• In this technique, multiple models are used to make predictions for each data point.
• The predictions by each model are considered as a ‘Vote’.
• The prediction which we get from the majority of the models is used as the final prediction.
• For example, suppose we ask 5 of our colleagues to rate a movie (out of 5).
• Since the majority gave a rating of 4, the final rating will be taken as 4.
             Colleague 1   Colleague 2   Colleague 3   Colleague 4   Colleague 5   Final rating
Rating            5             4             5             4             4             4
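
A minimal Python sketch of max voting, using the five ratings from the table above; collections.Counter simply returns the most common vote as the final prediction.

from collections import Counter

# Predictions ("votes") from five models / colleagues for one data point
votes = [5, 4, 5, 4, 4]

# The most common prediction becomes the final prediction
final_prediction = Counter(votes).most_common(1)[0][0]
print(final_prediction)   # 4, since the majority voted 4
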
Averaging:

• Similar to the max-voting technique, multiple predictions are made for each data point in averaging.
• In this method, the average of the predictions from all the models is taken and used to make the final prediction.
• It can be used for making predictions in regression problems.
             Colleague 1   Colleague 2   Colleague 3   Colleague 4   Colleague 5   Final rating
Rating            5             4             5             4             4            4.4
Weighted Average:

• This is an extension of the averaging method.
• All models are assigned different weights defining the importance of each model for the prediction.
• The result is calculated as [(5 x 0.23) + (4 x 0.23) + (5 x 0.18) + (4 x 0.18) + (4 x 0.18)] = 4.41
             Colleague 1   Colleague 2   Colleague 3   Colleague 4   Colleague 5   Final rating
Weight           0.23          0.23          0.18          0.18          0.18
Rating            5             4             5             4             4            4.41
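
A minimal Python sketch of averaging and weighted averaging, using the ratings and weights from the tables above.

ratings = [5, 4, 5, 4, 4]
weights = [0.23, 0.23, 0.18, 0.18, 0.18]

# Simple average of all predictions (useful for regression problems)
average = sum(ratings) / len(ratings)                              # 4.4

# Weighted average: each model's prediction is scaled by its weight
weighted_average = sum(r * w for r, w in zip(ratings, weights))    # about 4.41
print(average, weighted_average)
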

Ensemble Learning:
Bagging:
• The idea behind bagging is to combine the results of multiple models to get a generalized result.
• Bagging, the short form for bootstrap aggregating, is mainly applied in supervised learning problems such as classification and regression.
• It is commonly used with decision trees and increases the accuracy of the models.
• It involves two steps, i.e., bootstrapping and aggregation.
• Bootstrapping: A random sampling technique in which samples are derived from the data using sampling with replacement.
• Aggregation: In bagging, aggregation is done to incorporate the outcomes of all the base models and combine them into a single prediction.
Advantages:
✓ Reduces variance
✓ Weak base learners are combined to form a single strong learner
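
A minimal Python sketch of the bagging procedure described above, assuming scikit-learn and NumPy are available; the dataset from make_classification is only a stand-in for real data.

import numpy as np
from collections import Counter
from sklearn.tree import DecisionTreeClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=200, random_state=0)

n_trees = 10
rng = np.random.default_rng(0)
trees = []

# Bootstrapping: each tree is trained on a sample drawn with replacement
for _ in range(n_trees):
    idx = rng.integers(0, len(X), size=len(X))
    tree = DecisionTreeClassifier().fit(X[idx], y[idx])
    trees.append(tree)

# Aggregation: majority vote over the individual trees' predictions
def bagged_predict(x):
    votes = [int(tree.predict(x.reshape(1, -1))[0]) for tree in trees]
    return Counter(votes).most_common(1)[0][0]

print(bagged_predict(X[0]), y[0])
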

Boosting:

• Boosting is an ensemble technique that learns from the mistakes of previous predictors to make better predictions in the future.
• The technique combines several weak base learners to form one strong learner.
• Boosting works by arranging weak learners in a sequence, such that each weak learner learns from the mistakes of the previous learner in the sequence, creating better predictive models.
• It is a sequential process, where each subsequent model attempts to correct the errors of the previous model.
• Boosting takes many forms, including gradient boosting, Adaptive Boosting (AdaBoost), and XGBoost (Extreme Gradient Boosting).
• AdaBoost uses weak learners in the form of decision trees that mostly include one split; these are popularly known as decision stumps.
• Gradient boosting adds predictors sequentially to the ensemble, where each new predictor corrects the errors of its predecessor, thereby increasing the model's accuracy.
• Boosting works in the following steps:
Step 1: A subset is created from the original dataset.
Step 2: Initially, all data points are given equal weights.
Step 3: A base model is created on this subset.
Step 4: This model is used to make predictions on the whole dataset.
Step 5: Errors are calculated using the actual and predicted values.
Step 6: Observations which are incorrectly predicted are given higher weights.
Step 7: Another model is created and predictions are made on the dataset.

Step 8: Similarly, multiple models are created, each correcting the errors of the previous model.
Step 9: The final model is the weighted mean of all the models.

Step 10: Thus the boosting algorithm combines a number of weak learners to form a strong learner, as in the sketch below.
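
A minimal AdaBoost-style Python sketch of the steps above, assuming scikit-learn decision stumps; the weight update and the alpha (learner weight) formula follow the standard AdaBoost recipe, and the dataset is a stand-in.

import numpy as np
from sklearn.tree import DecisionTreeClassifier
from sklearn.datasets import make_classification

X, y = make_classification(n_samples=200, random_state=0)
y = np.where(y == 0, -1, 1)          # use labels -1 / +1 for weighted voting

n_rounds = 10
w = np.full(len(X), 1 / len(X))      # Step 2: equal weights initially
stumps, alphas = [], []

for _ in range(n_rounds):
    # Steps 3-4: fit a decision stump using the current sample weights
    stump = DecisionTreeClassifier(max_depth=1).fit(X, y, sample_weight=w)
    pred = stump.predict(X)

    # Step 5: weighted error of this learner, and its voting weight alpha
    err = np.sum(w[pred != y]) / np.sum(w)
    alpha = 0.5 * np.log((1 - err) / (err + 1e-10))

    # Step 6: increase the weights of misclassified points
    w = w * np.exp(-alpha * y * pred)
    w = w / w.sum()

    stumps.append(stump)
    alphas.append(alpha)

# Steps 9-10: final prediction is the weighted (signed) vote of all stumps
def boosted_predict(X_new):
    scores = sum(a * s.predict(X_new) for a, s in zip(alphas, stumps))
    return np.sign(scores)

print(np.mean(boosted_predict(X) == y))   # training accuracy
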

Stacking:

• Stacking, another ensemble method, is often referred to as stacked generalization.
• This technique works by training a new learning algorithm to combine the predictions of several other learning algorithms.
• Stacking has been successfully implemented in regression, density estimation, distance learning, and classification.
• It can also be used to measure the error rate involved during bagging.
• Step-wise explanation for a simple stacked ensemble (a sketch follows after these steps):
Step 1: The train set is split into 10 parts.

Step 2: A base model is fitted on 9 parts and predictions are made for the 10th part. This is done for each part of the train set.

Step 3: The base model is then fitted on the whole train dataset. Using this model, predictions are made on the test set.

Step 4: Steps 2 and 3 are repeated for another base model, resulting in another set of predictions for the train set and the test set.

Step 5: The predictions from the train set are used as features to build a new model.
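
A minimal Python sketch of the stacking steps above, assuming scikit-learn; out-of-fold predictions from two base models become the features of a new (meta) model, and the dataset is a stand-in.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split, cross_val_predict
from sklearn.tree import DecisionTreeClassifier
from sklearn.neighbors import KNeighborsClassifier
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=300, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

base_models = [DecisionTreeClassifier(random_state=0), KNeighborsClassifier()]

# Steps 1-4: out-of-fold predictions on the train set, plus test-set predictions
train_meta = np.column_stack([
    cross_val_predict(m, X_train, y_train, cv=10) for m in base_models])
test_meta = np.column_stack([
    m.fit(X_train, y_train).predict(X_test) for m in base_models])

# Step 5: the predictions are used as features to build a new (meta) model
meta_model = LogisticRegression().fit(train_meta, y_train)
print(meta_model.score(test_meta, y_test))
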
Unsupervised Learning:

• Unsupervised learning is a learning method in which a machine learns without supervision.
• It is a machine learning technique in which models are not supervised using a labeled training dataset.
• It cannot be directly applied to regression or classification problems.
• The goal of unsupervised learning is to find the underlying structure of the dataset, group the data according to similarities, and represent the dataset in a compressed format.

Types of unsupervised learning:


Clustering:
o Clustering is an unsupervised method of grouping objects into clusters such that objects with the most similarities remain in one group and have few or no similarities with the objects of another group.

Association:

• An association rule is an unsupervised learning method which finds relationships between variables in a large database.
List of unsupervised learning algorithm:

Clustering:

• K-means clustering
• KNN (K-nearest neighbor)
• Gaussian Mixture model
• Expectation Maximization

Association:

• Apriori algorithm
• FP growth algorithm

Applications:

• Market Basket Analysis


• Medical Diagnosis
• Marketing
• Insurance

Advantages:

• Used for more complex tasks as compared to supervised learning.
• It is easy to get unlabeled data in comparison to labeled data.

Disadvantages:

• More difficult than supervised learning.

K-Means clustering algorithm:

• K-Means clustering is an unsupervised learning algorithm that is used to solve clustering problems in machine learning or data science.
• It groups the unlabeled dataset into different clusters.
• Here K defines the number of pre-defined clusters that need to be created in the process; if K=2, there will be two clusters, for K=3 there will be three clusters, and so on.
• It is a centroid-based algorithm, where each cluster is associated with a centroid.
• The main aim of this algorithm is to minimize the sum of distances between the data points and their corresponding cluster centroids.
• The algorithm takes the unlabeled dataset as input, divides the dataset into K clusters, and repeats the process until the cluster assignments stop changing, i.e., until it finds the best clusters.
• The k-means clustering algorithm mainly performs two tasks:
o Determines the best value for the K center points or centroids by an iterative process.
o Assigns each data point to its closest k-center.

Working:

Step-1: Select the number K to decide the number of clusters.

Step-2: Select K random points as centroids. (They may or may not be points from the input dataset.)

Step-3: Assign each data point to its closest centroid, which will form the predefined K clusters.

Step-4: Calculate the variance and place a new centroid for each cluster.

Step-5: Repeat the third step, i.e., reassign each data point to the new closest centroid of each cluster.

Step-6: If any reassignment occurred, then go to step-4, else go to FINISH.

Step-7: The model is ready.


• Suppose we have two variables M1 and M2 and their x-y scatter plot. Let's take the number of clusters K=2, i.e., we will try to group the dataset into two different clusters.
• We need to choose some random K points or centroids to form the clusters. These points can be either points from the dataset or any other points. A small sketch of this procedure follows.
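
A minimal NumPy sketch of the working steps above for the two-variable (M1, M2) example with K=2; the data array X is hypothetical.

import numpy as np

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 2))            # hypothetical (M1, M2) data points
K = 2                                    # Step 1: number of clusters

# Step 2: pick K random points from the dataset as initial centroids
centroids = X[rng.choice(len(X), K, replace=False)]

while True:
    # Steps 3 / 5: assign each point to its closest centroid
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    labels = dists.argmin(axis=1)

    # Step 4: recompute each cluster's centroid as the mean of its points
    new_centroids = np.array([X[labels == k].mean(axis=0) for k in range(K)])

    # Step 6: stop when no centroid (and hence no assignment) changes
    if np.allclose(new_centroids, centroids):
        break
    centroids = new_centroids

print(centroids)
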

Applications:

• Academic performance
• Diagnostic systems
• Search engines

Advantages:

• Simple
• Easy to implement

K-Nearest Neighbor(KNN) Algorithm:

• K-Nearest Neighbor is one of the simplest machine learning algorithms, based on the supervised learning technique.
• The K-NN algorithm assumes the similarity between the new case/data and the available cases and puts the new case into the category that is most similar to the available categories.
• The K-NN algorithm stores all the available data and classifies a new data point based on this similarity.
• The K-NN algorithm can be used for regression as well as for classification, but mostly it is used for classification problems.
• K-NN is a non-parametric algorithm, which means it does not make any assumption about the underlying data.
• It is also called a lazy learner algorithm because it does not learn from the training set immediately; instead, it stores the dataset and performs the computation at classification time.
• Example: Suppose we have an image of a creature that looks similar to a cat and a dog, but we want to know whether it is a cat or a dog. For this identification, we can use the KNN algorithm, as it works on a similarity measure. Our KNN model will find the features of the new image that are similar to the cat and dog images and, based on the most similar features, will put it in either the cat or the dog category.
• Suppose there are two categories, i.e., Category A and Category B, and we have a new data point x1; we need to determine which of these categories the data point belongs to. To solve this type of problem, we need the K-NN algorithm.

Working:

Step-1: Select the number K of neighbors.

Step-2: Calculate the Euclidean distance from the new data point to the existing data points.

Step-3: Take the K nearest neighbors as per the calculated Euclidean distance.

Step-4: Among these K neighbors, count the number of data points in each category.

Step-5: Assign the new data point to the category for which the number of neighbors is maximum.

Step-6: Our model is ready.

• Firstly, we choose the number of neighbors, say k=5.
• Next, we calculate the Euclidean distances between the new data point and the existing data points, as in the sketch below.
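
A minimal NumPy sketch of the working steps above with k=5; the two-class training data (Category A and Category B) is hypothetical.

import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_new, k=5):
    # Step 2: Euclidean distance from the new point to every training point
    dists = np.linalg.norm(X_train - x_new, axis=1)
    # Step 3: take the k nearest neighbors
    nearest = np.argsort(dists)[:k]
    # Steps 4-5: count the categories among them and pick the most frequent
    return Counter(y_train[nearest]).most_common(1)[0][0]

# Hypothetical two-class data: Category A around (1, 1), Category B around (5, 5)
rng = np.random.default_rng(0)
X_train = np.vstack([rng.normal(1, 1, (20, 2)), rng.normal(5, 1, (20, 2))])
y_train = np.array(["A"] * 20 + ["B"] * 20)

print(knn_predict(X_train, y_train, np.array([1.5, 1.0])))   # expected "A"
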

Advantages:

• Simple to implement
• Robust to noisy training data
• More effective if the training data is large
Disadvantage:

• Computation cost is high

Instance Based Learning:

• These are systems that learn the training examples by heart and then generalize to new instances based on some similarity measure.
• It builds the hypotheses from the training instances.
• It is also known as memory-based learning or lazy learning.

Gaussian Mixture Models:

• Gaussian mixture models (GMMs) are a type of machine learning algorithm.
• They are used to classify data into different categories based on probability distributions.
• Gaussian mixture models can be used in many different areas, including finance,
marketing and so much more.
• Gaussian Mixture Models (GMMs) give us more flexibility than K-Means.
• Taking an example in two dimensions, this means that the clusters can take any
kind of elliptical shape (since we have standard deviation in both the x and y
directions).
• Thus, each Gaussian distribution is assigned to a single cluster. In order to find the parameters of the Gaussian for each cluster (e.g., the mean and standard deviation), we use an optimization algorithm called Expectation-Maximization (EM).
• Gaussian mixture models (GMM) are a probabilistic concept used to model real-
world data sets.
• GMMs are a generalization of Gaussian distributions and can be used to represent
any data set that can be clustered into multiple Gaussian distributions
• The Gaussian mixture model is a probabilistic model that assumes all the data
points are generated from a mix of Gaussian distributions with unknown
parameters.
• A Gaussian mixture model can be used for clustering, which is the task of
grouping a set of data points into clusters.
• GMMs can be used to find clusters in data sets where the clusters may not be
clearly defined.
• This makes GMMs a flexible and powerful tool for clustering data.
• GMM has many applications, such as density estimation, clustering, and image
segmentation. For density estimation, GMM can be used to estimate the
probability density function of a set of data points.
• For clustering, GMM can be used to group together data points that come from
the same Gaussian distribution.
• For image segmentation, GMM can be used to partition an image into different regions.
• Gaussian mixture models can be used for a variety of use cases, including
identifying customer segments.
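
A minimal Python sketch of clustering with a Gaussian mixture, assuming scikit-learn's GaussianMixture (which fits the means, covariances, and mixing weights with EM); the two-blob data is hypothetical.

import numpy as np
from sklearn.mixture import GaussianMixture

# Hypothetical 2-D data drawn from two Gaussian "blobs"
rng = np.random.default_rng(0)
X = np.vstack([rng.normal(0, 1, (100, 2)), rng.normal(5, 2, (100, 2))])

# Fit a mixture of two Gaussians; EM estimates the means and covariances
gmm = GaussianMixture(n_components=2, random_state=0).fit(X)

labels = gmm.predict(X)            # hard cluster assignments
probs = gmm.predict_proba(X)       # soft (probabilistic) assignments
print(gmm.means_)                  # estimated cluster means
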

Advantages:

• Flexibility
• Robustness
• Speed
Disadvantages:

• Sensitivity to initialization
• High dimensional data.

Expectation Maximization:

• The Expectation-Maximization (EM) algorithm can be considered as a combination of various unsupervised machine learning algorithms, such as the K-means clustering algorithm.
• Being an iterative approach, it consists of two modes.
• In the first mode, we estimate the missing or latent variables; hence it is referred to as the Expectation step (E-step).
• The other mode is used to optimize the parameters of the model so that it can explain the data more clearly; it is known as the Maximization step (M-step).
• Expectation step: It involves the estimation of all missing values in the dataset, so that after completing this step there are no missing values.
• Maximization step: It involves the use of the data estimated in the E-step to update the parameters.

• The primary goal of the EM algorithm is to use the available observed data of the dataset to estimate the missing data and then to update the parameter values.
• Convergence means that the estimated values change only negligibly from one iteration to the next, i.e., the estimates settle down to stable values.
Steps in the EM algorithm:
Step 1: Initialize the parameter values. The system is provided with incomplete observed data, with the assumption that it comes from a specific model.
Step 2: This step is known as the E-step. It is used to estimate the values of the missing or incomplete data using the observed data.
Step 3: This step is known as the M-step. It uses the complete data obtained from the 2nd step to update the parameter values.
Step 4: The last step is to check whether the values of the latent variables are converging or not. If yes, stop the process; else repeat from step 2 until convergence occurs. (A sketch of this loop follows.)
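
A minimal NumPy/SciPy sketch of the E-step/M-step loop above for a one-dimensional mixture of two Gaussians; the data and the initial parameter values are hypothetical, and the responsibilities play the role of the "missing" (latent) cluster labels.

import numpy as np
from scipy.stats import norm

# Hypothetical 1-D data drawn from two Gaussian components
rng = np.random.default_rng(0)
x = np.concatenate([rng.normal(0, 1, 200), rng.normal(5, 1.5, 200)])

# Step 1: initialize the parameters (means, std devs, mixing weights)
mu, sigma, pi = np.array([-1.0, 1.0]), np.array([1.0, 1.0]), np.array([0.5, 0.5])

for _ in range(100):
    # Step 2 (E-step): estimate the latent assignments (responsibilities)
    dens = np.array([p * norm.pdf(x, m, s) for p, m, s in zip(pi, mu, sigma)])
    resp = dens / dens.sum(axis=0)

    # Step 3 (M-step): update the parameters using the responsibilities
    Nk = resp.sum(axis=1)
    new_mu = (resp * x).sum(axis=1) / Nk
    sigma = np.sqrt((resp * (x - new_mu[:, None]) ** 2).sum(axis=1) / Nk)
    pi = Nk / len(x)

    # Step 4: stop when the parameter estimates have converged
    if np.allclose(new_mu, mu, atol=1e-6):
        mu = new_mu
        break
    mu = new_mu

print(mu, sigma, pi)
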

Applications:

• Data clustering
• Used in Computer vision and NLP
• Used in medical and healthcare industry

Advantages:

• Easy to implement
• It often gives a closed-form solution for the M-step.

Disadvantages:

• Convergence of the EM algorithm is very slow.
• It requires both forward and backward probabilities.
