
Principal Component Analysis | Dimension Reduction


Pattern Recognition

Dimension Reduction-

In pattern recognition, dimension reduction is defined as follows-

It is the process of converting a data set with a large number of dimensions into a data set with fewer dimensions.
It ensures that the converted data set conveys similar information concisely.

Example-
 

Consider the following example-

Consider a data set with two dimensions x1 and x2, where-

x1 represents the measurement of several objects in cm.
x2 represents the measurement of the same objects in inches.


In machine learning,

Both these dimensions convey similar information.
They also introduce noise into the system.
So, it is better to use just one dimension.

Using dimension reduction techniques-

We convert the data from 2 dimensions (x1 and x2) to 1 dimension (z1).
This makes the data relatively easier to explain.

Benefits-

Dimension reduction offers several benefits, such as-

It compresses the data and thus reduces storage space requirements.
It reduces computation time, since fewer dimensions require less computation.
It eliminates redundant features.
It can improve model performance.

Dimension Reduction Techniques-


 

The two popular and well-known dimension reduction techniques are-

1. Principal Component Analysis (PCA)
2. Fisher Linear Discriminant Analysis (LDA)

In this article, we will discuss Principal Component Analysis.

Principal Component Analysis-


 

Principal Component Analysis is a well-known dimension reduction technique.

It transforms the variables into a new set of variables called principal components.
These principal components are linear combinations of the original variables and are orthogonal to each other.
The first principal component accounts for as much of the variation in the original data as possible.
The second principal component captures as much of the remaining variance as possible while staying orthogonal to the first.
For a two-dimensional data set, there can be at most two principal components.

PCA Algorithm-

The steps involved in the PCA algorithm are as follows-

Step-01: Get the data.

Step-02: Compute the mean vector (µ).

Step-03: Subtract the mean from the given data.

Step-04: Calculate the covariance matrix.

Step-05: Calculate the eigenvalues and eigenvectors of the covariance matrix.

Step-06: Choose the components and form a feature vector.

Step-07: Derive the new data set.
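
Before working the numbers by hand, here is a minimal NumPy sketch of these seven steps. The variable names are our own, it uses the data of Problem-01 below, and it follows the divide-by-n covariance convention used in the hand solution-

```python
import numpy as np

# Data from Problem-01 below: one two-dimensional pattern per row.
X = np.array([[2, 1], [3, 5], [4, 3], [5, 6], [6, 7], [7, 8]], dtype=float)

mu = X.mean(axis=0)                      # Step-02: mean vector (µ)
centered = X - mu                        # Step-03: subtract the mean
cov = (centered.T @ centered) / len(X)   # Step-04: covariance matrix (divide by n)
eigvals, eigvecs = np.linalg.eigh(cov)   # Step-05: eigenvalues and eigenvectors

order = np.argsort(eigvals)[::-1]        # Step-06: sort components by eigenvalue
principal = eigvecs[:, order[0]]

z = centered @ principal                 # Step-07: derive the new 1-D data set
print(eigvals[order])                    # approx [8.21, 0.38]
print(principal)                         # unit-length principal direction
```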


PRACTICE PROBLEMS BASED ON PRINCIPAL COMPONENT ANALYSIS-

Problem-01:

Given data = { 2, 3, 4, 5, 6, 7 ; 1, 5, 3, 6, 7, 8 }.

Compute the principal component using the PCA algorithm.

OR

Consider the two-dimensional patterns (2, 1), (3, 5), (4, 3), (5, 6), (6, 7), (7, 8).

Compute the principal component using the PCA algorithm.

OR

Compute the principal component of the following data-

CLASS 1
X = 2, 3, 4
Y = 1, 5, 3

CLASS 2
X = 5, 6, 7
Y = 6, 7, 8

Solution-
 

We use the PCA algorithm discussed above-

Step-01:
 

Get the data.

The given feature vectors are-

x1 = (2, 1)
x2 = (3, 5)
x3 = (4, 3)
x4 = (5, 6)
x5 = (6, 7)
x6 = (7, 8)

Step-02:
 


Calculate the mean vector (µ).

Mean vector (µ)

= ((2 + 3 + 4 + 5 + 6 + 7) / 6, (1 + 5 + 3 + 6 + 7 + 8) / 6)

= (4.5, 5)

Thus, µ = (4.5, 5).

Step-03:
 

Subtract the mean vector (µ) from the given feature vectors.

x1 – µ = (2 – 4.5, 1 – 5) = (-2.5, -4)
x2 – µ = (3 – 4.5, 5 – 5) = (-1.5, 0)
x3 – µ = (4 – 4.5, 3 – 5) = (-0.5, -2)
x4 – µ = (5 – 4.5, 6 – 5) = (0.5, 1)
x5 – µ = (6 – 4.5, 7 – 5) = (1.5, 2)
x6 – µ = (7 – 4.5, 8 – 5) = (2.5, 3)

These centered feature vectors are used to compute the covariance matrix in the next step.


Step-04:
 

Calculate the covariance matrix.

The covariance matrix is given by-

Covariance matrix = (m1 + m2 + m3 + m4 + m5 + m6) / 6

where each mi = (xi – µ)(xi – µ)^T is the outer product of a centered feature vector with itself.

Now, computing each mi-

m1 = | 6.25  10.00 |     m2 = | 2.25  0.00 |     m3 = | 0.25  1.00 |
     | 10.00 16.00 |          | 0.00  0.00 |          | 1.00  4.00 |

m4 = | 0.25  0.50 |      m5 = | 2.25  3.00 |     m6 = | 6.25  7.50 |
     | 0.50  1.00 |           | 3.00  4.00 |          | 7.50  9.00 |

On adding the above matrices and dividing by 6, we get-

Covariance matrix = | 2.92  3.67 |
                    | 3.67  5.67 |
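
As a quick sanity check of this matrix, here is a sketch in NumPy; np.cov with bias=True reproduces the divide-by-6 convention used here-

```python
import numpy as np

X = np.array([[2, 1], [3, 5], [4, 3], [5, 6], [6, 7], [7, 8]], dtype=float)
# rowvar=False treats the columns as the variables x and y;
# bias=True divides by n (= 6), matching the hand computation above.
print(np.cov(X, rowvar=False, bias=True))
# [[2.9167 3.6667]
#  [3.6667 5.6667]]
```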

Step-05:
 

Calculate the eigenvalues and eigenvectors of the covariance matrix.

λ is an eigenvalue of a matrix M if it is a solution of the characteristic equation |M – λI| = 0.

So, we have-

| 2.92 – λ     3.67    |
|   3.67     5.67 – λ  |  = 0

From here,

(2.92 – λ)(5.67 – λ) – (3.67 × 3.67) = 0

16.56 – 2.92λ – 5.67λ + λ² – 13.47 = 0

λ² – 8.59λ + 3.09 = 0

Solving this quadratic equation, we get λ = 8.22, 0.38.

Thus, the two eigenvalues are λ1 = 8.22 and λ2 = 0.38.
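
A one-line check of these roots (a sketch; the small difference from 8.22 comes from rounding the covariance entries before forming the quadratic)-

```python
import numpy as np

# Roots of the characteristic polynomial λ² - 8.59λ + 3.09 = 0
print(np.roots([1.0, -8.59, 3.09]))  # approx [8.21, 0.38]
```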

Clearly, the second eigenvalue is very small compared to the first eigenvalue.

So, the second eigenvector can be left out.

The eigenvector corresponding to the greatest eigenvalue is the principal component of the given data set.

So, we find the eigenvector corresponding to the eigenvalue λ1.


We use the following equation to find the eigenvector-

MX = λX

where-

M = covariance matrix
X = eigenvector
λ = eigenvalue

Substituting the values in the above equation, we get-

| 2.92  3.67 | | X1 |         | X1 |
| 3.67  5.67 | | X2 |  = 8.22 | X2 |

Solving these, we get-

2.92X1 + 3.67X2 = 8.22X1

3.67X1 + 5.67X2 = 8.22X2

On simplification, we get-

5.3X1 = 3.67X2 ………(1)

3.67X1 = 2.55X2 ………(2)


From (1) and (2), X1 = 0.69X2.

From (2), taking X2 = 3.67 gives X1 = 2.55, so the eigenvector is-

X = | 2.55 |
    | 3.67 |

Thus, the principal component for the given data set is the eigenvector (2.55, 3.67).

Lastly, we project the data points onto the new one-dimensional subspace: for each feature vector xi, the projected value is X^T (xi – µ).
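
The same eigenvector in NumPy (a sketch; note that np.linalg.eigh returns unit-length eigenvectors, so its principal direction is a rescaled version of (2.55, 3.67))-

```python
import numpy as np

cov = np.array([[2.92, 3.67],
                [3.67, 5.67]])
eigvals, eigvecs = np.linalg.eigh(cov)  # eigenvalues in ascending order
e = eigvecs[:, -1]                      # eigenvector of the largest eigenvalue
print(eigvals)                          # approx [0.38, 8.21]
print(e / e[1])                         # approx [0.69, 1.00], i.e. X1 = 0.69 X2
```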


Problem-02:
 

Use the PCA algorithm to transform the pattern (2, 1) onto the eigenvector found in the previous question.

Solution-
 

The given feature vector is (2, 1).

The feature vector gets transformed to-

= Transpose of eigenvector × (feature vector – mean vector)

= | 2.55  3.67 | × | 2 – 4.5 |
                   | 1 – 5   |

= 2.55 × (-2.5) + 3.67 × (-4)

= -21.055
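
The same transform as a NumPy one-liner (a sketch using the unnormalized eigenvector from Problem-01)-

```python
import numpy as np

e = np.array([2.55, 3.67])  # unnormalized principal eigenvector from Problem-01
mu = np.array([4.5, 5.0])   # mean vector from Problem-01
x = np.array([2.0, 1.0])    # the pattern to transform
print(e @ (x - mu))         # -21.055
```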

To gain a better understanding of Principal Component Analysis, watch this video lecture.

Get more notes and other study material on Pattern Recognition.

Watch video lectures by visiting our YouTube channel LearnVidFun.
