Correlation Analysis Notes-2
Correlation Analysis Notes-2
Statistical Methods
Study Material – B.Com VI Semester
Prepared By:
SHANKARAIAH.P
Lecturer
Module: 1
CORRELATION ANALYSIS
Correlation between groups of data is a measure of how well they are related. The most
common measure of correlation in stats is the Pearson Correlation. The full name is
the Pearson Product Moment Correlation. It shows the linear relationship between two
groups of data. Two letters are used to represent the correlation: Greek letter rho (ρ) for a
population and the letter “r” for a sample. It is the most popular and widely used method to
calculate the correlation coefficient.
Karl Pearson, a reputed statistician, in 1890, had constructed a well set formula based
on mathematical treatment for determining the co-efficient of correlation. It is method of
calculating coefficient of correlation is based on covariance of the concerned variables.
Covariance is a statistical representation of the degree to which two variables vary together.
Definition:
“It is the study of degree of relationship between two or more variables”.
According to A.M. Tuttle, “Correlation is an analysis of the co-variation between two
or more variables.”
Importance of correlation:
1. The correlation coefficient helps in measuring the extent of relationship between two
variables in one figure.
2. Correlation analysis facilitates understanding of economic behaviour and helps in
locating the critically important variables on which others depend.
3. When two variables are correlated, then value of one variable can be estimated, given
the value of another.
4. Correlation facilitates the decision-making in the business world. It reduces the range
of uncertainty as predictions based on correlation are likely to be more reliable and closer to
reality.
5. The estimations made on the basis of correlation analysis are considered to be nearer
to reality and hence reliable.
Therefore, correlation analysis contributes to the understanding of economic behaviour,
and helps in locating the critically important variables on which others depend.
Types of Correlation:
1. Positive Correlation: When an increase in one variable results in an increase in another
variable or decrease in one variable results an decrease in another variable, then the
relationship is said to be positive. That is, when both the variables move in the same direction
is called positive correlation.
Examples: Relationship between height and weight, price and quantity supplied of a good etc.
2. Negative Correlation: When an increase in one variable results in a decrease in another
variable or a decrease in one variable results in increase in another variable, then the
relationship is said to be negative. That is, when both the variables move in the opposite
direction is called negative correlation.
Examples: Relationship between price and quantity demanded of a good, day temperature and
sale of woolen garments, etc.
3. Simple Correlation: When study involves relationship between only two variables is called
simple correlation.
Examples: study of relationship between the two variables such demand and price.
X
6. Non-Linear Correlation: When a constant ration of change in one variable will not results
in a constant ratio of change in another variable is said to be non-linear correlation. It is also
called as Curvi-Line Correlation as plotted values of pair observations on a graph gives a
curved line.
Example:
X Variable 10 20 30 40 50
Y Variable 12 18 22 8 5
X
Assumptions:
1. Linear Relationship: In this method, it is assumed that there is a linear relationship
between the two variables.
2. Normal Distribution: It assumes the normal distribution of sample population.
3. Cause and Effect: It assumes the existence between the cause and effect relationship
between the variables.
Properties of Co-efficient of Correlation:
1. It is a relative measure correlation. Since, calculated statistical value stated in
proportion.
2. It is independent of unit of measurement.
3. It is independent of change of origin and change of scale.
r = bxy X byx.
7. Co-efficient of correlation works both ways: rxy and rys
Formulas:
I. Individual Observations:
1. Direct Method: When deviations are taken from Actual Mean (X – X )
∑ 𝐝𝐱𝐝𝐲
𝐫=
√∑ 𝐝𝐱 𝟐 𝐗 ∑ 𝐝𝐲 𝟐