SlideShare ist ein Scribd-Unternehmen logo
1 von 53
Downloaden Sie, um offline zu lesen
CORRELATION &
REGRESSION
BirinderSingh,AssistantProfessor,PCTE
CORRELATION
 Correlation is a statistical tool that helps to
measure and analyze the degree of relationship
between two variables.
 Correlation analysis deals with the association
between two or more variables.
BirinderSingh,AssistantProfessor,PCTE
CORRELATION
 The degree of relationship between the variables
under consideration is measure through the
correlation analysis.
 The measure of correlation called the correlation
coefficient .
 The degree of relationship is expressed by
coefficient which range from correlation
( -1 ≤ r ≥ +1)
 The direction of change is indicated by a sign.
 The correlation analysis enable us to have an
idea about the degree & direction of the
relationship between the two variables under
study.
BirinderSingh,AssistantProfessor,PCTE
Correlation
Positive Correlation Negative Correlation
TYPES OF CORRELATION - TYPE I
TYPES OF CORRELATION TYPE I
 Positive Correlation: The correlation is said to be
positive correlation if the values of two variables
changing with same direction.
Ex. Pub. Exp. & Sales, Height & Weight.
 Negative Correlation: The correlation is said to be
negative correlation when the values of variables change
with opposite direction.
Ex. Price & Quantity demanded.
DIRECTION OF THE CORRELATION
 Positive relationship – Variables change in the
same direction.
 As X is increasing, Y is increasing
 As X is decreasing, Y is decreasing
 E.g., As height increases, so does weight.
 Negative relationship – Variables change in
opposite directions.
 As X is increasing, Y is decreasing
 As X is decreasing, Y is increasing
 E.g., As TV time increases, grades decrease
Indicated by
sign; (+) or (-).
EXAMPLES
BirinderSingh,AssistantProfessor,PCTE
 Water consumption
and temperature.
 Study time and
grades.
 Alcohol consumption
and driving ability.
 Price & quantity
demanded
Positive Correlation Negative Correlation
Correlation
Simple Multiple
Partial Total
TYPES OF CORRELATION TYPE II
TYPES OF CORRELATION TYPE II
 Simple correlation: Under simple correlation
problem there are only two variables are studied.
 Multiple Correlation: Under Multiple
Correlation three or more than three variables
are studied. Ex. Qd = f ( P,PC, PS, t, y )
 Partial correlation: analysis recognizes more
than two variables but considers only two
variables keeping the other constant.
 Total correlation: is based on all the relevant
variables, which is normally not feasible.
Types of Correlation
Type III
Correlation
LINEAR NON LINEAR
TYPES OF CORRELATION TYPE
III
 Linear correlation: Correlation is said to be
linear when the amount of change in one
variable tends to bear a constant ratio to the
amount of change in the other. The graph of the
variables having a linear relationship will form
a straight line.
Ex X = 1, 2, 3, 4, 5, 6, 7, 8,
Y = 5, 7, 9, 11, 13, 15, 17, 19,
Y = 3 + 2x
 Non Linear correlation: The correlation
would be non linear if the amount of change in
one variable does not bear a constant ratio to
the amount of change in the other variable.
CORRELATION & CAUSATION
 Causation means cause & effect relation.
 Correlation denotes the interdependency among the
variables for correlating two phenomenon, it is
essential that the two phenomenon should have
cause-effect relationship,& if such relationship does
not exist then the two phenomenon can not be
correlated.
 If two variables vary in such a way that movement
in one are accompanied by movement in other, these
variables are called cause and effect relationship.
 Causation always implies correlation but correlation
does not necessarily implies causation.
DEGREE OF CORRELATION
 Perfect Correlation
 High Degree of Correlation
 Moderate Degree of Correlation
 Low Degree of Correlation
 No Correlation
BirinderSingh,AssistantProfessor,PCTE
METHODS OF STUDYING CORRELATION
Methods
Graphic
Methods
Scatter
Diagram
Correlation
Graph
Algebraic
Methods
Karl
Pearson’s
Coefficient
Rank
Correlation
Concurrent
Deviation
BirinderSingh,AssistantProfessor,PCTE
SCATTER DIAGRAM METHOD
 Scatter Diagram is a graph of
observed plotted points where each
points represents the values of X & Y
as a coordinate.
 It portrays the relationship between
these two variables graphically.
A PERFECT POSITIVE
CORRELATION
Height
Weight
Height
of A
Weight
of A
Height
of B
Weight
of B
A linear
relationship
HIGH DEGREE OF POSITIVE
CORRELATION
 Positive relationship
Height
Weight
r = +.80
DEGREE OF CORRELATION
 Moderate Positive Correlation
Weight
Shoe
Size
r = + 0.4
DEGREE OF CORRELATION
 Perfect Negative Correlation
Exam score
TV
watching
per
week
r = -1.0
DEGREE OF CORRELATION
 Moderate Negative Correlation
Exam score
TV
watching
per
week
r = -.80
DEGREE OF CORRELATION
 Weak negative Correlation
Weight
Shoe
Size r = - 0.2
DEGREE OF CORRELATION
 No Correlation (horizontal line)
Height
IQ
r = 0.0
DEGREE OF CORRELATION (R)
r = +.80 r = +.60
r = +.40 r = +.20
DIRECTION OF THE RELATIONSHIP
 Positive relationship – Variables change in the same
direction.
 As X is increasing, Y is increasing
 As X is decreasing, Y is decreasing
 E.g., As height increases, so does weight.
 Negative relationship – Variables change in opposite
directions.
 As X is increasing, Y is decreasing
 As X is decreasing, Y is increasing
 E.g., As TV time increases, grades decrease
Indicated by
sign; (+) or (-).
ADVANTAGES OF SCATTER DIAGRAM
Simple & Non Mathematical method
Not influenced by the size of extreme
item
First step in investing the relationship
between two variables
DISADVANTAGE OF SCATTER DIAGRAM
Can not adopt the an exact
degree of correlation
CORRELATION GRAPH
0
50
100
150
200
250
300
2012 2013 2014 2015 2016 2017
Consumption
Production
BirinderSingh,AssistantProfessor,PCTE
KARL PEARSON’S COEFFICIENT OF
CORRELATION
 It is quantitative method of measuring
correlation
 This method has been given by Karl Pearson
 It’s the best method
BirinderSingh,AssistantProfessor,PCTE
CALCULATION OF COEFFICIENT OF
CORRELATION – ACTUAL MEAN METHOD
 Formula used is:
 r =
Σ𝑥𝑦
Σ𝑥2 . Σ𝑦2
where x = X – 𝑋 ; y = Y – 𝑌
Q1: Find Karl Pearson’s coefficient of correlation:
Ans: 0.96
Q2: Find Karl Pearson’s coefficient of correlation:
Summation of product of deviations of X & Y series from their respective
arithmetic means = 122 Ans: 0.89
BirinderSingh,AssistantProfessor,PCTE
X 2 3 4 5 6 7 8
Y 4 7 8 9 10 14 18
X- Series Y-series
No. of items 15 15
AM 25 18
Squares of deviations from mean 136 138
PRACTICE PROBLEMS - CORRELATION
Q3: Find Karl Pearson’s coefficient of correlation:
Arithmetic Means of X & Y are 6 & 8 respectively. Ans: – 0.92
Q4: Find the number of items as per the given data:
r = 0.5, Ʃxy = 120, σy = 8, Ʃx2 = 90
where x & y are deviations from arithmetic means
Ans: 10
Q5: Find r:
ƩX = 250, ƩY = 300, Ʃ(X – 25)2 = 480, Ʃ(Y – 30)2 = 600
Ʃ(X – 25)(Y – 30) = 150 , N = 10 Ans: 0.28
BirinderSingh,AssistantProfessor,PCTE
X 6 2 10 4 8
Y 9 11 ? 8 7
CALCULATION OF COEFFICIENT OF
CORRELATION – ASSUMED MEAN METHOD
 Formula used is:
 r =
𝑁 .Σ𝑑𝑥𝑑𝑦 − Σ𝑑𝑥.Σ𝑑𝑦
𝑁.Σ𝑑𝑥2 −(Σ𝑑𝑥)2 𝑁.Σ𝑑𝑦2 −(Σ𝑑𝑦)2
Q6:Find r:
Ans: 0.98
Q7: Find r, when deviations of two series from assumed mean
are as follows: Ans: 0.895
BirinderSingh,AssistantProfessor,PCTE
X 10 12 18 16 15 19 18 17
Y 30 35 45 44 42 48 47 46
Dx +5 -4 -2 +20 -10 0 +3 0 -15 -5
Dy +5 -12 -7 +25 -10 -3 0 +2 -9 -15
CALCULATION OF COEFFICIENT OF
CORRELATION – ACTUAL DATA METHOD
 Formula used is:
 r =
𝑁.Σ𝑋𝑌 − Σ𝑋.Σ𝑌
𝑁.Σ𝑋2 −(Σ𝑋)2 𝑁.Σ𝑌2 −(Σ𝑌)2
Q8:Find r:
Ans: 0.98
Q9: Calculate product moment correlation coefficient from the
following data: Ans: 0.996
BirinderSingh,AssistantProfessor,PCTE
X 10 12 18 16 15 19 18 17
Y 30 35 45 44 42 48 47 46
X -5 -10 -15 -20 -25 -30
Y 50 40 30 20 10 5
IMPORTANT TYPICAL PROBLEMS
Q10: Calculate the coefficient of correlation from the following
data and interpret the result: Ans: 0.76
N = 10, ƩXY = 8425, 𝑋 = 28.5, 𝑌 = 28.0, 𝜎𝑥 = 10.5, 𝜎𝑦 = 5.6
Q11: Following results were obtained from an analysis:
N = 12, ƩXY = 334, ƩX = 30, ƩY = 5, ƩX2 = 670, ƩY2 = 285
Later on it was discovered that one pair of values (X = 11, Y = 4) were
wrongly copied. The correct value of the pair was (X = 10, Y = 14).
Find the correct value of correlation coefficient. Ans: 0.774
BirinderSingh,AssistantProfessor,PCTE
VARIANCE – COVARIANCE METHOD
 This method of determining correlation coefficient is based on
covariance.
 r =
𝐶𝑜𝑣 (𝑋,𝑌)
𝑉𝑎𝑟 𝑋 𝑉𝑎𝑟 (𝑌)
=
𝐶𝑜𝑣 (𝑋,𝑌)
σ 𝑥
.σ 𝑦
where Cov X, Y =
Σ𝑥𝑦
𝑁
=
Σ(𝑋−𝑋)(𝑌−𝑌)
𝑁
=
Σ𝑋𝑌
𝑁
− 𝑋 𝑌
 Another Way of calculating r =
Σ𝑥𝑦
𝑁. σ 𝑥
.σ 𝑦
.
Q12: For two series X & Y, Cov(X,Y) = 15, Var(X)=36, Var (Y)=25.
Find r. Ans: 0.5
Q13: Find r when N = 30, 𝑋 = 40, 𝑌 = 50, 𝜎 𝑥 = 6, 𝜎 𝑦 = 7, Σ𝑥𝑦 = 360
Ans: 0.286
Q14: For two series X & Y, Cov(X,Y) = 25, Var(X)=36, r = 0.6.
Find 𝜎 𝑦. Ans: 6.94
BirinderSingh,AssistantProfessor,PCTE
CALCULATION OF CORRELATION
COEFFICIENT – GROUPED DATA
 Formula used is:
 r =
𝑁 .Σ𝑓𝑑𝑥𝑑𝑦 − Σ𝑓𝑑𝑥.Σ𝑓𝑑𝑦
𝑁.Σ𝑓𝑑𝑥2 −(Σ𝑓𝑑𝑥)2 𝑁.Σ𝑓𝑑𝑦2 −(Σ𝑓𝑑𝑦)2
Q15: Calculate Karl Pearson’s coefficient of correlation:
Ans: 0.33
BirinderSingh,AssistantProfessor,PCTE
X / Y 10-25 25-40 40-55
0-20 10 4 6
20-40 5 40 9
40-60 3 8 15
BirinderSingh,AssistantProfessor,PCTE
PROPERTIES OF COEFFICIENT OF
CORRELATION
 Karl Pearson’s coefficient of correlation lies between -
1 & 1, i.e. – 1 ≤ r ≤ +1
 If the scale of a series is changed or the origin is
shifted, there is no effect on the value of ‘r’.
 ‘r’ is the geometric mean of the regression coefficients
byx & bxy, i.e. r = 𝑏 𝑥𝑦 . 𝑏𝑦𝑥
 If X & Y are independent variables, then coefficient of
correlation is zero but the converse is not necessarily
true.
 ‘r’ is a pure number and is independent of the units of
measurement.
 The coefficient of correlation between the two
variables x & y is symmetric. i.e. ryx = rxy
BirinderSingh,AssistantProfessor,PCTE
PROBABLE ERROR & STANDARD ERROR
 Probable Error is used to test the reliability of Karl
Pearson’s correlation coefficient.
 Probable Error (P.E.) = 0.6745 x
1 − 𝑟2
𝑁
 Probable Error is used to interpret the value of the
correlation coefficient as per the following:
 If 𝑟 > 6 P.E., then ‘r’ is significant.
 If 𝑟 < 6 P.E., then ‘r’ is insignificant. It means that there
is no evidence of the existence of correlation in both the
series.
 Probable Error also determines the upper & lower
limits within which the correlation of randomly
selected sample from the same universe will fall.
 Upper Limit = r + P.E.
 Lowe Limit = r – P.E.
BirinderSingh,AssistantProfessor,PCTE
PRACTICE PROBLEM – PROBABLE ERROR
Q16: Find Karl Pearson’s coefficient of correlation
from the following data:
Also calculate probable error and check whether it
is significant or not. Ans: – 0.94, 0.032
Q17: A student calculates the value of r as 0.7
when N = 5. He concludes that r is highly
significant. Comment. Ans: Insignificant
BirinderSingh,AssistantProfessor,PCTE
X 9 28 45 60 70 50
Y 100 60 50 40 33 57
SPEARMAN’S RANK CORRELATION
METHOD
 Given by Prof. Spearman in 1904
 By this method, correlation between qualitative
aspects like intelligence, honesty, beauty etc. can be
calculated.
 These variables can be assigned ranks but their
quantitative measurement is not possible.
 It is denoted by R = 1 –
𝟔 𝜮𝑫 𝟐
𝑵 (𝑵 𝟐 −𝟏)
 R = Rank correlation coefficient
 D = Difference between two ranks (R1 – R2)
 N = Number of pair of observations
 As in case of r, – 1 ≤ R ≤ 1
 The sum total of Rank Difference is always equal to
zero. i.e. ƩD = 0.
BirinderSingh,AssistantProfessor,PCTE
THREE CASES
Spearman’s
Rank
Correlation
Method
When ranks are
given
When ranks are
not given
When equal or
tied ranks exist
BirinderSingh,AssistantProfessor,PCTE
PRACTICE PROBLEMS – RANK
CORRELATION (WHEN RANKS ARE GIVEN)
Q18: In a fancy dress competition, two judges accorded the
following ranks to eight participants:
Calculate the coefficient of rank correlation. Ans: .62
Q19: Ten competitors in a beauty contest are ranked by three
judges X, Y, Z:
Use the rank correlation coefficient to determine which pair of
judges has the nearest approach to common tastes in beauty.
Ans: X & Z
BirinderSingh,AssistantProfessor,PCTE
Judge X 8 7 6 3 2 1 5 4
Judge Y 7 5 4 1 3 2 6 8
X 1 6 5 10 3 2 4 9 7 8
Y 3 5 8 4 7 10 2 1 6 9
Z 6 4 9 8 1 2 3 10 5 7
BirinderSingh,AssistantProfessor,PCTE
PRACTICE PROBLEMS – RANK CORRELATION
(WHEN RANKS ARE NOT GIVEN)
Q20: Find out the coefficient of Rank Correlation
between X & Y:
Ans: 0.48
BirinderSingh,AssistantProfessor,PCTE
X 15 17 14 13 11 12 16 18 10 9
Y 18 12 4 6 7 9 3 10 2 5
PRACTICE PROBLEMS – RANK CORRELATION
(WHEN RANKS ARE EQUAL OR TIED)
 When two or more items have equal values in a
series, so common ranks i.e. average of the ranks
are assigned to equal values.
 Here R = 1 –
𝟔 𝜮𝑫 𝟐+
𝒎 𝟑 −𝒎
𝟏𝟐
+
𝒎 𝟑 −𝒎
𝟏𝟐
+ …………..
𝑵 (𝑵 𝟐 −𝟏)
 m = No. of items of equal ranks
 The correction factor of
𝒎 𝟑 −𝒎
𝟏𝟐
is added to 𝜮𝑫 𝟐
for such
number of times as the cases of equal ranks in the
question
BirinderSingh,AssistantProfessor,PCTE
PRACTICE PROBLEMS – RANK CORRELATION
(WHEN RANKS ARE EQUAL OR TIED)
Q21: Calculate R:
Ans: – 0.37
Q22: Calculate Rank Correlation:
Ans: 0.43
BirinderSingh,AssistantProfessor,PCTE
X 15 10 20 28 12 10 16 18
Y 16 14 10 12 11 15 18 10
X 40 50 60 60 80 50 70 60
Y 80 120 160 170 130 200 210 130
IMPORTANT TYPICAL PROBLEMS –
RANK CORRELATION
Q23: Calculate Rank Correlation from the following data:
Ans: 0.64
Q24: The coefficient of rank correlation of marks obtained by 10
students in English & Math was found to be 0.5. It was later
discovered that the difference in the ranks in two subjects was
wrongly taken as 3 instead of 7. Find the correct rank correlation.
Ans: 0.26
Q25: The rank correlation coefficient between marks obtained by
some students in English & Math is found to be 0.8. If the total of
squares of rank differences is 33, find the number of students.
Ans: 10
BirinderSingh,AssistantProfessor,PCTE
Serial No. 1 2 3 4 5 6 7 8 9 10
Rank
Difference
-2 ? -1 +3 +2 0 -4 +3 +3 -2
BirinderSingh,AssistantProfessor,PCTE
CONCURRENT DEVIATION METHOD
 Correlation is determined on the basis of direction of the
deviations.
 Under this method, the direction of deviations are assigned
(+) or (-) or (0) signs.
 If the value is more than its preceding value, then its deviation
is assigned (+) sign.
 If the value is less than its preceding value, then its deviation
is assigned (-) sign.
 If the value is equal to its preceding value, then its deviation is
assigned (0) sign.
 The deviations dx & dy are multiplied to get dxdy. Product of
similar signs will be (+) and for opposite signs will be (-).
 Summing the positive dxdy signs, their number is counted. It is
called CONCURRENT DEVIATIONS. It is denoted by C.
 Formula used: rc = ± ±
𝟐𝑪 −𝒏
𝒏
where rc = Correlation of
CD, C = No. of Concurrent Deviations, n = N – 1.
BirinderSingh,AssistantProfessor,PCTE
PRACTICE PROBLEMS – COEFFICIENT OF
CONCURRENT DEVIATIONS
Q26: Find the Coefficient of Concurrent Deviation from
the following data:
Ans: – 1
Q27: Find the Coefficient of Concurrent Deviation from
the following data:
Ans: – 0.75
BirinderSingh,AssistantProfessor,PCTE
Year 2001 2002 2003 2004 2005 2006 2007
Demand 150 154 160 172 160 165 180
Price 200 180 170 160 190 180 172
X 112 125 126 118 118 121 125 125 131 135
Y 106 102 102 104 98 96 97 97 95 90
COEFFICIENT OF DETERMINATION (COD)
 CoD is used for the interpretation of coefficient of correlation and
comparing the two or more correlation coefficients.
 It is the square of the coefficient of correlation i.e. r2.
 It explains the percentage variation in the dependent variable Y
that can be explained in terms of the independent variable X.
 If r = 0.8, r2 = 0.64, it implies that 64% of the total variations in Y
occurs due to X. The remaining 34% variation occurs due to
external factors.
 So, CoD = r2 =
𝐸𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒
𝑇𝑜𝑡𝑎𝑙 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒
 Coefficient of Non Determination= K2 = 1 – r2 =
𝑈𝑛𝑒𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒
𝑇𝑜𝑡𝑎𝑙 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒
 Coefficient of Alienation = 1 – r2
BirinderSingh,AssistantProfessor,PCTE
PRACTICE PROBLEMS – COD
Q28: The coefficient of correlation between
consumption expenditure (C) and disposable
income (Y) in a study was found to be +0.8. What
percentage of variation in C are explained by
variation in Y? Ans: 64%
BirinderSingh,AssistantProfessor,PCTE
CLASS TEST
Q1: In a fancy dress competition, two judges accorded the
following ranks to eight participants:
Calculate the coefficient of rank correlation.
Q2: Following results were obtained from an analysis:
N = 12, ƩXY = 334, ƩX = 30, ƩY = 5, ƩX2 = 670, ƩY2 = 285
Later on it was discovered that one pair of values (X = 11, Y = 4) were
wrongly copied. The correct value of the pair was (X = 10, Y = 14).
Find the correct value of correlation coefficient.
BirinderSingh,AssistantProfessor,PCTE
Judge X 8 7 6 3 2 1 5 4
Judge Y 7 5 4 1 3 2 6 8

Weitere ähnliche Inhalte

Was ist angesagt? (20)

Regression analysis.
Regression analysis.Regression analysis.
Regression analysis.
 
T test statistics
T test statisticsT test statistics
T test statistics
 
correlation and regression
correlation and regressioncorrelation and regression
correlation and regression
 
Standard deviation
Standard deviationStandard deviation
Standard deviation
 
T test
T testT test
T test
 
Karl pearson's correlation
Karl pearson's correlationKarl pearson's correlation
Karl pearson's correlation
 
Chi -square test
Chi -square testChi -square test
Chi -square test
 
Correlation analysis
Correlation analysisCorrelation analysis
Correlation analysis
 
Regression
Regression Regression
Regression
 
Correlation and partial correlation
Correlation and partial correlationCorrelation and partial correlation
Correlation and partial correlation
 
Multiple Correlation - Thiyagu
Multiple Correlation - ThiyaguMultiple Correlation - Thiyagu
Multiple Correlation - Thiyagu
 
Chi squared test
Chi squared testChi squared test
Chi squared test
 
Measures of Dispersion
Measures of DispersionMeasures of Dispersion
Measures of Dispersion
 
Regression analysis
Regression analysisRegression analysis
Regression analysis
 
Multiple regression presentation
Multiple regression presentationMultiple regression presentation
Multiple regression presentation
 
One way anova final ppt.
One way anova final ppt.One way anova final ppt.
One way anova final ppt.
 
Standard error
Standard error Standard error
Standard error
 
coefficient correlation
 coefficient correlation coefficient correlation
coefficient correlation
 
Introduction to correlation and regression analysis
Introduction to correlation and regression analysisIntroduction to correlation and regression analysis
Introduction to correlation and regression analysis
 
Probability Theory
Probability TheoryProbability Theory
Probability Theory
 

Ähnlich wie Correlation & Regression Analysis Guide

Pearson's correlation coefficient 
Pearson's correlation coefficient Pearson's correlation coefficient 
Pearson's correlation coefficient Topu Kawser
 
Pearson's correlation
Pearson's  correlationPearson's  correlation
Pearson's correlationTRIPTI DUBEY
 
Pearson's correlation
Pearson's  correlationPearson's  correlation
Pearson's correlationTRIPTI DUBEY
 
Correlation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptxCorrelation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptxHamdiMichaelCC
 
correlationcoefficient-20090414 0531.pdf
correlationcoefficient-20090414 0531.pdfcorrelationcoefficient-20090414 0531.pdf
correlationcoefficient-20090414 0531.pdfDrAmanSaxena
 
Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...
Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...
Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...RekhaChoudhary24
 
Unit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfUnit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfRavinandan A P
 
correlation.final.ppt (1).pptx
correlation.final.ppt (1).pptxcorrelation.final.ppt (1).pptx
correlation.final.ppt (1).pptxChieWoo1
 
Module 2_ Regression Models..pptx
Module 2_ Regression Models..pptxModule 2_ Regression Models..pptx
Module 2_ Regression Models..pptxnikshaikh786
 
Pearson product moment correlation
Pearson product moment correlationPearson product moment correlation
Pearson product moment correlationSharlaine Ruth
 

Ähnlich wie Correlation & Regression Analysis Guide (20)

Correlation.pdf
Correlation.pdfCorrelation.pdf
Correlation.pdf
 
correlation.ppt
correlation.pptcorrelation.ppt
correlation.ppt
 
12943625.ppt
12943625.ppt12943625.ppt
12943625.ppt
 
Pearson's correlation coefficient 
Pearson's correlation coefficient Pearson's correlation coefficient 
Pearson's correlation coefficient 
 
Correlation and Regression
Correlation and Regression Correlation and Regression
Correlation and Regression
 
Correlation
CorrelationCorrelation
Correlation
 
Pearson's correlation
Pearson's  correlationPearson's  correlation
Pearson's correlation
 
Pearson's correlation
Pearson's  correlationPearson's  correlation
Pearson's correlation
 
correlation.pptx
correlation.pptxcorrelation.pptx
correlation.pptx
 
Correlation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptxCorrelation analysis in Biostatistics .pptx
Correlation analysis in Biostatistics .pptx
 
Correlation continued
Correlation continuedCorrelation continued
Correlation continued
 
correlationcoefficient-20090414 0531.pdf
correlationcoefficient-20090414 0531.pdfcorrelationcoefficient-20090414 0531.pdf
correlationcoefficient-20090414 0531.pdf
 
Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...
Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...
Simple Correlation : Karl Pearson’s Correlation co- efficient and Spearman’s ...
 
UNIT 4.pptx
UNIT 4.pptxUNIT 4.pptx
UNIT 4.pptx
 
Unit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdfUnit 1 Correlation- BSRM.pdf
Unit 1 Correlation- BSRM.pdf
 
Correlation
CorrelationCorrelation
Correlation
 
correlation.final.ppt (1).pptx
correlation.final.ppt (1).pptxcorrelation.final.ppt (1).pptx
correlation.final.ppt (1).pptx
 
Statistical technique exercise 23 and 24 correlational study
Statistical technique exercise 23 and 24 correlational studyStatistical technique exercise 23 and 24 correlational study
Statistical technique exercise 23 and 24 correlational study
 
Module 2_ Regression Models..pptx
Module 2_ Regression Models..pptxModule 2_ Regression Models..pptx
Module 2_ Regression Models..pptx
 
Pearson product moment correlation
Pearson product moment correlationPearson product moment correlation
Pearson product moment correlation
 

Mehr von Birinder Singh Gulati (12)

Probability Distributions
Probability DistributionsProbability Distributions
Probability Distributions
 
Chi Square & Anova
Chi Square & AnovaChi Square & Anova
Chi Square & Anova
 
Hypothesis Testing
Hypothesis TestingHypothesis Testing
Hypothesis Testing
 
Sampling & Sampling Distribtutions
Sampling & Sampling DistribtutionsSampling & Sampling Distribtutions
Sampling & Sampling Distribtutions
 
Introduction to Quantitative Techniques
Introduction to Quantitative TechniquesIntroduction to Quantitative Techniques
Introduction to Quantitative Techniques
 
Probability Distributions
Probability DistributionsProbability Distributions
Probability Distributions
 
Probability - I
Probability - IProbability - I
Probability - I
 
Central Tendency & Dispersion
Central Tendency & DispersionCentral Tendency & Dispersion
Central Tendency & Dispersion
 
Matrices & Determinants
Matrices & DeterminantsMatrices & Determinants
Matrices & Determinants
 
Set Theory
Set TheorySet Theory
Set Theory
 
Time Series - 1
Time Series - 1Time Series - 1
Time Series - 1
 
Logarithms
LogarithmsLogarithms
Logarithms
 

Kürzlich hochgeladen

Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataBabyAnnMotar
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmStan Meyer
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsPooky Knightsmith
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDhatriParmar
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptxmary850239
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxVanesaIglesias10
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptxmary850239
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSMae Pangan
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxSayali Powar
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdfMr Bounab Samir
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQuiz Club NITW
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfVanessa Camilleri
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management systemChristalin Nelson
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...DhatriParmar
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 

Kürzlich hochgeladen (20)

Measures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped dataMeasures of Position DECILES for ungrouped data
Measures of Position DECILES for ungrouped data
 
Oppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and FilmOppenheimer Film Discussion for Philosophy and Film
Oppenheimer Film Discussion for Philosophy and Film
 
Mental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young mindsMental Health Awareness - a toolkit for supporting young minds
Mental Health Awareness - a toolkit for supporting young minds
 
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptxDecoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
Decoding the Tweet _ Practical Criticism in the Age of Hashtag.pptx
 
4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx4.16.24 Poverty and Precarity--Desmond.pptx
4.16.24 Poverty and Precarity--Desmond.pptx
 
ROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptxROLES IN A STAGE PRODUCTION in arts.pptx
ROLES IN A STAGE PRODUCTION in arts.pptx
 
4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx4.16.24 21st Century Movements for Black Lives.pptx
4.16.24 21st Century Movements for Black Lives.pptx
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Textual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHSTextual Evidence in Reading and Writing of SHS
Textual Evidence in Reading and Writing of SHS
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptxBIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
BIOCHEMISTRY-CARBOHYDRATE METABOLISM CHAPTER 2.pptx
 
MS4 level being good citizen -imperative- (1) (1).pdf
MS4 level   being good citizen -imperative- (1) (1).pdfMS4 level   being good citizen -imperative- (1) (1).pdf
MS4 level being good citizen -imperative- (1) (1).pdf
 
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITWQ-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
Q-Factor HISPOL Quiz-6th April 2024, Quiz Club NITW
 
ICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdfICS2208 Lecture6 Notes for SL spaces.pdf
ICS2208 Lecture6 Notes for SL spaces.pdf
 
Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"Mattingly "AI & Prompt Design: Large Language Models"
Mattingly "AI & Prompt Design: Large Language Models"
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Concurrency Control in Database Management system
Concurrency Control in Database Management systemConcurrency Control in Database Management system
Concurrency Control in Database Management system
 
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
Beauty Amidst the Bytes_ Unearthing Unexpected Advantages of the Digital Wast...
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 

Correlation & Regression Analysis Guide

  • 2. CORRELATION  Correlation is a statistical tool that helps to measure and analyze the degree of relationship between two variables.  Correlation analysis deals with the association between two or more variables. BirinderSingh,AssistantProfessor,PCTE
  • 3. CORRELATION  The degree of relationship between the variables under consideration is measure through the correlation analysis.  The measure of correlation called the correlation coefficient .  The degree of relationship is expressed by coefficient which range from correlation ( -1 ≤ r ≥ +1)  The direction of change is indicated by a sign.  The correlation analysis enable us to have an idea about the degree & direction of the relationship between the two variables under study. BirinderSingh,AssistantProfessor,PCTE
  • 4. Correlation Positive Correlation Negative Correlation TYPES OF CORRELATION - TYPE I
  • 5. TYPES OF CORRELATION TYPE I  Positive Correlation: The correlation is said to be positive correlation if the values of two variables changing with same direction. Ex. Pub. Exp. & Sales, Height & Weight.  Negative Correlation: The correlation is said to be negative correlation when the values of variables change with opposite direction. Ex. Price & Quantity demanded.
  • 6. DIRECTION OF THE CORRELATION  Positive relationship – Variables change in the same direction.  As X is increasing, Y is increasing  As X is decreasing, Y is decreasing  E.g., As height increases, so does weight.  Negative relationship – Variables change in opposite directions.  As X is increasing, Y is decreasing  As X is decreasing, Y is increasing  E.g., As TV time increases, grades decrease Indicated by sign; (+) or (-).
  • 7. EXAMPLES BirinderSingh,AssistantProfessor,PCTE  Water consumption and temperature.  Study time and grades.  Alcohol consumption and driving ability.  Price & quantity demanded Positive Correlation Negative Correlation
  • 9. TYPES OF CORRELATION TYPE II  Simple correlation: Under simple correlation problem there are only two variables are studied.  Multiple Correlation: Under Multiple Correlation three or more than three variables are studied. Ex. Qd = f ( P,PC, PS, t, y )  Partial correlation: analysis recognizes more than two variables but considers only two variables keeping the other constant.  Total correlation: is based on all the relevant variables, which is normally not feasible.
  • 10. Types of Correlation Type III Correlation LINEAR NON LINEAR
  • 11. TYPES OF CORRELATION TYPE III  Linear correlation: Correlation is said to be linear when the amount of change in one variable tends to bear a constant ratio to the amount of change in the other. The graph of the variables having a linear relationship will form a straight line. Ex X = 1, 2, 3, 4, 5, 6, 7, 8, Y = 5, 7, 9, 11, 13, 15, 17, 19, Y = 3 + 2x  Non Linear correlation: The correlation would be non linear if the amount of change in one variable does not bear a constant ratio to the amount of change in the other variable.
  • 12. CORRELATION & CAUSATION  Causation means cause & effect relation.  Correlation denotes the interdependency among the variables for correlating two phenomenon, it is essential that the two phenomenon should have cause-effect relationship,& if such relationship does not exist then the two phenomenon can not be correlated.  If two variables vary in such a way that movement in one are accompanied by movement in other, these variables are called cause and effect relationship.  Causation always implies correlation but correlation does not necessarily implies causation.
  • 13. DEGREE OF CORRELATION  Perfect Correlation  High Degree of Correlation  Moderate Degree of Correlation  Low Degree of Correlation  No Correlation BirinderSingh,AssistantProfessor,PCTE
  • 14. METHODS OF STUDYING CORRELATION Methods Graphic Methods Scatter Diagram Correlation Graph Algebraic Methods Karl Pearson’s Coefficient Rank Correlation Concurrent Deviation BirinderSingh,AssistantProfessor,PCTE
  • 15. SCATTER DIAGRAM METHOD  Scatter Diagram is a graph of observed plotted points where each points represents the values of X & Y as a coordinate.  It portrays the relationship between these two variables graphically.
  • 16. A PERFECT POSITIVE CORRELATION Height Weight Height of A Weight of A Height of B Weight of B A linear relationship
  • 17. HIGH DEGREE OF POSITIVE CORRELATION  Positive relationship Height Weight r = +.80
  • 18. DEGREE OF CORRELATION  Moderate Positive Correlation Weight Shoe Size r = + 0.4
  • 19. DEGREE OF CORRELATION  Perfect Negative Correlation Exam score TV watching per week r = -1.0
  • 20. DEGREE OF CORRELATION  Moderate Negative Correlation Exam score TV watching per week r = -.80
  • 21. DEGREE OF CORRELATION  Weak negative Correlation Weight Shoe Size r = - 0.2
  • 22. DEGREE OF CORRELATION  No Correlation (horizontal line) Height IQ r = 0.0
  • 23. DEGREE OF CORRELATION (R) r = +.80 r = +.60 r = +.40 r = +.20
  • 24. DIRECTION OF THE RELATIONSHIP  Positive relationship – Variables change in the same direction.  As X is increasing, Y is increasing  As X is decreasing, Y is decreasing  E.g., As height increases, so does weight.  Negative relationship – Variables change in opposite directions.  As X is increasing, Y is decreasing  As X is decreasing, Y is increasing  E.g., As TV time increases, grades decrease Indicated by sign; (+) or (-).
  • 25. ADVANTAGES OF SCATTER DIAGRAM Simple & Non Mathematical method Not influenced by the size of extreme item First step in investing the relationship between two variables
  • 26. DISADVANTAGE OF SCATTER DIAGRAM Can not adopt the an exact degree of correlation
  • 27. CORRELATION GRAPH 0 50 100 150 200 250 300 2012 2013 2014 2015 2016 2017 Consumption Production BirinderSingh,AssistantProfessor,PCTE
  • 28. KARL PEARSON’S COEFFICIENT OF CORRELATION  It is quantitative method of measuring correlation  This method has been given by Karl Pearson  It’s the best method BirinderSingh,AssistantProfessor,PCTE
  • 29. CALCULATION OF COEFFICIENT OF CORRELATION – ACTUAL MEAN METHOD  Formula used is:  r = Σ𝑥𝑦 Σ𝑥2 . Σ𝑦2 where x = X – 𝑋 ; y = Y – 𝑌 Q1: Find Karl Pearson’s coefficient of correlation: Ans: 0.96 Q2: Find Karl Pearson’s coefficient of correlation: Summation of product of deviations of X & Y series from their respective arithmetic means = 122 Ans: 0.89 BirinderSingh,AssistantProfessor,PCTE X 2 3 4 5 6 7 8 Y 4 7 8 9 10 14 18 X- Series Y-series No. of items 15 15 AM 25 18 Squares of deviations from mean 136 138
  • 30. PRACTICE PROBLEMS - CORRELATION Q3: Find Karl Pearson’s coefficient of correlation: Arithmetic Means of X & Y are 6 & 8 respectively. Ans: – 0.92 Q4: Find the number of items as per the given data: r = 0.5, Ʃxy = 120, σy = 8, Ʃx2 = 90 where x & y are deviations from arithmetic means Ans: 10 Q5: Find r: ƩX = 250, ƩY = 300, Ʃ(X – 25)2 = 480, Ʃ(Y – 30)2 = 600 Ʃ(X – 25)(Y – 30) = 150 , N = 10 Ans: 0.28 BirinderSingh,AssistantProfessor,PCTE X 6 2 10 4 8 Y 9 11 ? 8 7
  • 31. CALCULATION OF COEFFICIENT OF CORRELATION – ASSUMED MEAN METHOD  Formula used is:  r = 𝑁 .Σ𝑑𝑥𝑑𝑦 − Σ𝑑𝑥.Σ𝑑𝑦 𝑁.Σ𝑑𝑥2 −(Σ𝑑𝑥)2 𝑁.Σ𝑑𝑦2 −(Σ𝑑𝑦)2 Q6:Find r: Ans: 0.98 Q7: Find r, when deviations of two series from assumed mean are as follows: Ans: 0.895 BirinderSingh,AssistantProfessor,PCTE X 10 12 18 16 15 19 18 17 Y 30 35 45 44 42 48 47 46 Dx +5 -4 -2 +20 -10 0 +3 0 -15 -5 Dy +5 -12 -7 +25 -10 -3 0 +2 -9 -15
  • 32. CALCULATION OF COEFFICIENT OF CORRELATION – ACTUAL DATA METHOD  Formula used is:  r = 𝑁.Σ𝑋𝑌 − Σ𝑋.Σ𝑌 𝑁.Σ𝑋2 −(Σ𝑋)2 𝑁.Σ𝑌2 −(Σ𝑌)2 Q8:Find r: Ans: 0.98 Q9: Calculate product moment correlation coefficient from the following data: Ans: 0.996 BirinderSingh,AssistantProfessor,PCTE X 10 12 18 16 15 19 18 17 Y 30 35 45 44 42 48 47 46 X -5 -10 -15 -20 -25 -30 Y 50 40 30 20 10 5
  • 33. IMPORTANT TYPICAL PROBLEMS Q10: Calculate the coefficient of correlation from the following data and interpret the result: Ans: 0.76 N = 10, ƩXY = 8425, 𝑋 = 28.5, 𝑌 = 28.0, 𝜎𝑥 = 10.5, 𝜎𝑦 = 5.6 Q11: Following results were obtained from an analysis: N = 12, ƩXY = 334, ƩX = 30, ƩY = 5, ƩX2 = 670, ƩY2 = 285 Later on it was discovered that one pair of values (X = 11, Y = 4) were wrongly copied. The correct value of the pair was (X = 10, Y = 14). Find the correct value of correlation coefficient. Ans: 0.774 BirinderSingh,AssistantProfessor,PCTE
  • 34. VARIANCE – COVARIANCE METHOD  This method of determining correlation coefficient is based on covariance.  r = 𝐶𝑜𝑣 (𝑋,𝑌) 𝑉𝑎𝑟 𝑋 𝑉𝑎𝑟 (𝑌) = 𝐶𝑜𝑣 (𝑋,𝑌) σ 𝑥 .σ 𝑦 where Cov X, Y = Σ𝑥𝑦 𝑁 = Σ(𝑋−𝑋)(𝑌−𝑌) 𝑁 = Σ𝑋𝑌 𝑁 − 𝑋 𝑌  Another Way of calculating r = Σ𝑥𝑦 𝑁. σ 𝑥 .σ 𝑦 . Q12: For two series X & Y, Cov(X,Y) = 15, Var(X)=36, Var (Y)=25. Find r. Ans: 0.5 Q13: Find r when N = 30, 𝑋 = 40, 𝑌 = 50, 𝜎 𝑥 = 6, 𝜎 𝑦 = 7, Σ𝑥𝑦 = 360 Ans: 0.286 Q14: For two series X & Y, Cov(X,Y) = 25, Var(X)=36, r = 0.6. Find 𝜎 𝑦. Ans: 6.94 BirinderSingh,AssistantProfessor,PCTE
  • 35. CALCULATION OF CORRELATION COEFFICIENT – GROUPED DATA  Formula used is:  r = 𝑁 .Σ𝑓𝑑𝑥𝑑𝑦 − Σ𝑓𝑑𝑥.Σ𝑓𝑑𝑦 𝑁.Σ𝑓𝑑𝑥2 −(Σ𝑓𝑑𝑥)2 𝑁.Σ𝑓𝑑𝑦2 −(Σ𝑓𝑑𝑦)2 Q15: Calculate Karl Pearson’s coefficient of correlation: Ans: 0.33 BirinderSingh,AssistantProfessor,PCTE X / Y 10-25 25-40 40-55 0-20 10 4 6 20-40 5 40 9 40-60 3 8 15
  • 37. PROPERTIES OF COEFFICIENT OF CORRELATION  Karl Pearson’s coefficient of correlation lies between - 1 & 1, i.e. – 1 ≤ r ≤ +1  If the scale of a series is changed or the origin is shifted, there is no effect on the value of ‘r’.  ‘r’ is the geometric mean of the regression coefficients byx & bxy, i.e. r = 𝑏 𝑥𝑦 . 𝑏𝑦𝑥  If X & Y are independent variables, then coefficient of correlation is zero but the converse is not necessarily true.  ‘r’ is a pure number and is independent of the units of measurement.  The coefficient of correlation between the two variables x & y is symmetric. i.e. ryx = rxy BirinderSingh,AssistantProfessor,PCTE
  • 38. PROBABLE ERROR & STANDARD ERROR  Probable Error is used to test the reliability of Karl Pearson’s correlation coefficient.  Probable Error (P.E.) = 0.6745 x 1 − 𝑟2 𝑁  Probable Error is used to interpret the value of the correlation coefficient as per the following:  If 𝑟 > 6 P.E., then ‘r’ is significant.  If 𝑟 < 6 P.E., then ‘r’ is insignificant. It means that there is no evidence of the existence of correlation in both the series.  Probable Error also determines the upper & lower limits within which the correlation of randomly selected sample from the same universe will fall.  Upper Limit = r + P.E.  Lowe Limit = r – P.E. BirinderSingh,AssistantProfessor,PCTE
  • 39. PRACTICE PROBLEM – PROBABLE ERROR Q16: Find Karl Pearson’s coefficient of correlation from the following data: Also calculate probable error and check whether it is significant or not. Ans: – 0.94, 0.032 Q17: A student calculates the value of r as 0.7 when N = 5. He concludes that r is highly significant. Comment. Ans: Insignificant BirinderSingh,AssistantProfessor,PCTE X 9 28 45 60 70 50 Y 100 60 50 40 33 57
  • 40. SPEARMAN’S RANK CORRELATION METHOD  Given by Prof. Spearman in 1904  By this method, correlation between qualitative aspects like intelligence, honesty, beauty etc. can be calculated.  These variables can be assigned ranks but their quantitative measurement is not possible.  It is denoted by R = 1 – 𝟔 𝜮𝑫 𝟐 𝑵 (𝑵 𝟐 −𝟏)  R = Rank correlation coefficient  D = Difference between two ranks (R1 – R2)  N = Number of pair of observations  As in case of r, – 1 ≤ R ≤ 1  The sum total of Rank Difference is always equal to zero. i.e. ƩD = 0. BirinderSingh,AssistantProfessor,PCTE
  • 41. THREE CASES Spearman’s Rank Correlation Method When ranks are given When ranks are not given When equal or tied ranks exist BirinderSingh,AssistantProfessor,PCTE
  • 42. PRACTICE PROBLEMS – RANK CORRELATION (WHEN RANKS ARE GIVEN) Q18: In a fancy dress competition, two judges accorded the following ranks to eight participants: Calculate the coefficient of rank correlation. Ans: .62 Q19: Ten competitors in a beauty contest are ranked by three judges X, Y, Z: Use the rank correlation coefficient to determine which pair of judges has the nearest approach to common tastes in beauty. Ans: X & Z BirinderSingh,AssistantProfessor,PCTE Judge X 8 7 6 3 2 1 5 4 Judge Y 7 5 4 1 3 2 6 8 X 1 6 5 10 3 2 4 9 7 8 Y 3 5 8 4 7 10 2 1 6 9 Z 6 4 9 8 1 2 3 10 5 7
  • 44. PRACTICE PROBLEMS – RANK CORRELATION (WHEN RANKS ARE NOT GIVEN) Q20: Find out the coefficient of Rank Correlation between X & Y: Ans: 0.48 BirinderSingh,AssistantProfessor,PCTE X 15 17 14 13 11 12 16 18 10 9 Y 18 12 4 6 7 9 3 10 2 5
  • 45. PRACTICE PROBLEMS – RANK CORRELATION (WHEN RANKS ARE EQUAL OR TIED)  When two or more items have equal values in a series, so common ranks i.e. average of the ranks are assigned to equal values.  Here R = 1 – 𝟔 𝜮𝑫 𝟐+ 𝒎 𝟑 −𝒎 𝟏𝟐 + 𝒎 𝟑 −𝒎 𝟏𝟐 + ………….. 𝑵 (𝑵 𝟐 −𝟏)  m = No. of items of equal ranks  The correction factor of 𝒎 𝟑 −𝒎 𝟏𝟐 is added to 𝜮𝑫 𝟐 for such number of times as the cases of equal ranks in the question BirinderSingh,AssistantProfessor,PCTE
  • 46. PRACTICE PROBLEMS – RANK CORRELATION (WHEN RANKS ARE EQUAL OR TIED) Q21: Calculate R: Ans: – 0.37 Q22: Calculate Rank Correlation: Ans: 0.43 BirinderSingh,AssistantProfessor,PCTE X 15 10 20 28 12 10 16 18 Y 16 14 10 12 11 15 18 10 X 40 50 60 60 80 50 70 60 Y 80 120 160 170 130 200 210 130
  • 47. IMPORTANT TYPICAL PROBLEMS – RANK CORRELATION Q23: Calculate Rank Correlation from the following data: Ans: 0.64 Q24: The coefficient of rank correlation of marks obtained by 10 students in English & Math was found to be 0.5. It was later discovered that the difference in the ranks in two subjects was wrongly taken as 3 instead of 7. Find the correct rank correlation. Ans: 0.26 Q25: The rank correlation coefficient between marks obtained by some students in English & Math is found to be 0.8. If the total of squares of rank differences is 33, find the number of students. Ans: 10 BirinderSingh,AssistantProfessor,PCTE Serial No. 1 2 3 4 5 6 7 8 9 10 Rank Difference -2 ? -1 +3 +2 0 -4 +3 +3 -2
  • 49. CONCURRENT DEVIATION METHOD  Correlation is determined on the basis of direction of the deviations.  Under this method, the direction of deviations are assigned (+) or (-) or (0) signs.  If the value is more than its preceding value, then its deviation is assigned (+) sign.  If the value is less than its preceding value, then its deviation is assigned (-) sign.  If the value is equal to its preceding value, then its deviation is assigned (0) sign.  The deviations dx & dy are multiplied to get dxdy. Product of similar signs will be (+) and for opposite signs will be (-).  Summing the positive dxdy signs, their number is counted. It is called CONCURRENT DEVIATIONS. It is denoted by C.  Formula used: rc = ± ± 𝟐𝑪 −𝒏 𝒏 where rc = Correlation of CD, C = No. of Concurrent Deviations, n = N – 1. BirinderSingh,AssistantProfessor,PCTE
  • 50. PRACTICE PROBLEMS – COEFFICIENT OF CONCURRENT DEVIATIONS Q26: Find the Coefficient of Concurrent Deviation from the following data: Ans: – 1 Q27: Find the Coefficient of Concurrent Deviation from the following data: Ans: – 0.75 BirinderSingh,AssistantProfessor,PCTE Year 2001 2002 2003 2004 2005 2006 2007 Demand 150 154 160 172 160 165 180 Price 200 180 170 160 190 180 172 X 112 125 126 118 118 121 125 125 131 135 Y 106 102 102 104 98 96 97 97 95 90
  • 51. COEFFICIENT OF DETERMINATION (COD)  CoD is used for the interpretation of coefficient of correlation and comparing the two or more correlation coefficients.  It is the square of the coefficient of correlation i.e. r2.  It explains the percentage variation in the dependent variable Y that can be explained in terms of the independent variable X.  If r = 0.8, r2 = 0.64, it implies that 64% of the total variations in Y occurs due to X. The remaining 34% variation occurs due to external factors.  So, CoD = r2 = 𝐸𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 𝑇𝑜𝑡𝑎𝑙 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒  Coefficient of Non Determination= K2 = 1 – r2 = 𝑈𝑛𝑒𝑥𝑝𝑙𝑎𝑖𝑛𝑒𝑑 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒 𝑇𝑜𝑡𝑎𝑙 𝑉𝑎𝑟𝑖𝑎𝑛𝑐𝑒  Coefficient of Alienation = 1 – r2 BirinderSingh,AssistantProfessor,PCTE
  • 52. PRACTICE PROBLEMS – COD Q28: The coefficient of correlation between consumption expenditure (C) and disposable income (Y) in a study was found to be +0.8. What percentage of variation in C are explained by variation in Y? Ans: 64% BirinderSingh,AssistantProfessor,PCTE
  • 53. CLASS TEST Q1: In a fancy dress competition, two judges accorded the following ranks to eight participants: Calculate the coefficient of rank correlation. Q2: Following results were obtained from an analysis: N = 12, ƩXY = 334, ƩX = 30, ƩY = 5, ƩX2 = 670, ƩY2 = 285 Later on it was discovered that one pair of values (X = 11, Y = 4) were wrongly copied. The correct value of the pair was (X = 10, Y = 14). Find the correct value of correlation coefficient. BirinderSingh,AssistantProfessor,PCTE Judge X 8 7 6 3 2 1 5 4 Judge Y 7 5 4 1 3 2 6 8