• NEET
      • Class 11th
      • Class 12th
      • Class 12th Plus
    • JEE
      • Class 11th
      • Class 12th
      • Class 12th Plus
    • Class 6-10
      • Class 6th
      • Class 7th
      • Class 8th
      • Class 9th
      • Class 10th
    • View All Options
      • Online Courses
      • Distance Learning
      • Hindi Medium Courses
      • International Olympiad
    • NEET
      • Class 11th
      • Class 12th
      • Class 12th Plus
    • JEE (Main+Advanced)
      • Class 11th
      • Class 12th
      • Class 12th Plus
    • JEE Main
      • Class 11th
      • Class 12th
      • Class 12th Plus
  • Classroom
    • NEET
      • 2025
      • 2024
      • 2023
      • 2022
    • JEE
      • 2025
      • 2024
      • 2023
      • 2022
    • Class 6-10
    • JEE Main
      • Previous Year Papers
      • Sample Papers
      • Result
      • Analysis
      • Syllabus
      • Exam Date
    • JEE Advanced
      • Previous Year Papers
      • Sample Papers
      • Mock Test
      • Result
      • Analysis
      • Syllabus
      • Exam Date
    • NEET
      • Previous Year Papers
      • Sample Papers
      • Mock Test
      • Result
      • Analysis
      • Syllabus
      • Exam Date
      • College Predictor
      • Counselling
    • NCERT Solutions
      • Class 6
      • Class 7
      • Class 8
      • Class 9
      • Class 10
      • Class 11
      • Class 12
    • CBSE
      • Notes
      • Sample Papers
      • Question Papers
    • Olympiad
      • NSO
      • IMO
      • NMTC
  • NEW
    • TALLENTEX
    • AOSAT
  • ALLEN E-Store
    • ALLEN for Schools
    • About ALLEN
    • Blogs
    • News
    • Careers
    • Request a call back
    • Book home demo
Home
JEE Maths
Karl Pearson’s Coefficient of Correlation

Karl Pearson’s Coefficient of Correlation

Correlation is a statistical measure that shows the degree of relationship between two variables. Karl Pearson’s coefficient of correlation is one of the most widely used methods for determining the strength and direction of this relationship. It plays an important role in statistics, data science, and also in competitive exams like JEE Main, JEE Advanced, and other entrance tests.

1.0Karl Pearson’s Coefficient of Correlation – Definition and Theory

  • Karl Pearson’s coefficient of correlation measures the linear relationship between two variables.
  • It is denoted by ‘r’.
  • The value of Karl Pearson’s coefficient of correlation lies between -1 and +1.
    • If r = +1 → Perfect positive correlation 
    • If r = -1 → Perfect negative correlation
    • If r = 0 → No correlation
  • Karl Pearson’s coefficient of correlation method uses covariance and standard deviation.

Formula:

r=σX​σY​Cov(X,Y)​

Where,

  • Cov(X,Y) = Covariance between variables X and Y
  • σX​,σY​ = Standard deviations of X and Y

2.0Methods to Calculate Karl Pearson’s Coefficient of Correlation

1. Direct Method

This is the standard method using raw data.

r=[n∑x2−(∑x)2][n∑y2−(∑y)2]​n∑xy−(∑x)(∑y)​

  • n: Number of pairs of observations
  • x, y: Data values of two variables

Use this method when numbers are small and simple.

2. Assumed Mean Method (Short-cut Method)

When data values are large, calculations can be simplified by taking deviations from assumed means.

Let:

dx​=x−Ax​,dy​=y−Ay​

Where Ax​,Ay​ are assumed means.

Formula:

r=∑dx2​⋅∑dy2​​∑dx​dy​​

This reduces calculation effort.

3. Step Deviation Method (Coding Method)

When data is very large or has a common difference (like in grouped data), divide deviations by a factor.

dx​=hx​x−Ax​​,dy​=hy​y−Ay​​

Formula becomes:

r=∑dx2​⋅∑dy2​​∑dx​dy​​

This method is useful for grouped frequency distributions.

4. Using Covariance Formula

Another approach is to use:

r=σX​σY​Cov(X,Y)​

  • First, calculate the covariance of X and Y.
  • Then divide by the product of their standard deviations.

Very common in statistics and probability questions.

So, the main methods are:

  1. Direct Method
  2. Assumed Mean Method
  3. Step Deviation Method
  4. Covariance Method

3.0Examples Based on Methods to Calculate Karl Pearson’s Coefficient of Correlation

Method 1 — Direct (raw-sum) formula

Example 1 (Direct method)

Data: (X,Y) = (1, 2), (2, 3), (3, 5), (4, 4). Compute r.

Solution

Compute sums:

n=4,∑x=10,∑y=14,∑xy=39,∑x2=30,∑y2=54.

Use raw-sum formula

r=(n∑x2−(∑x)2)(n∑y2−(∑y)2)​n∑xy−(∑x)(∑y)​.

Numerator: 4⋅39−10⋅14=156−140=16.

Denominator: (120−100)(216−196)​=20⋅20​=20.

So

r=2016​=0.8

Method 2 — Assumed-mean (shortcut) method

Example 2 (Assumed mean)

Data: X = {98, 102, 100, 105, 95}, Y = {200, 210, 205, 215, 195}.

Take assumed means Ax​=100,Ay​=205.

Solution

Compute deviations dx​=x−Ax​,dy​=y−Ay​ :

x

dx​

y

dy​

dx​dy​

dx2​

dy2​

98

-2

200

-5

10

4

25

102

2

210

5

10

4

25

100

0

205

0

0

0

0

105

5

215

10

50

25

100

95

-5

195

-10

50

25

100

Sum




120

58

250

Now

r=(∑dx2​)(∑dy2​)​∑dx​dy​​=58⋅250​120​.

Compute denominator: 14500​≈120.415. Hence

r≈120.415120​≈0.9965

(very strong positive linear association).

Note: assumed-mean reduces arithmetic — you only work with small deviations.

Method 3 — Step-deviation / coding method (useful when data have constant step)

Example 3 (Step-deviation)

Data: X = {100, 110, 120, 130}, Y = {45, 50, 60, 65}.

Use h = 10. Let u=(x−Ax)/h, v=(y−Ay)/hwithAx​=115, Ay​=55.

Solution

Compute coded deviations:

x

u = (x-115)/10

y

v = (y-55)/10

uv

u2

v2

100

-1.5

45

-1.0

1.5

2.25

1.00

110

-0.5

50

-0.5

0.25

0.25

0.25

120

0.5

60

0.5

0.25

0.25

0.25

130

1.5

65

1.0

1.5

2.25

1.00

Sum

0


0

3.5

5

2.5

Then

r=(∑u2)(∑v2)​∑uv​=5⋅2.5​3.5​=12.5​3.5​≈3.53553.5​≈0.990.

Because we scaled both variables by constants, r is unchanged — coding just simplifies arithmetic.

Method 4 — Covariance and standard deviations (conceptual form)

Example 4 (Covariance method, same data as Example 1)

Data: (1, 2), (2, 3), (3, 5), (4, 4). We already found r = 0.8. Show via covariance.

Solution

Means: xˉ=2.5,yˉ​=3.5.

Compute sums of deviations:

x

x−xˉ

y

y−yˉ​

(x−xˉ)(y−yˉ​)

(x−xˉ)2

(y−yˉ​)2

1

-1.5

2

-1.5

2.25

2.25

2.25

2

-0.5

3

-0.5

0.25

0.25

0.25

3

0.5

5

1.5

0.75

0.25

2.25

4

1.5

4

0.5

0.75

2.25

0.25

Sum




4.0

5.0

5.0

Population covariance: Cov(X,Y)=n∑(x−xˉ)(y−yˉ​)​=44​=1.

Population standard deviations: 

σx​=45​​=1.25​,σy​=1.25​.
Thus

r=σX​σY​Cov(X,Y)​=(1.25​)21​=1.251​=0.8, same result as Example 1.

4.0Solved Examples on Karl Pearson’s Coefficient of Correlation 

Example 1: Find Karl Pearson’s coefficient of correlation for the following data:

X

1

2

3

4

5

Y

2

4

6

8

10

Solution:

Here, Y = 2X.

Clearly, the relation is perfectly linear.

Thus, r = +1.

Example 2: Given the following data, calculate Karl Pearson’s coefficient of correlation:

X

10

12

13

16

17

18

19

Y

40

38

43

45

37

43

49

Solution:

Step 1: Compute mean of X and Y. 

Xˉ=710+12+13+16+17+18+19​=15

Yˉ=740+38+43+45+37+43+49​=42.14

Step 2: Find deviations

dx=X−Xˉ,dy=Y−Yˉ.

Step 3: Apply formula:

r=∑dx2⋅∑dy2​∑dx⋅dy​

After calculations:

r≈0.82

So, there is a strong positive correlation.

Example 3: If the correlation coefficient between X and Y is r = 0.6, standard deviation of X = 3, standard deviation of Y = 4, and covariance of (X, Y) = ?

Solution:
We know,

r=σX​σY​Cov(X,Y)​0.6=3×4Cov(X,Y)​Cov(X,Y)=0.6×12=7.2

Example 4: For the data X = {1, 2, 3}, Y = {6, 4, 2}, find r.

Solution:

Here Y = -2X + 8 (perfect linear with negative slope). Hence correlation is perfect negative:

r=−1. 

Example 5: For data (X, Y): (1, 2), (2, 3), (3, 5), (4, 4). Compute Karl Pearson’s r.

Solution:

Compute sums (careful arithmetic):

  n = 4

∑x=1+2+3+4=10∑y=2+3+5+4=14∑xy=1⋅2+2⋅3+3⋅5+4⋅4=2+6+15+16=39∑x2=1+4+9+16=30∑y2=4+9+25+16=54

Use formula

r=(n∑x2−(∑x)2)(n∑y2−(∑y)2)​n∑xy−(∑x)(∑y)​.

Numerator: 4⋅39−10⋅14=156−140=16.

Denominator: (4⋅30−100)(4⋅54−196)​=(120−100)(216−196)​=20⋅20​=20.

So r = 16/20 = 0.8.

r = 0.8 — strong positive linear correlation.

Example 6: Paired grouped data (midpoints used as scores): (xi​,fi​,yi​): (5,4,50), (15,6,60), (25,5,70). 

Compute r treating fi​ as weights.

Solution:

First compute weighted sums (N = total frequency = 4 + 6 + 5 = 15).

= 1000 + 5400 + 8750 = 15150.

∑fx=5⋅4+15⋅6+25⋅5=20+90+125=235.∑fy=50⋅4+60⋅6+70⋅5=200+360+350=910.∑fx2=52⋅4+152⋅6+252⋅5=100+1350+3125=4575.∑fy2=502⋅4+602⋅6+702⋅5=10000+21600+24500=56100.∑fxy=(5⋅50)⋅4+(15⋅60)⋅6+(25⋅70)⋅5= 1000 + 5400 + 8750 = 15150.

Now use weighted form:

r=(N∑fx2−(∑fx)2)(N∑fy2−(∑fy)2)​N∑fxy−(∑fx)(∑fy)​.

Numerator: 15⋅15150−235⋅910=227250−213850=13400.

First variance term: 15⋅4575−2352=68625−55225=13400.

Second variance term: 15⋅56100−9102=841500−828100=13400.

Denominator: 13400⋅13400​=13400.

Hence r = 13400/13400 = 1.

r = 1 — perfect positive linear correlation (weighted).

Example 7: Given regression coefficients bxy​=0.8 (regression of Y on X) and byx​=0.45 (regression of X on Y), find r.

Solution:

Property: r2=bxy​byx​ Sign of r equals sign of both coefficients (if both positive → r > 0).

Compute r2=0.8×0.45=0.36.

So r=0.36​=0.6.

r = 0.6. 

Example 8: If r = -0.5, σX​=4,σY​=3, find Cov(X, Y).

Solution:

Cov(X,Y)=−6.

Cov(X,Y)=rσX​σY​=−0.5×4×3=−6.Cov(X,Y)=−6.

Example 9: For a sample of n = 10 pairs you observe r = 0.8. Test significance at 5% level (two-tailed). Use t=1−r2​rn−2​​.

Solution:

Degrees of freedom = n – 2 = 8. Compute

t=0.81−0.648​​=0.80.368​​=0.822.222​≈0.8×4.714=3.771.

Critical two-tailed t0.025,8​≈2.306.Since3.771>2.306, the correlation is statistically significant at 5% level.

Reject H0​:r=0; significant correlation.

5.0Practice Questions on Karl Pearson’s Coefficient of Correlation

  1. Calculate Karl Pearson’s coefficient of correlation for the following:

X

1

2

3

4

5

Y

9

8

10

12

11

  1. If the correlation coefficient r = -0.75, σX​=5,σY​=4, find covariance.
  2. For two variables, the regression coefficients are , .b_{xy} = 0.8, \quad b_{yx} = 0.5 Find the value of Karl Pearson’s coefficient of correlation.
  3. Compute r for (X, Y) = (1, 1), (2, 2), (3, 2), (4, 3).

Answer hint: use raw-sum formula.

  1. For a sample n = 6, you get r = 0.45. Is it significant at 5%? (df = 4, t0.025,4​≈2.776 ).
    Answer hint: compute “t” as in Example 6.
  2. Given ∑x=40,∑y=50,∑x2=350,∑y2=520,∑xy=430 for n = 10. Find r.

Answer hint: apply the raw-sum formula.

  1. If r=0.3, σX​=5,σY​=2, compute covariance.
  2. Paired grouped data: (x, f, y) = (2, 3, 10), (6, 2, 20), (10, 5, 30). Compute weighted r.

Table of Contents


  • 1.0Karl Pearson’s Coefficient of Correlation – Definition and Theory
  • 2.0Methods to Calculate Karl Pearson’s Coefficient of Correlation
  • 2.11. Direct Method
  • 2.22. Assumed Mean Method (Short-cut Method)
  • 2.33. Step Deviation Method (Coding Method)
  • 2.44. Using Covariance Formula
  • 3.0Examples Based on Methods to Calculate Karl Pearson’s Coefficient of Correlation
  • 4.0Solved Examples on Karl Pearson’s Coefficient of Correlation 
  • 5.0Practice Questions on Karl Pearson’s Coefficient of Correlation

Frequently Asked Questions

It measures the strength and direction of a linear relationship between two variables.

It always lies between -1 and +1.

It is denoted by the letter r.

It means there is no linear correlation between the two variables.

Because it gives both the magnitude and the direction of the relationship, making it a robust method for correlation analysis.

Join ALLEN!

(Session 2025 - 26)


Choose class
Choose your goal
Preferred Mode
Choose State
  • About
    • About us
    • Blog
    • News
    • MyExam EduBlogs
    • Privacy policy
    • Public notice
    • Careers
    • Dhoni Inspires NEET Aspirants
    • Dhoni Inspires JEE Aspirants
  • Help & Support
    • Refund policy
    • Transfer policy
    • Terms & Conditions
    • Contact us
  • Popular goals
    • NEET Coaching
    • JEE Coaching
    • 6th to 10th
  • Courses
    • Online Courses
    • Distance Learning
    • Online Test Series
    • International Olympiads Online Course
    • NEET Test Series
    • JEE Test Series
    • JEE Main Test Series
  • Centers
    • Kota
    • Bangalore
    • Indore
    • Delhi
    • More centres
  • Exam information
    • JEE Main
    • JEE Advanced
    • NEET UG
    • CBSE
    • NCERT Solutions
    • Olympiad
    • NEET 2025 Results
    • NEET 2025 Answer Key
    • NEET College Predictor
    • NEET 2025 Counselling

ALLEN Career Institute Pvt. Ltd. © All Rights Reserved.

ISO