Home
JEE Maths
Correlation Coefficient

Correlation Coefficient 

In the world of statistics, correlation coefficient is an essential measure used to describe the relationship between two variables. Whether you’re a student of statistics, a data analyst, or a researcher, understanding the correlation coefficient can help you gain insights into how variables interact. In this section, we’ll explore what the correlation coefficient is, how it’s calculated, its types, and how to interpret it.

1.0What is Correlation Coefficient?

A correlation coefficient is a statistical measure that describes the strength and direction of the relationship between two variables. It tells us whether the variables tend to increase or decrease together and the degree of this association. The correlation coefficient can vary from -1 to +1:

  • +1 signifies a perfect positive correlation (as one variable increases, the other increases proportionally).
  • -1 signifies a perfect negative correlation (as one variable increases, the other decreases proportionally).
  • 0 signifies no linear relationship between the variables.

Image showing the graph of positive correlation, negative correlation and no correlation

Pearson's correlation coefficient is the most widely used measure of correlation, but there are others, such as Spearman’s rank correlation, which we will also discuss.

2.0Calculating Pearson’s Correlation Coefficient

Pearson’s correlation coefficient (r) is the most widely used measure of correlation. It is calculated using the following formula:

Where:

  • and are the individual data points in the datasets X and Y,
  • and are the means of the datasets X and Y.

This formula computes the covariance of the variables divided by the product of their standard deviations. The result, r, tells you how strongly the variables X and Y are related.

3.0Correlation Coefficient and Coefficient of Determination

The coefficient of determination, denoted as , is closely related to the correlation coefficient. It represents the proportion of the variance in one variable that can be explained by the other variable. If r = 0.8, then , meaning that 64% of the variance in one variable can be explained by the other. The coefficient of determination is particularly useful when analyzing linear regression models.

4.0Interpreting The Correlation Coefficient

The value of the correlation coefficient indicates the strength and direction of the relationship:

  • Positive correlation (r > 0): As one variable increases, the other variable also increases.
  • Negative correlation (r < 0): As one variable increases, the other decreases.
  • No correlation (r = 0): There is no linear relationship between the variables.

In general:

  • 0.1 to 0.3: Weak positive correlation.
  • 0.3 to 0.5: Moderate positive correlation.
  • 0.5 to 1: Strong positive correlation.
  • -0.1 to -0.3: Weak negative correlation.
  • -0.3 to -0.5: Moderate negative correlation.
  • -0.5 to -1: Strong negative correlation.

5.0Correlation Coefficient Example

Let’s say you want to determine the correlation coefficient between the number of hours studied (X) and the scores on a test (Y). Using the data points, you can calculate the Pearson’s correlation coefficient to see if there’s a strong or weak relationship between these two variables.

Example data:

X = [1, 2, 3, 4, 5], Y = [2, 4, 5, 4, 5] 

Calculating the correlation coefficient for this data would give you a numerical value that tells you the degree to which hours studied is related to test scores.

6.0Formula for Spearman’s Rank Correlation Coefficient

While Pearson’s correlation measures linear relationships, Spearman’s rank correlation coefficient (ρ) measures the strength and direction of a monotonic relationship between two variables. It is used when the data is not linear or when the variables are ordinal (ranked).

The formula for Spearman’s rank correlation is:

Where:

  • is the difference between the ranks of corresponding values in the two variables,
  • n is the number of data points.

Spearman’s rank correlation coefficient also ranges from –1 to +1, with the same interpretation as Pearson’s.

7.0Types of Correlation Coefficients

There are several types of correlation coefficients, depending on the nature of the data:

  1. Pearson’s correlation coefficient: Used for measuring the linear relationship between two continuous variables.
  2. Spearman’s rank correlation coefficient: Used for ordinal data or when the relationship is not linear but monotonic.
  3. Kendall’s tau: Another non-parametric measure of association, commonly used for ordinal data.

8.0Determining Correlation Coefficient

To determine the correlation coefficient:

  1. Choose the type of correlation (Pearson, Spearman, Kendall, etc.).
  2. Collect the data points for the two variables.
  3. Use the appropriate formula to calculate the correlation coefficient.
  4. Interpret the result to understand the strength and direction of the relationship.

9.0Solved Example On Correlation Coefficient

Example 1: Given the following data for two variables, X and Y, calculate the Pearson correlation coefficient.

X

Y

1

2

2

3

3

5

4

7

5

8

Solution: 

Step 1: Calculate the necessary sums.

Step 2: Apply the Pearson formula.

The formula for the Pearson correlation coefficient is:

Where:

  • n = 5 (the number of data points)

Step 3: Plug in the values into the formula.

Simplify the numerator:

Step 4: Interpretation

The correlation coefficient r ≈ 0.994 indicates a very strong positive linear relationship between variables X and Y.

Example 2: Calculate the correlation coefficient for the following height (in inches) of the father (x) and their son (y).

x

y

65

67

66

68

67

65

67

68

68

72

69

72

70

69

72

71

Solution:

x

y

x2

y2

xy

65

67

4225

4489

4355

66

68

4356

4624

4488

67

65

4489

4225

4355

67

68

4489

4624

4556

68

72

4624

5184

4896

69

72

4761

5184

4830

70

69

4900

4761

4830

72

71

5184

5041

5112

552

37028

38131

37560

We know that

= 4635 – 4692 = 3

= 2.121

= 2.345

Then

Frequently Asked Questions

Join ALLEN!

(Session 2025 - 26)


Choose class
Choose your goal
Preferred Mode
Choose State