In mathematics and statistics, data is a collection of numerical values or observations that help us understand patterns within. To summarise data effectively, we use specialised measured values, also known as the measures of central tendency—mean, median, and mode. Here, we will focus on the median of numbers — how to find it, its formula (for grouped and ungrouped data), and its importance in statistics and data analysis.
The median in statistics is basically the value of a dataset that lies exactly in the middle of the dataset, when arranged in an ascending or descending order. Simply put, the median tells us the central position of the data — the point that divides the dataset into two halves.
It is unaffected by the outliers or the extreme values. That is, other values may vary widely, but they don’t affect the median unless they shift the order of the middle values.
The median in statistics is particularly useful when working with real data. The median is not affected, unlike the mean, by very high or very low values (outliers). For instance, where the majority of students in a class got between 60 and 80 marks, but one student got 10 and another 100, the average will be inaccurate. The median is a better indication of what a "typical" score is like.
Just like other central tendencies, the Median is also calculated differently for grouped and ungrouped data. Here’s how to find the median for both types of data:
Ungrouped data refers to a collection of individual numbers that are not sorted or organised in any specific class intervals. To find the median of ungrouped data, follow these easy steps:
Step 1: Arrange the data in order from smallest to largest (in increasing order) or largest to smallest (in decreasing order).
Step 2: Count the number of values in a data set, which is generally denoted by n.
Step 3: Check the nature of n, whether it is odd or even:
Grouped data is data that has been arranged into class intervals along with their corresponding frequencies. Since the exact values within each class are not given, we use a formula to estimate the median. Here’s the median formula for grouped data:
Here:
Cumulative Frequency Table
Cumulative frequency means the total number of values added up one after another. It shows how many values are less than or equal to a certain number or class in a data set. Here’s how the frequency table column for the data set is formed:
Fun Fact: The sum of all the frequencies of a dataset is always equal to the last cumulative frequency of the last class interval.
Problem 1: The following table gives the frequencies of monthly incomes of workers. Find the median income.
Solution: Let’s first convert a frequency table:
According to the newly calculated frequency table , therefore,
Now using the formula:
Median= 10,000+ 4333.33=Rs. 14,333.33
Problem 2: The table below shows the marks obtained by students in a test. Two frequencies are missing, represented by x and y. The total number of students is 60, and the median is given as 32.5. Find the values of x and y.
Solution: First, construct the cumulative frequency table:
According to the new frequency table , therefore:
The total number of students (f) = 60
35 + x + y = 60
x + y = 25 ………………………….(1)
Now, using the formula for median:
y= (17-x)4
y=68-4x
4x+y=68 …(2)
Eliminating equation 2 from 1
x + y = 25
4x+y = 68
________
–3x = – 33
x = 11
Now, put this value of x in equation 1:
11 + y = 25
y = 25 – 11 = 14
Hence, the value of x and y is 11 and 14, respectively.
Problem 3: A group of 11 students appeared for a rapid-fire quiz. Their scores out of 20 are as follows, but one score is missing: 12, 15, 10, 13, 14, 16, 18, 11, 17, 19, x. If the median score is known to be 15, find the missing score x.
Solution: According to the question:
n = 11
Median or the order: 15
Median of odd number of values =th value of the dataset
Median = th=6th term
Now, arrange the given numbers in increasing order: 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, x
Let's put x in the end. But according to our observation, the 6th term should be 15, but according to our assumed order, it is 16.
Hence, our assumption is incorrect, and x is the 6th term, which is 15, in the observation.
(Session 2026 - 27)