Variance
Variance is a statistical tool used to measure the spread or dispersion of a set of data points around the mean. It shows how much the values deviate from the average, helping us understand data variability. The larger the variance, the more spread out the data is. From basic statistics to complex data science models, analyzing variance is essential in identifying consistency, trends, and outliers.
1.0Variance Meaning in Statistics
In statistics, variance refers to the average of the squared deviations from the mean of a data set. It quantifies how much the numbers differ from the average value.
Mathematically, the statistical variance of a data set tells us whether the data points are tightly grouped or widely scattered.
Formula for Calculating the Variance
For a Population (σ²): σ2=N1∑i=1N(xi−μ)2
For a Sample (s²): s2=n−11∑(xi−xˉ)2
Where:
- xi = each value
- μ= population mean
- xˉ = sample mean
- N = population size
- n = sample size
2.0How to Find the Variance – Step-by-Step
- Find the Mean of the data.
- Subtract the Mean from each data point (deviation).
- Square each deviation.
- Sum all squared deviations.
- Divide the sum by:
- n (for population)
- n - 1 (for sample)
3.0Example of Variance (Sample)
Q: Find the variance of the following data: 5, 7, 3, 7, 10
Step 1: Find the Mean ( xˉ )
xˉ=55+7+3+7+10532=6.4
Step 2: Subtract Mean and Square
(5−6.4)2=1.96(7−6.4)2
=0.36(3−6.4)2
=11.56(7−6.4)2
=0.36(10−6.4)2
=12.96
Step 3: Add the Squares
Sum= 1.96+0.36+11.56+0.36+12.96 = 27.2
Step 4: Divide by n - 1
s2=427.2=6.8
Answer: Variance = 6.8
4.0Solved Examples of Variance
Example 1: If the variance of a random variable X is 4, then find the variance of the variable Y = 3X + 5.
Solution:
We know that:
Var(aX+b)=a2.Var(X)
Here, a = 3, b = 5, and Var(X) = 4
Var(Y)=32.4=9×4=36
Example 2: Find the sample variance of the data: 8, 10, 6, 4, 12
Solution:
Step 1: Find the Mean
xˉ=58+10+6+4+12=540=8
Step 2: Calculate squared deviations
(8−8)2=0
(10−8)2=4
(6−8)2=4
(4−8)2=16
12−8)2=16
Sum = 0 + 4 + 4 + 16 + 16 = 40
Step 3: Sample Variance = s2=5−140=440=10
Example 3: If X is a random variable and kk is a constant, then what is Var(X+k)?
A. Var(X)
B. Var(X) + k
C. Var(X) - k
D. Var(X)+k2
Answer: A
Explanation: Adding a constant does not change the variance.
Var(X+k)=Var(X)
Correct Option: A
Example 4: Two groups of students have the following data:
- Group A: n1 = 5, mean = 4, variance = 2
- Group B: n2 = 7, mean = 6, variance = 3
Find the combined variance.
Solution:
Use the formula:
Combined Variance=n1+n2n1σ12+n2σ22+n1+n2n1n2(xˉ1−xˉ2)2.
Plug in values:
=125⋅2+7⋅3+125⋅7(4−6)2.
=1210+21+1235⋅4
=1231+12140.
=1231+11.67
=1242.67≈3.56
Example 5: A frequency distribution is given as:
Find the variance.
Solution:
Step 1: Find the mean xˉ
xˉ=∑f∑fx=1+2+3+2+2(2)(1)+(4)(2)+(6)(3)+(8)(2)+(10)(2)
xˉ=102+8+18+16+20
xˉ=1064=6.4
Step 2: Find ∑f(x−xˉ)2
2(2−6.4)2=19.36⋅1=19.36
(4−6.4)2=5.76⋅2=11.52
(6−6.4)2=0.16⋅3=0.48
(8−6.4)2=2.56⋅2=5.12
(10−6.4)2=12.96⋅2=25.92
Total = 19.36 + 11.52 + 0.48 + 5.12 + 25.92 = 62.4
Step 3: Variance = 1062.4=6.24
5.0Why is Variance Important?
- It helps in analyzing data consistency.
- Used in risk assessment in finance and insurance.
- Essential in machine learning models for feature analysis.
- Forms the basis for standard deviation and hypothesis testing.
6.0Difference Between Variance and Standard Deviation
7.0Practice: Find the Variance of the Following Data
- Find the variance of: 4, 6, 8, 10
- Find the variance of: 12, 15, 18, 21, 24
- Find the sample variance of: 9, 7, 5, 3
8.0Sample Question on Variance
Q1: How do I find variance in statistics?
Use the formula: s2=n−11∑(xi−xˉ)2
First, find the mean, then compute squared deviations and average them.
Also Read: