Does the CLT require normal population data?

No. The whole point is that the sample mean becomes approximately normal even when the population is not.

Can I use CLT for sums as well as means?

Yes. The CLT applies to normalized sums; the mean is a scaled sum.

What if population variance is unknown?

If n is large, you can replace $σ$ with the sample standard deviation s and use the normal approximation. For small n and a normal population, use Student's t-distribution for inference about the mean.

Does CLT hold for dependent data?

The classical CLT assumes independence. There are generalized CLTs (e.g., for weakly dependent sequences, mixing conditions, martingales), but you must check specific assumptions.

When is continuity correction needed?

When approximating a discrete distribution (such as the binomial) with the normal for small or medium n, a continuity correction improves accuracy. For large n, it is less important.
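To illustrate the continuity correction, here is a minimal sketch that compares an exact binomial probability with the normal approximation, with and without the correction. The values n = 20, p = 0.4, k = 10 and the helper `binom_cdf` are assumptions chosen for illustration, not taken from the article.

```python
from math import comb, sqrt
from statistics import NormalDist

def binom_cdf(k, n, p):
    """Exact P(X <= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p)**(n - i) for i in range(k + 1))

# Illustrative values (assumptions, not from the article).
n, p, k = 20, 0.4, 10

# Normal distribution with the same mean and standard deviation as the binomial.
approx = NormalDist(mu=n * p, sigma=sqrt(n * p * (1 - p)))

exact = binom_cdf(k, n, p)
plain = approx.cdf(k)            # no correction
corrected = approx.cdf(k + 0.5)  # continuity correction: evaluate at k + 0.5

print(f"exact     = {exact:.4f}")
print(f"plain     = {plain:.4f}")
print(f"corrected = {corrected:.4f}")
```

With these values the corrected approximation should land noticeably closer to the exact probability than the uncorrected one.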

Are there stronger versions of CLT?

Yes. Lindeberg and Lyapunov CLTs relax identical distribution assumptions and provide precise conditions for convergence.

How accurate is the normal approximation?

Accuracy depends on skewness, kurtosis, and sample size. The Berry–Esseen theorem bounds the approximation error in terms of the third absolute moment.
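For reference, a standard statement of the Berry–Esseen bound for i.i.d. variables is sketched below. The symbols $F_{n}$ (the CDF of the standardized sample mean $\sqrt{n} (\bar{X} - μ) / σ$), $ρ$ (the third absolute central moment), and $C$ (an absolute constant, known to be below 0.5) are notation introduced here, not taken from the article.

$\sup_{x} | F_{n} (x) - Φ (x) | \le \frac{C ρ}{σ^{3} \sqrt{n}}, \quad ρ = E | X - μ |^{3}$

Under these assumptions the worst-case error shrinks at rate $1 / \sqrt{n}$, and it is larger for populations with a heavy third moment.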

Central Limit Theorem

The Central Limit Theorem (CLT) is one of the most important results in statistics. It states that when we take sufficiently large random samples from any population with a finite mean and variance, the sampling distribution of the sample mean will be approximately normally distributed, regardless of the shape of the original population.

In simple words, even if your original data is skewed or irregular, the distribution of the sample means tends toward a bell curve as the sample size increases.

1.0 Central Limit Theorem Statement

Statement:
If a random sample of size n is taken from a population with mean $μ$ and finite standard deviation $σ$, then as n becomes large, the distribution of the sample mean $\bar{X}$ approaches a normal distribution with mean $μ$ and standard deviation $\frac{σ}{\sqrt{n}}$.

2.0 Central Limit Theorem Formula

The formula for the central limit theorem for the sample mean is:

$Z = \frac{\bar{X} - μ}{σ / \sqrt{n}}$

Where:

$\bar{X}$ = Sample mean
$μ$ = Population mean
$σ$ = Population standard deviation
n = Sample size

3.0 Central Limit Theorem Equation

The approximate probability density function of the sampling distribution of the mean (for large n) is:

$f (\bar{x}) = \frac{1}{\sqrt{2 π σ^{2} / n}} e^{- \frac{( \bar{x} - μ )^{2}}{2 ( σ^{2} / n )}}$

This is the normal distribution equation adapted for sample means.

4.0 Central Limit Theorem Explanation

The CLT works because when independent random variables are added, their normalized sum tends to follow a normal distribution, even if the original variables themselves are not normally distributed. This is why the normal distribution appears so often in real-world data analysis.

5.0 Central Limit Theorem Example

Example:
Suppose the average height of students in a school is unknown, but the heights are skewed. You randomly select samples of 50 students at a time and record the average height for each sample. If you repeat this many times, the histogram of those sample averages will form an approximate normal curve, even though the original height distribution was skewed.
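The repeated-sampling experiment above can be sketched as a small simulation. Real height data is not available here, so this sketch assumes a skewed stand-in population (exponential with mean 5); the names `POP_MEAN`, `SAMPLE_SIZE`, `NUM_SAMPLES`, and `sample_mean` are hypothetical.

```python
import random
from statistics import mean, stdev

random.seed(0)  # fixed seed so the run is repeatable

POP_MEAN = 5.0      # assumed skewed population: exponential with mean 5 (sd is also 5)
SAMPLE_SIZE = 50    # students per sample, as in the example
NUM_SAMPLES = 2000  # how many times the sampling is repeated

def sample_mean(n):
    """Draw n values from the population and return their average."""
    # random.expovariate takes the rate, which is 1 / mean
    return mean(random.expovariate(1 / POP_MEAN) for _ in range(n))

means = [sample_mean(SAMPLE_SIZE) for _ in range(NUM_SAMPLES)]

print("mean of sample means:", round(mean(means), 3))
print("sd of sample means:  ", round(stdev(means), 3))
print("sigma / sqrt(n):     ", round(POP_MEAN / SAMPLE_SIZE ** 0.5, 3))
```

The first printed value should sit close to the population mean and the second close to $σ / \sqrt{n}$; a histogram of `means` would look roughly bell-shaped.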

6.0 Application of the Central Limit Theorem

The application of the central limit theorem is widespread in statistics and data science:

Confidence Intervals – Allows us to estimate population parameters from sample data.
Hypothesis Testing – Forms the foundation for many statistical tests like the Z-test and t-test.
Quality Control – Used in manufacturing to monitor processes.
Finance – Used in modeling returns and risk assessment.
Polling & Surveys – Helps predict population opinions from small samples.

7.0 Solved Examples on Central Limit Theorem

Example 1: Population mean $μ = 50$, population standard deviation $σ = 8$. A random sample of n = 36 is taken. Find $P (\bar{X} > 52)$.

Solution.

Compute standard error: $σ_{\bar{X}} = \frac{σ}{\sqrt{n}} = \frac{8}{\sqrt{36}} = \frac{8}{6} \approx 1.333333$.
Compute z-score:

$Z = \frac{52 - 50}{1.333333} = \frac{2}{1.333333} = 1.5.$

Use standard normal: $P (\bar{X} > 52) = P (Z > 1.5)$.

From normal table: $P (Z > 1.5) \approx 0.0668072$ .

Answer: 0.0668 (approx.)
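As a quick check of Example 1, the same steps can be reproduced with Python's standard library. This is only a sketch; the variable names are chosen here for illustration.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 50, 8, 36           # values from Example 1

se = sigma / sqrt(n)               # standard error: 8 / 6
z = (52 - mu) / se                 # z-score of the threshold 52
p_upper = 1 - NormalDist().cdf(z)  # P(Z > z) under the standard normal

print(round(se, 4), round(z, 2), round(p_upper, 4))
```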

Example 2: True proportion p = 0.40. Sample size n = 200. Find $P (0.35 < \hat{p} < 0.45)$.

Solution.

Mean of $\hat{p}$ is 0.40. Standard error:

$σ_{\hat{p}} = \sqrt{\frac{p ( 1 - p )}{n}} = \sqrt{\frac{0.4 \times 0.6}{200}} = \sqrt{\frac{0.24}{200}} = \sqrt{0.0012} \approx 0.034641$

Convert bounds to z-scores:

$Z_{low} = \frac{0.35 - 0.40}{0.034641} \approx - 1.4434, Z_{high} = \frac{0.45 - 0.40}{0.034641} \approx 1.4434.$

Probability:

$P (- 1.4434 < Z < 1.4434) = Φ (1.4434) - Φ (- 1.4434) = 2Φ (1.4434) - 1.$

$Φ (1.4434) \approx 0.925543$, so the probability is $\approx 0.8511$.

Answer: 0.8511 (approx.)
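Example 2 can be checked the same way by modelling $\hat{p}$ as approximately normal with mean p and standard error $\sqrt{p (1 - p) / n}$. This is a sketch using the standard library; the name `phat` is an illustrative choice.

```python
from math import sqrt
from statistics import NormalDist

p, n = 0.40, 200                       # values from Example 2

se = sqrt(p * (1 - p) / n)             # standard error of the sample proportion
phat = NormalDist(mu=p, sigma=se)      # normal approximation for p-hat

prob = phat.cdf(0.45) - phat.cdf(0.35) # P(0.35 < p-hat < 0.45)

print(round(se, 6), round(prob, 4))
```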

Example 3: Population standard deviation $σ = 3$. How large a sample n is needed so that $P (| \bar{X} - μ | < 0.5) = 0.95$?

Solution.

For two-sided 95% probability, the critical z is $z_{0.975} = 1.96$.
We want $P (| \bar{X} - μ | < ME) = 0.95$ with ME = 0.5. Use the formula:

$ME = z_{0.975} \frac{σ}{\sqrt{n}} \Rightarrow n = (\frac{z_{0.975} σ}{ME})^{2} .$

Plug numbers:

$n = (\frac{1.96 \times 3}{0.5})^{2} = (\frac{5.88}{0.5})^{2} = (11.76)^{2} \approx 138.2976.$

Round up to the next whole number: n = 139.

Answer: n = 139
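The sample-size calculation in Example 3 can be sketched in code as follows. Here the critical value is computed with `NormalDist().inv_cdf` rather than read from a table, so it is slightly more precise than 1.96; the variable names are illustrative.

```python
from math import ceil
from statistics import NormalDist

sigma, margin, conf = 3, 0.5, 0.95            # values from Example 3

z = NormalDist().inv_cdf(1 - (1 - conf) / 2)  # two-sided critical value, about 1.96
n_exact = (z * sigma / margin) ** 2           # solve ME = z * sigma / sqrt(n) for n
n = ceil(n_exact)                             # round up so the margin is not exceeded

print(round(z, 4), round(n_exact, 2), n)
```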

Example 4: Let $X_{1}, \dots, X_{100}$ be i.i.d. with mean 2 and variance 9. What is approximately $P (S_{100} > 230)$ where $S_{100} = \sum_{i = 1}^{100} X_{i}$?

Solution.

Mean of sum: $E [S_{100}] = 100 \times 2 = 200.$
Variance of sum: $Var (S_{100}) = 100 \times 9 = 900$ . Standard deviation: $σ_{s} = 30$ .
Convert: $Z = \frac{230 - 200}{30} = \frac{30}{30} = 1$ .
$P (S_{100} > 230) \approx P (Z > 1) \approx 0.1587$

Answer: 0.1587
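For sums, the normal approximation uses mean $n μ$ and standard deviation $\sqrt{n σ^{2}}$. A minimal sketch checking Example 4 under that approximation (the name `s_dist` is hypothetical):

```python
from math import sqrt
from statistics import NormalDist

n, mu, var = 100, 2, 9                            # values from Example 4

s_dist = NormalDist(mu=n * mu, sigma=sqrt(n * var))  # approximate law of the sum
p_upper = 1 - s_dist.cdf(230)                        # P(S_100 > 230)

print(s_dist.mean, s_dist.stdev, round(p_upper, 4))
```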

8.0 Practice Questions on Central Limit Theorem

Population mean $μ = 120$, $σ = 15$. Sample n = 49. Find $P (118 < \bar{X} < 124)$.

Answer hint: $SE = 15 / \sqrt{49} = 15/7 \approx 2.1429$

Compute the z-scores ($z \approx -0.933$ and $z \approx 1.867$), then the probabilities.

Final answer: ≈ 0.794.

A factory claims defect rate p = 0.05. Sample n = 500. What is the probability that the observed $\hat{p} > 0.07$?

Answer hint: $SE = \sqrt{0.05 \cdot 0.95 / 500} \approx 0.009747$

Compute $z \approx 2.05$ and the upper tail.

Final answer: ≈ 0.020.

For population sd $σ = 10$, how large must n be for 99% confidence and a margin of error of 2?

Answer hint: $z_{0.995} \approx 2.5758$. Compute $n = (z σ / ME)^{2}$.

Final answer: $n \approx (2.5758 \times 10/2)^{2} \approx 165.9 \Rightarrow 166$.

IID variables with mean 0.5 and variance 0.25, n = 400. Approximate $P (\sum X_{i} < 210)$.

Answer hint: Mean of the sum = 400 × 0.5 = 200, sd $= \sqrt{400 \times 0.25} = 10$, so z = (210 - 200)/10 = 1. The upper tail is ≈ 0.1587, so P(sum < 210) ≈ 0.8413.

The population is extremely skewed (exponential with mean 5). For sample size n = 10, is CLT safe to use for sample mean approx normal? For n = 100?

Answer: For n = 10 the approximation may be poor (the skew still shows). For n = 100 the CLT gives a good approximation.
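One way to see the difference between n = 10 and n = 100 is to compare an exact tail probability with the normal approximation. The sketch below relies on the fact that a sum of n i.i.d. exponential variables follows an Erlang distribution, whose upper tail has a closed form. The threshold (two standard errors above the mean) and the helper `erlang_upper_tail` are assumptions made for illustration.

```python
from math import exp, lgamma, log, sqrt
from statistics import NormalDist

def erlang_upper_tail(x, n, rate):
    """Exact P(S > x) when S is a sum of n i.i.d. Exponential(rate) variables."""
    a = rate * x
    # P(S > x) = sum_{k=0}^{n-1} e^{-a} a^k / k!, evaluated in log space
    return sum(exp(-a + k * log(a) - lgamma(k + 1)) for k in range(n))

MEAN = 5.0  # exponential population from the question: mean 5, so sd is also 5
Z = 2.0     # hypothetical threshold: two standard errors above the mean

normal_tail = 1 - NormalDist().cdf(Z)  # what the CLT predicts for any n
errors = {}
for n in (10, 100):
    threshold = MEAN + Z * MEAN / sqrt(n)  # cutoff for the sample mean
    # P(sample mean > threshold) equals P(sum > n * threshold)
    exact = erlang_upper_tail(n * threshold, n, 1 / MEAN)
    errors[n] = abs(exact - normal_tail)
    print(f"n={n}: exact={exact:.4f}, normal={normal_tail:.4f}")
```

The gap between the exact and approximate tails should be clearly smaller for n = 100 than for n = 10, which matches the answer above.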
