Sampling Distribution of the Mean — Foundation of Inference


Why Sample Means Behave Predictably

The sampling distribution of the mean explains why means computed from repeated samples cluster predictably around the population mean, and it underpins confidence intervals and hypothesis tests about the mean. Understanding it is central to statistical inference.

Clinical Trials — Determining whether drug effects are real or due to sampling variation
Polling — Estimating margin of error for survey results
Manufacturing — Quality control through process monitoring and control charts

The sampling distribution transforms individual randomness into collective predictability.

Core Concepts

The sampling distribution of the mean describes how the sample mean $\bar{X}$ varies across all possible samples of size $n$.

Definition: Sampling Distribution of the Mean

The sampling distribution of $\bar{X}$ is the probability distribution of the statistic $\bar{X} = \frac{1}{n}\sum_{i=1}^{n}X_i$ computed over all possible samples of size $n$ from a population. It is the theoretical basis for confidence intervals and hypothesis tests.

Mean and Standard Error

$$E[\bar{X}] = \mu, \quad \text{SE}(\bar{X}) = \frac{\sigma}{\sqrt{n}}$$

Here,

$\mu$ = population mean
$\sigma$ = population standard deviation
$n$ = sample size
$\sigma/\sqrt{n}$ = standard error of the mean

Key Insight

The standard error is proportional to $1/\sqrt{n}$, so larger samples give more precise estimates of the mean, but with diminishing returns: halving the standard error requires quadrupling $n$.
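A short simulation can make the two formulas concrete. The sketch below assumes a normal population with illustrative parameters ($\mu = 50$, $\sigma = 10$, chosen only for this example) and uses NumPy; the function name `simulate_sample_means` is hypothetical.

```python
import numpy as np

def simulate_sample_means(mu, sigma, n, num_samples=20_000, seed=0):
    """Draw num_samples independent samples of size n and return their means."""
    rng = np.random.default_rng(seed)
    samples = rng.normal(loc=mu, scale=sigma, size=(num_samples, n))
    return samples.mean(axis=1)

mu, sigma = 50.0, 10.0  # illustrative population parameters (assumption)

for n in (4, 16, 64):
    means = simulate_sample_means(mu, sigma, n)
    empirical_se = means.std(ddof=1)      # spread of the simulated sample means
    theoretical_se = sigma / np.sqrt(n)   # sigma / sqrt(n)
    print(f"n={n:3d}  mean of means={means.mean():.3f}  "
          f"empirical SE={empirical_se:.3f}  theoretical SE={theoretical_se:.3f}")
```

The average of the simulated means should sit close to $\mu$, and the empirical spread should track $\sigma/\sqrt{n}$, halving each time $n$ is quadrupled.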

Central Limit Theorem

Theorem: Central Limit Theorem (CLT)

Let $X_1, X_2, \ldots, X_n$ be i.i.d. random variables drawn from any population with mean $\mu$ and finite variance $\sigma^2 > 0$. Then as $n \to \infty$:

$$\frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \xrightarrow{d} N(0, 1)$$

Equivalently, $\bar{X} \approx N\left(\mu, \frac{\sigma^2}{n}\right)$ for large $n$, regardless of the shape of the population distribution, provided its variance is finite.
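To see the theorem at work on a clearly non-normal population, the sketch below standardizes sample means drawn from an Exponential(1) population (mean 1, standard deviation 1, strongly right-skewed). The population choice, the helper names `standardized_means` and `skewness`, and the sample sizes are assumptions made for illustration.

```python
import numpy as np

def standardized_means(n, num_samples=20_000, seed=1):
    """Return (Xbar - mu) / (sigma / sqrt(n)) for Exponential(1) samples of size n."""
    rng = np.random.default_rng(seed)
    x = rng.exponential(scale=1.0, size=(num_samples, n))  # mu = 1, sigma = 1
    return (x.mean(axis=1) - 1.0) / (1.0 / np.sqrt(n))

def skewness(z):
    """Sample skewness; it is 0 for a symmetric distribution such as N(0, 1)."""
    z = np.asarray(z, dtype=float)
    return np.mean((z - z.mean()) ** 3) / z.std() ** 3

for n in (2, 10, 50, 200):
    z = standardized_means(n)
    within = np.mean(np.abs(z) <= 1.96)  # close to 0.95 under N(0, 1)
    print(f"n={n:3d}  skewness={skewness(z):.3f}  P(|Z| <= 1.96)={within:.3f}")
```

As $n$ grows, the skewness of the standardized means shrinks toward 0 and the fraction falling within $\pm 1.96$ approaches 0.95, as the normal approximation predicts.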

Proof Sketch (Lindeberg–Lévy CLT)

Step 1. Assume $E[X_i] = \mu$ and $\text{Var}(X_i) = \sigma^2 < \infty$ . Define $Z_i = (X_i - \mu)/\sigma$ , so $E[Z_i] = 0$ and $\text{Var}(Z_i) = 1$ .

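Step 2. Assume in addition that the moment generating function $M_Z(s) = E[e^{sZ_i}]$ exists in a neighborhood of $0$ (the general case runs the same argument with characteristic functions). Since $\sqrt{n}\,\bar{Z} = \frac{1}{\sqrt{n}}\sum Z_i$ and the $Z_i$ are i.i.d., the MGF of $\sqrt{n}\,\bar{Z}$ is $M_{\sqrt{n}\bar{Z}}(t) = \left[M_Z(t/\sqrt{n})\right]^n$.

Step 3. Taylor-expand: $M_Z(s) = 1 + s^2/2 + o(s^2)$ as $s \to 0$, using $E[Z_i] = 0$ and $E[Z_i^2] = 1$. Thus $M_{\sqrt{n}\bar{Z}}(t) = \left[1 + \frac{t^2}{2n} + o(1/n)\right]^n \to e^{t^2/2}$.

Step 4. Since $e^{t^2/2}$ is the MGF of $N(0,1)$, the continuity theorem for MGFs (Lévy's continuity theorem in the characteristic-function version) gives $\sqrt{n}\,\bar{Z} \xrightarrow{d} N(0,1)$. Because $\sqrt{n}\,\bar{Z} = (\bar{X} - \mu)/(\sigma/\sqrt{n})$, this is the statement of the theorem.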
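The limit at the heart of the sketch, $\left[M_Z(t/\sqrt{n})\right]^n \to e^{t^2/2}$, can be checked numerically for one concrete standardized variable. The example below assumes $Z_i = \pm 1$ with probability $1/2$ each (mean 0, variance 1), whose MGF is $M_Z(s) = \cosh(s)$; this choice and the function name `mgf_scaled_sum` are illustrative, not part of the proof.

```python
import math

def mgf_scaled_sum(t, n):
    """MGF at t of (Z_1 + ... + Z_n) / sqrt(n), where each Z_i is +1 or -1 with prob 1/2."""
    # For this Z, M_Z(s) = cosh(s); independence turns the sum into an n-th power.
    return math.cosh(t / math.sqrt(n)) ** n

t = 1.0
target = math.exp(t ** 2 / 2)  # MGF of N(0, 1) evaluated at t

for n in (1, 10, 100, 10_000):
    value = mgf_scaled_sum(t, n)
    print(f"n={n:6d}  MGF={value:.6f}  target={target:.6f}  gap={target - value:.2e}")
```

The gap to $e^{t^2/2}$ shrinks steadily as $n$ increases, which is the convergence used in Step 3.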

Rule of Thumb

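A common rule of thumb treats the CLT approximation as adequate when $n \geq 30$, though this is a convention rather than a guarantee. For highly skewed or heavy-tailed populations, larger $n$ may be needed. The Berry–Esseen theorem quantifies the rate: if $\rho = E[|X-\mu|^3] < \infty$, then $\sup_x |F_n(x) - \Phi(x)| \leq \frac{C\,\rho}{\sigma^3 \sqrt{n}}$, where $F_n$ is the CDF of the standardized sample mean, $\Phi$ is the standard normal CDF, and $C$ is an absolute constant.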
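To get a feel for the size of the Berry–Esseen bound, the sketch below evaluates it for an Exponential(1) population, where $\sigma = 1$ and $\rho = E|X - 1|^3 = 12/e - 2$. The default constant `C=0.4748` is an assumption: it is a commonly quoted published upper bound for the i.i.d. case, not a value stated in this article, and the function name `berry_esseen_bound` is hypothetical.

```python
import math

def berry_esseen_bound(rho, sigma, n, C=0.4748):
    """Upper bound on sup_x |F_n(x) - Phi(x)| for the standardized sample mean.

    C is an assumed value for the absolute constant (see the lead-in).
    """
    return C * rho / (sigma ** 3 * math.sqrt(n))

# Exponential(1) population: sigma = 1, rho = E|X - 1|^3 = 12/e - 2 (about 2.41)
sigma = 1.0
rho = 12 / math.e - 2

for n in (30, 100, 1000):
    print(f"n={n:5d}  bound={berry_esseen_bound(rho, sigma, n):.4f}")
```

The bound is a worst-case guarantee that shrinks like $1/\sqrt{n}$; the actual approximation error for a given population is often considerably smaller.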

Formal Properties of $\bar{X}$

Theorem: Unbiasedness and Minimum Variance

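The sample mean $\bar{X}$ is an unbiased estimator of $\mu$: $E[\bar{X}] = \mu$. Moreover, when the $X_i$ are i.i.d. with finite variance, $\bar{X}$ has the minimum variance among all linear unbiased estimators of $\mu$; that is, it is the best linear unbiased estimator (the Gauss–Markov theorem specialized to the i.i.d. case).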

Proof Sketch

Unbiasedness: $E[\bar{X}] = E\left[\frac{1}{n}\sum X_i\right] = \frac{1}{n}\sum E[X_i] = \frac{n\mu}{n} = \mu$ .

Variance: $\text{Var}(\bar{X}) = \frac{1}{n^2}\sum \text{Var}(X_i) = \frac{n\sigma^2}{n^2} = \frac{\sigma^2}{n}$ by independence. Any other linear unbiased estimator $\sum a_i X_i$ (unbiasedness requires $\sum a_i = 1$) has variance $\sigma^2 \sum a_i^2$. By Cauchy–Schwarz, $1 = \left(\sum a_i\right)^2 \leq n \sum a_i^2$, so $\sigma^2 \sum a_i^2 \geq \sigma^2/n$, with equality iff all $a_i = 1/n$.
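The minimum-variance claim can be illustrated by comparing the variance formula $\sigma^2 \sum a_i^2$ for equal weights against an arbitrary set of weights that also sum to 1. The values of `n` and `sigma`, the randomly generated unequal weights, and the function name `variance_of_linear_estimator` are all assumptions for this sketch.

```python
import numpy as np

def variance_of_linear_estimator(weights, sigma):
    """Variance of sum(a_i * X_i) for independent X_i sharing variance sigma^2."""
    a = np.asarray(weights, dtype=float)
    if not np.isclose(a.sum(), 1.0):
        raise ValueError("weights must sum to 1 for the estimator to be unbiased")
    return sigma ** 2 * np.sum(a ** 2)

n, sigma = 10, 3.0  # illustrative values (assumption)

equal = np.full(n, 1.0 / n)            # the sample mean: a_i = 1/n
rng = np.random.default_rng(2)
raw = rng.uniform(0.1, 1.0, size=n)
unequal = raw / raw.sum()              # some other weights summing to 1

var_equal = variance_of_linear_estimator(equal, sigma)
var_unequal = variance_of_linear_estimator(unequal, sigma)
print(f"equal weights:   {var_equal:.4f}  (sigma^2 / n = {sigma**2 / n:.4f})")
print(f"unequal weights: {var_unequal:.4f}")
```

Any weight vector other than $a_i = 1/n$ gives a strictly larger variance, in line with the Cauchy–Schwarz argument.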

Worked Example

Suppose the heights of adult males in a city are normally distributed with $\mu = 175$ cm and $\sigma = 8$ cm. A researcher samples $n = 64$ men.

Step 1. Because the population itself is normal, the sampling distribution of $\bar{X}$ is exactly normal for any $n$:

$$\bar{X} \sim N\left(175, \frac{8^2}{64}\right) = N(175, 1)$$

Step 2. The standard error is $\text{SE}(\bar{X}) = 8/\sqrt{64} = 1$ cm.

Step 3. Probability the sample mean exceeds 177 cm:

$$P(\bar{X} > 177) = P\left(Z > \frac{177 - 175}{1}\right) = P(Z > 2) \approx 0.0228$$

Step 4. Even though individual heights have $\sigma = 8$ , the sample mean of 64 observations has $\text{SE} = 1$ . The sampling distribution is 8 times narrower than the population distribution — a direct consequence of averaging.
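The arithmetic in this example can be reproduced with Python's standard library. The sketch below uses `statistics.NormalDist` and the values from the example; the variable names are illustrative.

```python
from math import sqrt
from statistics import NormalDist

mu, sigma, n = 175.0, 8.0, 64   # values from the worked example

se = sigma / sqrt(n)                          # standard error: 8 / 8 = 1 cm
sampling_dist = NormalDist(mu=mu, sigma=se)   # Xbar ~ N(175, 1)

# P(Xbar > 177) is the upper tail beyond 2 standard errors
p_exceeds = 1.0 - sampling_dist.cdf(177.0)

print(f"SE = {se:.1f} cm")
print(f"P(Xbar > 177) = {p_exceeds:.4f}")
print(f"population sd / SE = {sigma / se:.0f}")
```

Changing `n` in this snippet shows how quickly the tail probability falls as the sample grows, since the same 2 cm deviation then corresponds to more standard errors.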

Key Takeaways

Summary: Sampling Distribution of the Mean

Describes variability of $\bar{X}$ across all samples of size $n$
Mean: $E[\bar{X}] = \mu$ , Standard Error: $\text{SE} = \sigma/\sqrt{n}$
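CLT: $\bar{X} \approx N(\mu, \sigma^2/n)$ for large $n$ (rule of thumb: $n \geq 30$, more for skewed or heavy-tailed populations)
Standard error is proportional to $1/\sqrt{n}$: larger samples are more precise, and quadrupling $n$ halves the standard error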
$\bar{X}$ is the UMVUE (uniformly minimum variance unbiased estimator) of $\mu$ under normality
Foundation for all confidence intervals and hypothesis tests about the mean
