🎉 75% of content is free forever — Unlock Premium from $10/mo →
CW
Search courses…
💼 Servicesℹ️ About✉️ ContactView Pricing Plansfrom $10

t-Distribution — When σ is Unknown

Foundations of StatisticsSampling Distributions🟢 Free Lesson

Advertisement

t-Distribution — When σ is Unknown

Foundations of Statistics

The Real-World Workhorse for Means

The t-distribution accounts for the extra uncertainty when estimating σ with s, making it the standard for real-world mean comparisons. Its heavier tails provide more conservative inference than the normal distribution.

  • Quality Control — Comparing process means when population variance is unknown
  • Clinical Research — Testing treatment effects with small sample sizes
  • Business Analytics — A/B testing with limited data to make faster decisions

When σ is unknown, the t-distribution is your trusted companion.


Core Concepts

The t-distribution arises when we estimate the population standard deviation σ\sigma with the sample standard deviation ss. It has heavier tails than the normal, reflecting additional uncertainty from estimating σ\sigma.

Dft-Distribution

Let ZN(0,1)Z \sim N(0,1) and Vχν2V \sim \chi^2_\nu be independent. Then T=ZV/νT = \frac{Z}{\sqrt{V/\nu}} follows a t-distribution with ν\nu degrees of freedom, written TtνT \sim t_\nu.

PDF of t-Distribution

f(t)=Γ(ν+12)νπΓ(ν2)(1+t2ν)(ν+1)/2f(t) = \frac{\Gamma\left(\frac{\nu+1}{2}\right)}{\sqrt{\nu\pi}\,\Gamma\left(\frac{\nu}{2}\right)} \left(1 + \frac{t^2}{\nu}\right)^{-(\nu+1)/2}

Here,

  • ν\nu=Degrees of freedom
  • Γ\Gamma=Gamma function

Heavy Tails

The t-distribution has heavier tails than the normal, meaning more probability in the extremes. This reflects the additional uncertainty from estimating σ\sigma. As ν\nu \to \infty, the t-distribution approaches N(0,1)N(0,1).


Interactive Visualization

t-Distribution — Interactive Explorer
-6-4.3-2.6-0.90.92.64.36t00.090.170.260.350.44f(t)μ = 0.00t(df = 5)
Mean (μ) = 0.0000Var = 1.6667σ = 1.2910
t-Distribution vs Normal — Heavy Tails
-6-4.3-2.6-0.90.92.64.36t00.090.180.280.370.46f(t)μ = 0.00t(df = 5)
Mean (μ) = 0.0000Var = 1.6667σ = 1.2910

Derivation: Why the t-Distribution Appears

ThOrigin of the t-Statistic

If X1,,Xni.i.d.N(μ,σ2)X_1, \ldots, X_n \overset{\text{i.i.d.}}{\sim} N(\mu, \sigma^2), then:

T=Xˉμs/ntn1T = \frac{\bar{X} - \mu}{s/\sqrt{n}} \sim t_{n-1}

where s2=1n1(XiXˉ)2s^2 = \frac{1}{n-1}\sum(X_i - \bar{X})^2.

Proof Sketch

Step 1. Define Z=Xˉμσ/nN(0,1)Z = \frac{\bar{X} - \mu}{\sigma/\sqrt{n}} \sim N(0,1) by properties of the normal.

Step 2. By Fisher's lemma, Xˉ\bar{X} and s2s^2 are independent for normal samples. Moreover, (n1)s2σ2χn12\frac{(n-1)s^2}{\sigma^2} \sim \chi^2_{n-1}.

Step 3. Therefore T=Zχn12/(n1)T = \frac{Z}{\sqrt{\chi^2_{n-1}/(n-1)}}, which is the definition of tn1t_{n-1}.

The independence of Xˉ\bar{X} and s2s^2 is specific to the normal distribution — it fails for other distributions, which is why the t-test is not robust to non-normality for small nn.


Degrees of Freedom and Tail Behavior

t-Statistic

t=Xˉμs/n,ν=n1t = \frac{\bar{X} - \mu}{s/\sqrt{n}}, \quad \nu = n - 1

Here,

  • Xˉ\bar{X}=Sample mean
  • μ\mu=Hypothesized population mean
  • ss=Sample standard deviation
  • nn=Sample size
  • ν=n1\nu = n-1=Degrees of freedom

Why Degrees of Freedom Matter

With ν\nu degrees of freedom, the estimator s2s^2 uses n1n-1 independent pieces of information (one is lost estimating μ\mu). Fewer degrees of freedom means more uncertainty about σ\sigma, hence heavier tails. The variance of tνt_\nu is νν2\frac{\nu}{\nu-2} for ν>2\nu > 2, which exceeds 1 (the normal variance) and decreases to 1 as ν\nu \to \infty.


Critical Values

Common t-Critical Values

ν\nut0.025t_{0.025} (95%)t0.005t_{0.005} (99%)z0.025z_{0.025} (normal)
52.5714.0321.960
102.2283.1691.960
292.0452.7561.960
1001.9842.6261.960
\infty1.9602.5761.960

As ν\nu increases, t-critical values converge to z-critical values. The difference is substantial for small ν\nu.


Worked Example

A biochemist measures enzyme reaction rates (in μmol/min) for n=16n = 16 samples: xˉ=42.3\bar{x} = 42.3, s=5.8s = 5.8. Test H0:μ=40H_0: \mu = 40 vs Ha:μ40H_a: \mu \neq 40 at α=0.05\alpha = 0.05.

Step 1. Compute the t-statistic:

t=xˉμ0s/n=42.3405.8/16=2.31.45=1.586t = \frac{\bar{x} - \mu_0}{s/\sqrt{n}} = \frac{42.3 - 40}{5.8/\sqrt{16}} = \frac{2.3}{1.45} = 1.586

Step 2. With ν=15\nu = 15 degrees of freedom, the critical values are t0.025,15=2.131t_{0.025, 15} = 2.131.

Step 3. Since t=1.586<2.131|t| = 1.586 < 2.131, we fail to reject H0H_0. The observed difference is not statistically significant at the 5% level.

Step 4. For comparison, if we had used the normal approximation: z=1.586z = 1.586 with critical value 1.9601.960. We would still fail to reject, but the normal approximation underestimates the tail probability. The exact p-value from t15t_{15} is 0.133, while the normal approximation gives 0.113.

Small Sample Consequence

With n=16n = 16, the t-distribution is substantially wider than the normal. Using the normal approximation would underestimate the p-value by about 15% in this case. Always use the t-distribution when σ\sigma is unknown and nn is small.


Convergence to Normal

ThAsymptotic Normality of t

As ν\nu \to \infty, tνdN(0,1)t_\nu \xrightarrow{d} N(0,1). More precisely, by Slutsky's theorem:

Xˉμs/n=(Xˉμ)/(σ/n)s/σdZ1=Z\frac{\bar{X} - \mu}{s/\sqrt{n}} = \frac{(\bar{X} - \mu)/(\sigma/\sqrt{n})}{s/\sigma} \xrightarrow{d} \frac{Z}{1} = Z

since spσs \xrightarrow{p} \sigma by the law of large numbers.


Key Takeaways

Summary: t-Distribution

  • Used when σ\sigma is unknown and estimated by ss
  • t=(Xˉμ)/(s/n)tn1t = (\bar{X} - \mu)/(s/\sqrt{n}) \sim t_{n-1} for normal populations
  • Heavier tails than normal (more uncertainty); approaches normal as ν\nu \to \infty
  • Degrees of freedom: ν=n1\nu = n - 1 (one lost estimating μ\mu)
  • The variance of tνt_\nu is ν/(ν2)\nu/(\nu-2) for ν>2\nu > 2, always >1> 1
  • Foundation for t-tests and t-intervals for the mean
  • Derived from the independence of Xˉ\bar{X} and s2s^2 under normality

Premium Content

t-Distribution — When σ is Unknown

Unlock this lesson and 900+ advanced tutorials with a Premium plan.

🎯End-to-end Projects
💼Interview Prep
📜Certificates
🤝Community Access

Already a member? Log in

Need Expert Statistics Help?

Get personalized tutoring, project support, or professional consulting.

Advertisement