F-Distribution — Ratio of Variances
Foundations of Statistics
The Engine Behind ANOVA and F-Tests
The F-distribution emerges as the ratio of two chi-square variables, making it the backbone of analysis of variance and equality-of-variance tests. Its skewed shape reflects the ratio's non-negative nature.
- Agriculture — Comparing crop yields across multiple fertilizer treatments
- Psychology — Analyzing variance in experimental designs with multiple groups
- Engineering — Testing whether manufacturing processes produce consistent results
The F-distribution turns multiple group comparisons into a single elegant test.
Core Concepts
The F-distribution arises as the ratio of two independent chi-square random variables, each divided by its degrees of freedom. It is the basis for ANOVA and F-tests.
DfF-Distribution
If and are independent, then follows an F-distribution with (numerator) and (denominator) degrees of freedom, written .
F-Statistic
Here,
- =Sample variances from two populations
- =Degrees of freedom for numerator and denominator
- =F-statistic (ratio of variances)
Key Properties
- Always positive and right-skewed
- is related to the reciprocal:
- As degrees of freedom increase, F approaches 1
- The mode (for ) is at
Interactive Visualization
Mean, Variance, and Moments
F-Distribution Mean
Here,
- =Numerator degrees of freedom
- =Denominator degrees of freedom
Variance and Higher Moments
The variance exists only when . The F-distribution is always right-skewed, with skewness decreasing as both degrees of freedom increase.
When , , which approaches 1 as . This makes sense: if both variances estimate the same , their ratio should be near 1.
Derivation: Why the F-Distribution Appears
ThDistribution of the Variance Ratio
If and independently, then:
Proof Sketch
Step 1. By the chi-square result: and .
Step 2. Since the samples are independent, and are independent.
Step 3. Therefore by definition.
The key requirements are: (1) normal populations, (2) equal variances, and (3) independent samples.
ANOVA Connection
ThF-Test in One-Way ANOVA
In one-way ANOVA with groups and total observations, define:
Under :
Proof Sketch
Under , all observations come from . The numerator is a chi-square variable divided by , and is a chi-square variable divided by , and they are independent by Cochran's theorem. Their ratio follows .
Worked Example
Two methods for measuring blood glucose are compared. Method A () gives ; Method B () gives . Test at .
Step 1. Compute the F-statistic:
Step 2. Under , . The upper critical value is .
Step 3. Since , we fail to reject . There is insufficient evidence that the variances differ.
Step 4. Note the asymmetry: for a two-sided test, we could also consider . We check whether or . Since , we fail to reject.
F-Test Sensitivity
The F-test for equal variances is highly sensitive to non-normality. The Levene test or Bartlett test should be preferred in practice. If the populations are not normal, the F-test can have severely inflated Type I error rates.
Key Takeaways
Summary: F-Distribution
- Ratio of two independent chi-squares, each divided by its df:
- Always positive and right-skewed; for
- Used in ANOVA (comparing group means) and F-tests (comparing variances)
- is the reciprocal distribution of
- Two-sample F-test for equality of variances:
- Requires normality and independence — sensitive to violations