One-Way ANOVA
Foundations of Statistics
Comparing Multiple Group Means Simultaneously
One-way ANOVA tests whether three or more group means differ significantly, controlling the family-wise error rate. The F-statistic compares between-group variance to within-group variance, with post-hoc tests identifying which specific groups differ.
-
Pharmaceutical Research — Compare efficacy across multiple drug dosages
-
Agriculture — Test crop yield differences across fertilizer types
-
Education — Evaluate teaching effectiveness across multiple methods
The F-test asks whether any group stands out — post-hoc tests reveal which ones.
Motivation
When comparing group means, conducting all pairwise -tests inflates the Type I error rate. With groups, there are pairwise comparisons. If each is tested at , the family-wise error rate can exceed — far above the nominal 5%.
ANOVA (Analysis of Variance) tests all group means simultaneously with a single test, controlling the overall Type I error at .
The Statistical Model
DfOne-Way ANOVA Model
where:
-
is the overall (grand) mean
-
is the effect of group (with for identifiability)
-
i.i.d. — the within-group errors
-
is the number of groups, is the size of group
Hypotheses
ANOVA Hypotheses
Here,
- =Population mean of group i
- =Number of groups
states that all group means are equal (no treatment effect). is the composite alternative that at least one group mean differs from the others.
The ANOVA Decomposition
ThSum of Squares Decomposition
The total variability in the data decomposes into two components:
where:
Degrees of freedom: , , , where .
The F-Statistic
F-Statistic for One-Way ANOVA
Here,
- =Mean square between = SSB/(k-1)
- =Mean square within = SSW/(N-k) — estimates s²
- =F-statistic: ratio of between-group to within-group variability
ThDistribution of F Under $H_0$
Under (all means equal) and the normality assumption:
where is the -distribution with and degrees of freedom.
The intuition: if is true, MSB and MSW both estimate , so . If is false, MSB overestimates (it includes the treatment effect), so is large.
Effect Size: Eta-Squared
Eta-Squared
Here,
- =Proportion of total variance explained by group membership
| | Interpretation |
|----------|---------------|
| | Negligible effect |
| | Small effect |
| | Medium effect |
| | Large effect |
Assumptions
| Assumption | What It Means | How to Check |
|-----------|---------------|-------------|
| Independence | Observations are independent within and between groups | Study design (randomization) |
| Normality | Within each group, the errors are normally distributed | Shapiro–Wilk test; Q–Q plots |
| Homogeneity of variance | is constant across groups | Levene's test; Bartlett's test |
Robustness of ANOVA
ANOVA is robust to moderate violations of normality when group sizes are equal (or nearly equal), thanks to the CLT. However, it is sensitive to unequal variances when group sizes are unequal. If Levene's test is significant, use Welch's ANOVA instead.
Post-Hoc Tests
When is rejected, post-hoc tests determine which pairs of groups differ:
| Method | Controls | Best For |
|--------|----------|----------|
| Tukey HSD | Family-wise error rate | All pairwise comparisons |
| Bonferroni | Family-wise error rate | Small number of comparisons |
| Scheffé | Family-wise error rate (conservative) | All possible contrasts |
| Dunnett | Family-wise error rate | Comparisons with a control group |
| Games–Howell | Family-wise error rate | Unequal variances and group sizes |
Key Takeaways
Summary: One-Way ANOVA
-
ANOVA tests all means simultaneously — avoids inflated Type I error of multiple -tests
-
— large means between-group variation exceeds random noise
-
Sum of squares decomposition:
-
Assumptions: independence, normality within groups, equal variances
-
Post-hoc tests (Tukey, Bonferroni) identify which specific groups differ
-
tells you the proportion of total variance explained by group membership
-
Welch's ANOVA is an alternative when homoscedasticity is violated