Multivariate Analysis of Variance (MANOVA)

Advanced Statistical Methods

Comparing Groups Across Multiple Outcomes Simultaneously

MANOVA extends ANOVA to multiple dependent variables, testing whether group means differ across a vector of outcomes while accounting for correlations among them. Wilks' Lambda and Pillai's Trace are key test statistics.

Psychology — Compare treatment groups on multiple behavioral outcomes simultaneously
Education — Assess whether teaching methods differ across several performance measures
Clinical research — Evaluate treatment effects on correlated biomarkers and symptoms

MANOVA tests the right question: do groups differ when all outcomes are considered together?

Multivariate Analysis of Variance (MANOVA) extends the univariate ANOVA framework to situations where multiple dependent variables are measured simultaneously. MANOVA tests whether group means differ across multiple correlated outcome variables, controlling the overall Type I error rate while exploiting the correlation structure among dependent variables. It can provide greater statistical power than separate ANOVAs when the dependent variables are correlated and the groups differ on a combination of outcomes rather than on any single one.

Mathematical Foundation

MANOVA Model

The MANOVA model for $g$ groups and $p$ dependent variables is:

\mathbf{Y}_{ij} = \boldsymbol{\mu} + \boldsymbol{\tau}_j + \boldsymbol{\varepsilon}_{ij}

where:

$\mathbf{Y}_{ij}$ is the $p \times 1$ vector of observations for subject $i$ in group $j$
$\boldsymbol{\mu}$ is the $p \times 1$ vector of overall means
$\boldsymbol{\tau}_j$ is the $p \times 1$ vector of treatment effects for group $j$
$\boldsymbol{\varepsilon}_{ij} \sim N_p(\mathbf{0}, \boldsymbol{\Sigma})$ are independent error vectors

Assumptions:

Multivariate normality: $\mathbf{Y}_{ij} \sim N_p(\boldsymbol{\mu} + \boldsymbol{\tau}_j, \boldsymbol{\Sigma})$
Homogeneity of covariance matrices: $\boldsymbol{\Sigma}_1 = \boldsymbol{\Sigma}_2 = \cdots = \boldsymbol{\Sigma}_g = \boldsymbol{\Sigma}$
Independence of observations
No perfect multicollinearity among dependent variables

Hypothesis in MANOVA

The null and alternative hypotheses in matrix form:

H_0: \boldsymbol{\tau}_1 = \boldsymbol{\tau}_2 = \cdots = \boldsymbol{\tau}_g = \mathbf{0}

H_1: \text{At least one } \boldsymbol{\tau}_j \neq \mathbf{0}

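The test statistics are based on two $p \times p$ sums-of-squares-and-cross-products matrices, where $\bar{\mathbf{Y}}_j$ is the mean vector of group $j$, $\bar{\mathbf{Y}}$ is the grand mean vector, and $n_j$ is the size of group $j$: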

H (Hypothesis/Between-groups matrix): $\mathbf{H} = \sum_{j=1}^{g} n_j (\bar{\mathbf{Y}}_j - \bar{\mathbf{Y}})(\bar{\mathbf{Y}}_j - \bar{\mathbf{Y}})^T$
E (Error/Within-groups matrix): $\mathbf{E} = \sum_{j=1}^{g} \sum_{i=1}^{n_j} (\mathbf{Y}_{ij} - \bar{\mathbf{Y}}_j)(\mathbf{Y}_{ij} - \bar{\mathbf{Y}}_j)^T$
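
As a minimal sketch of these two definitions, the NumPy snippet below builds $\mathbf{H}$ and $\mathbf{E}$ for a small made-up data set and checks the identity $\mathbf{T} = \mathbf{H} + \mathbf{E}$, where $\mathbf{T}$ is the total sums-of-squares-and-cross-products matrix about the grand mean. The toy numbers and the helper name `sscp_matrices` are illustrative assumptions, not part of the example that follows later.

```python
import numpy as np

def sscp_matrices(Y, groups):
    """Return (H, E) for an n x p response matrix Y and length-n group labels."""
    Y = np.asarray(Y, dtype=float)
    groups = np.asarray(groups)
    grand_mean = Y.mean(axis=0)
    p = Y.shape[1]
    H = np.zeros((p, p))
    E = np.zeros((p, p))
    for label in np.unique(groups):
        Yj = Y[groups == label]
        # Between-groups part: n_j * (group mean - grand mean) outer product
        d = Yj.mean(axis=0) - grand_mean
        H += len(Yj) * np.outer(d, d)
        # Within-groups part: cross-products of deviations from the group mean
        R = Yj - Yj.mean(axis=0)
        E += R.T @ R
    return H, E

# Hypothetical toy data: 3 groups of 3 subjects, 2 outcomes
Y = np.array([[4.0, 2.0], [5.0, 3.0], [6.0, 4.0],
              [7.0, 5.0], [8.0, 5.0], [9.0, 8.0],
              [2.0, 1.0], [3.0, 1.0], [4.0, 4.0]])
groups = np.array([1, 1, 1, 2, 2, 2, 3, 3, 3])

H, E = sscp_matrices(Y, groups)

# Total SSCP matrix about the grand mean
centered = Y - Y.mean(axis=0)
T = centered.T @ centered
print("H + E equals T:", np.allclose(H + E, T))
```

The decomposition is the multivariate analogue of $SS_T = SS_B + SS_W$ in one-way ANOVA; the diagonal entries of $\mathbf{H}$ and $\mathbf{E}$ are exactly those univariate sums of squares.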

Test Statistics

MANOVA test statistics are functions of the eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$. Let $s = \min(p, g-1)$ and let $\lambda_1 \geq \lambda_2 \geq \cdots \geq \lambda_s$ be the nonzero eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$ (at most $s$ are nonzero because $\mathbf{H}$ has rank at most $s$).

Wilks' Lambda ($\Lambda$)

Wilks' Lambda is the most commonly used MANOVA statistic:

\Lambda = \frac{|\mathbf{E}|}{|\mathbf{H} + \mathbf{E}|} = \prod_{i=1}^{s} \frac{1}{1 + \lambda_i}

where $|\cdot|$ denotes the determinant. $\Lambda \in [0, 1]$, with smaller values indicating greater group separation (stronger evidence against $H_0$).

Exact and Approximate Distributions of Wilks' Lambda

Exact distributions:

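$p = 1$: $\Lambda = SS_E / SS_T$, and $\frac{1 - \Lambda}{\Lambda} \cdot \frac{n - g}{g - 1}$ is the standard one-way ANOVA $F$ statistic with $g-1$ and $n-g$ df, where $n = \sum_j n_j$ is the total sample size
$p = 2$: $\frac{1 - \sqrt{\Lambda}}{\sqrt{\Lambda}} \cdot \frac{n - g - 1}{g - 1}$ follows an F-distribution with $2(g-1)$ and $2(n-g-1)$ df
$g = 2$ (so $s = 1$): $\frac{1 - \Lambda}{\Lambda} \cdot \frac{n - p - 1}{p}$ follows an F-distribution with $p$ and $n-p-1$ df (equivalent to Hotelling's $T^2$)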

Rao's approximation (general case):

F_{\text{approx}} = \frac{1 - \Lambda^{1/t}}{\Lambda^{1/t}} \cdot \frac{df_2}{df_1}

where $t = \sqrt{\frac{p^2(g-1)^2 - 4}{p^2 + (g-1)^2 - 5}}$ (with $t = 1$ when $p^2 + (g-1)^2 - 5 = 0$), $df_1 = p(g-1)$, $df_2 = wt - \frac{p(g-1) - 2}{2}$, and $w = n - 1 - \frac{p + g}{2}$. The approximation is exact when $p \leq 2$ or $g \leq 3$.
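
The conversion from $\Lambda$ to an $F$ statistic can be sketched in a few lines. The function below assumes the standard form of Rao's approximation, $df_2 = wt - (p(g-1) - 2)/2$ with $w = n - 1 - (p+g)/2$, and sets $t = 1$ when the ratio defining $t$ is undefined. The function name and the sample size $n = 90$ are illustrative assumptions; they are not taken from the example in this article.

```python
import numpy as np
from scipy import stats

def rao_f_from_wilks(wilks_lambda, p, g, n):
    """Rao's F approximation for Wilks' Lambda in a one-way MANOVA.

    p: number of dependent variables, g: number of groups, n: total sample size.
    """
    q = g - 1                                  # hypothesis degrees of freedom
    denom = p**2 + q**2 - 5
    # t is undefined (0/0) when denom == 0 and equals 1 when p = q = 1
    t = np.sqrt((p**2 * q**2 - 4) / denom) if denom > 0 else 1.0
    w = n - 1 - (p + g) / 2
    df1 = p * q
    df2 = w * t - (p * q - 2) / 2
    root = wilks_lambda ** (1 / t)
    F = (1 - root) / root * df2 / df1
    return F, df1, df2, stats.f.sf(F, df1, df2)

# Hypothetical inputs: Lambda = 0.470 with p = 2 outcomes, g = 3 groups, n = 90
F, df1, df2, p_value = rao_f_from_wilks(0.470, p=2, g=3, n=90)
print(f"F({df1}, {df2:.0f}) = {F:.3f}, p = {p_value:.2e}")
```

With $p = 2$ the result coincides with the exact transformation $\frac{1 - \sqrt{\Lambda}}{\sqrt{\Lambda}} \cdot \frac{n-g-1}{g-1}$, which is a convenient sanity check on the degrees of freedom.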

Pillai's Trace ($V$)

Pillai's Trace is:

V = \text{tr}[\mathbf{H}(\mathbf{H} + \mathbf{E})^{-1}] = \sum_{i=1}^{s} \frac{\lambda_i}{1 + \lambda_i}

Pillai's Trace is more robust than Wilks' Lambda when assumptions are violated, particularly with unequal group sizes or non-normality. $V \in [0, s]$, because each of the $s$ terms $\lambda_i/(1+\lambda_i)$ lies between 0 and 1; larger values indicate greater group differences.

Hotelling-Lawley Trace ($T_0$)

The Hotelling-Lawley Trace is:

T_0 = \text{tr}(\mathbf{E}^{-1}\mathbf{H}) = \sum_{i=1}^{s} \lambda_i

This statistic is the sum of the eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$ and has no upper bound. It tends to be most powerful when group differences are large and covariance matrices are equal.

Roy's Largest Root ($\theta$)

Roy's Largest Root uses only the largest eigenvalue:

\theta = \frac{\lambda_1}{1 + \lambda_1}

This is the most powerful statistic when group differences occur primarily along a single discriminant dimension. However, it is the least robust to assumption violations and should be used cautiously.

Computing MANOVA Test Statistics

Problem: A study compares three treatments on two dependent variables ($Y_1$: anxiety, $Y_2$: depression). Given:

\mathbf{H} = \begin{bmatrix} 24.5 & 8.2 \\ 8.2 & 15.3 \end{bmatrix}, \quad \mathbf{E} = \begin{bmatrix} 42.1 & 5.7 \\ 5.7 & 38.9 \end{bmatrix}

Solution:

Step 1: Compute $\mathbf{H} + \mathbf{E}$ :

\mathbf{H} + \mathbf{E} = \begin{bmatrix} 66.6 & 13.9 \\ 13.9 & 54.2 \end{bmatrix}

Step 2: Compute determinants:

|\mathbf{E}| = (42.1)(38.9) - (5.7)^2 = 1637.69 - 32.49 = 1605.20

|\mathbf{H} + \mathbf{E}| = (66.6)(54.2) - (13.9)^2 = 3609.72 - 193.21 = 3416.51

Step 3: Wilks' Lambda:

\Lambda = \frac{|\mathbf{E}|}{|\mathbf{H} + \mathbf{E}|} = \frac{1605.20}{3416.51} = 0.470

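Step 4: Compute eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$:

\mathbf{E}^{-1} = \frac{1}{1605.20}\begin{bmatrix} 38.9 & -5.7 \\ -5.7 & 42.1 \end{bmatrix} = \begin{bmatrix} 0.02423 & -0.00355 \\ -0.00355 & 0.02623 \end{bmatrix}

\mathbf{E}^{-1}\mathbf{H} = \frac{1}{1605.20}\begin{bmatrix} 38.9 & -5.7 \\ -5.7 & 42.1 \end{bmatrix}\begin{bmatrix} 24.5 & 8.2 \\ 8.2 & 15.3 \end{bmatrix} = \begin{bmatrix} 0.5646 & 0.1444 \\ 0.1281 & 0.3722 \end{bmatrix}

The characteristic equation is $\lambda^2 - \text{tr}(\mathbf{E}^{-1}\mathbf{H})\,\lambda + |\mathbf{E}^{-1}\mathbf{H}| = 0$, with $\text{tr}(\mathbf{E}^{-1}\mathbf{H}) = 0.93677$ and $|\mathbf{E}^{-1}\mathbf{H}| = |\mathbf{H}|/|\mathbf{E}| = 307.61/1605.20 = 0.19163$, so

\lambda = \frac{0.93677 \pm \sqrt{0.93677^2 - 4(0.19163)}}{2} = \frac{0.93677 \pm 0.3332}{2}

which yields eigenvalues $\lambda_1 = 0.635$, $\lambda_2 = 0.302$.

Step 5: Compute all statistics:

\Lambda = \frac{1}{(1+0.635)(1+0.302)} = \frac{1}{2.129} = 0.470

(this agrees with the determinant form in Step 3)

V = \frac{0.635}{1.635} + \frac{0.302}{1.302} = 0.388 + 0.232 = 0.620

T_0 = 0.635 + 0.302 = 0.937

\theta = \frac{0.635}{1.635} = 0.388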
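
A hand computation like this is easy to check numerically. The short NumPy sketch below takes $\mathbf{H}$ and $\mathbf{E}$ from the problem statement, computes the eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$, and evaluates the four statistics; Wilks' Lambda is computed both from the eigenvalues and from the determinant ratio, and the two routes should agree. The variable names are illustrative.

```python
import numpy as np

# H and E as given in the problem statement
H = np.array([[24.5, 8.2], [8.2, 15.3]])
E = np.array([[42.1, 5.7], [5.7, 38.9]])

# Eigenvalues of E^{-1} H, sorted from largest to smallest;
# solve(E, H) avoids forming the inverse explicitly
lam = np.sort(np.linalg.eigvals(np.linalg.solve(E, H)).real)[::-1]

wilks_eig = np.prod(1 / (1 + lam))                     # product form
wilks_det = np.linalg.det(E) / np.linalg.det(H + E)    # determinant form
pillai = np.sum(lam / (1 + lam))
hotelling_lawley = np.sum(lam)
roy = lam[0] / (1 + lam[0])

print("eigenvalues:           ", np.round(lam, 3))
print("Wilks (eigenvalues):   ", round(wilks_eig, 3))
print("Wilks (determinants):  ", round(wilks_det, 3))
print("Pillai's trace:        ", round(pillai, 3))
print("Hotelling-Lawley trace:", round(hotelling_lawley, 3))
print("Roy's largest root:    ", round(roy, 3))
```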

Assumptions and Diagnostics

MANOVA Assumption Testing

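1. Multivariate Normality: Test using Mardia's multivariate skewness and kurtosis:

b_{1,p} = \frac{1}{n^2} \sum_{i=1}^{n} \sum_{j=1}^{n} [(\mathbf{y}_i - \bar{\mathbf{y}})^T \mathbf{S}^{-1} (\mathbf{y}_j - \bar{\mathbf{y}})]^3

b_{2,p} = \frac{1}{n} \sum_{i=1}^{n} [(\mathbf{y}_i - \bar{\mathbf{y}})^T \mathbf{S}^{-1} (\mathbf{y}_i - \bar{\mathbf{y}})]^2

where $\mathbf{S}$ is the sample covariance matrix (maximum-likelihood form, divisor $n$). Under $H_0$ (multivariate normality): $n\,b_{1,p}/6 \sim \chi^2_{p(p+1)(p+2)/6}$ and $b_{2,p} \sim N(p(p+2),\, 8p(p+2)/n)$ approximately. Because the normality assumption concerns the errors, these statistics are typically computed within each group or on the residuals $\mathbf{Y}_{ij} - \bar{\mathbf{Y}}_j$.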
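
A minimal sketch of Mardia's two tests follows, assuming the standard definitions: skewness $b_{1,p}$ as the average cubed cross-product of standardized deviations over all pairs of observations, and kurtosis $b_{2,p}$ as the average squared Mahalanobis distance squared, with the covariance estimated using divisor $n$. The function name `mardia_test` and the simulated data are illustrative assumptions.

```python
import numpy as np
from scipy import stats

def mardia_test(X):
    """Mardia's multivariate skewness and kurtosis tests for an n x p array."""
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    Xc = X - X.mean(axis=0)
    S = Xc.T @ Xc / n                       # ML covariance estimate (divisor n)
    # G[i, j] = (x_i - xbar)' S^{-1} (x_j - xbar)
    G = Xc @ np.linalg.solve(S, Xc.T)
    b1 = np.sum(G**3) / n**2                # multivariate skewness
    b2 = np.mean(np.diag(G)**2)             # multivariate kurtosis

    skew_stat = n * b1 / 6
    skew_df = p * (p + 1) * (p + 2) // 6
    skew_p = stats.chi2.sf(skew_stat, skew_df)

    kurt_z = (b2 - p * (p + 2)) / np.sqrt(8 * p * (p + 2) / n)
    kurt_p = 2 * stats.norm.sf(abs(kurt_z))  # two-sided
    return {"b1": b1, "b2": b2, "skew_stat": skew_stat, "skew_df": skew_df,
            "skew_p": skew_p, "kurt_z": kurt_z, "kurt_p": kurt_p}

# Simulated bivariate normal data (hypothetical), so H0 holds by construction
rng = np.random.default_rng(0)
X = rng.multivariate_normal([0.0, 0.0], [[4.0, 2.5], [2.5, 3.5]], size=200)
result = mardia_test(X)
for key, value in result.items():
    print(f"{key}: {value:.4f}")
```

For a MANOVA, one would pass the within-group residuals (each observation minus its group mean vector) rather than the raw pooled data, since group mean differences would otherwise look like non-normality.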

2. Homogeneity of Covariance Matrices: Test using Box's M test:

M = (n - g) \ln|\mathbf{S}_p| - \sum_{j=1}^{g} (n_j - 1) \ln|\mathbf{S}_j|

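where $\mathbf{S}_j$ is the sample covariance matrix of group $j$ and $\mathbf{S}_p = \frac{1}{n-g}\sum_{j=1}^{g} (n_j - 1)\mathbf{S}_j$ is the pooled covariance matrix. Under $H_0$:

M(1 - C) \sim \chi^2_{p(p+1)(g-1)/2}, \quad C = \frac{2p^2 + 3p - 1}{6(p+1)(g-1)} \left[\sum_{j=1}^{g} \frac{1}{n_j - 1} - \frac{1}{n-g}\right]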

Box's M is very sensitive to non-normality; use with caution.

3. Linearity: Assess through scatter plots of dependent variable pairs within groups.

Post-Hoc Tests and Discriminant Analysis

Discriminant Function Analysis

The coefficient vectors of the discriminant functions are the eigenvectors of $\mathbf{E}^{-1}\mathbf{H}$. The $s$ discriminant functions are:

D_i = \mathbf{a}_i^T \mathbf{Y} = a_{i1}Y_1 + a_{i2}Y_2 + \cdots + a_{ip}Y_p

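where $\mathbf{a}_i$ is the eigenvector corresponding to eigenvalue $\lambda_i$. The first function maximizes the ratio of between-group to within-group variation, $\mathbf{a}^T\mathbf{H}\mathbf{a} / \mathbf{a}^T\mathbf{E}\mathbf{a}$, and each later function maximizes the same ratio subject to being uncorrelated with the preceding ones.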
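
One way to obtain the coefficient vectors in practice is to solve the generalized symmetric eigenproblem $\mathbf{H}\mathbf{a} = \lambda \mathbf{E}\mathbf{a}$, which has the same eigenvalues and eigenvectors as $\mathbf{E}^{-1}\mathbf{H}$ but keeps both matrices symmetric. The sketch below uses `scipy.linalg.eigh` on the $\mathbf{H}$ and $\mathbf{E}$ from the worked example; the observation `y` is a made-up value used only to show how a score is formed.

```python
import numpy as np
from scipy.linalg import eigh

# H and E from the worked example
H = np.array([[24.5, 8.2], [8.2, 15.3]])
E = np.array([[42.1, 5.7], [5.7, 38.9]])

# Solve H a = lambda E a; eigh returns eigenvalues in ascending order
# and scales eigenvectors so that a' E a = 1
eigvals, eigvecs = eigh(H, E)
order = np.argsort(eigvals)[::-1]
eigvals = eigvals[order]
A = eigvecs[:, order]            # column i holds the coefficients a_i

print("eigenvalues:", np.round(eigvals, 3))
print("discriminant coefficients (columns):")
print(np.round(A, 4))

# Discriminant scores D_i = a_i' y for a hypothetical observation
y = np.array([12.0, 9.0])
scores = A.T @ y
print("scores:", np.round(scores, 3))
```

The sign of each eigenvector is arbitrary, and software packages differ in scaling (a common alternative rescales so that the pooled within-group variance of each score is 1), so coefficients from different tools may differ by a constant factor per function.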

Structure Coefficients

The structure matrix $\mathbf{S}$ contains correlations between original variables and discriminant functions:

s_{jk} = r(Y_j, D_k) = \frac{\sum_i (Y_{ij} - \bar{Y}_j)(D_{ik} - \bar{D}_k)}{(n-1) s_{Y_j} s_{D_k}}

By convention, structure coefficients with absolute value greater than 0.30 (or, more conservatively, 0.40) indicate variables that contribute meaningfully to group discrimination.

Python Implementation

import numpy as np
import pandas as pd
from scipy import stats
import matplotlib.pyplot as plt
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Generate MANOVA data
np.random.seed(42)

# Parameters
n_per_group = 30
n_groups = 3
n_vars = 2

# Group means
means = np.array([
    [10, 8],    # Group 1
    [12, 10],   # Group 2
    [14, 12]    # Group 3
])

# Covariance matrix
Sigma = np.array([
    [4.0, 2.5],
    [2.5, 3.5]
])

# Generate data
data_list = []
for g in range(n_groups):
    Y = np.random.multivariate_normal(means[g], Sigma, n_per_group)
    group_df = pd.DataFrame(Y, columns=['Anxiety', 'Depression'])
    group_df['Group'] = f'Treatment_{g+1}'
    data_list.append(group_df)

data = pd.concat(data_list, ignore_index=True)

# Compute H and E matrices
grand_mean = data[['Anxiety', 'Depression']].mean().values

H = np.zeros((n_vars, n_vars))
E = np.zeros((n_vars, n_vars))

for g in range(n_groups):
    group_data = data[data['Group'] == f'Treatment_{g+1}'][['Anxiety', 'Depression']].values
    n_g = len(group_data)
    group_mean = group_data.mean(axis=0)
    
    # Between-groups matrix
    H += n_g * np.outer(group_mean - grand_mean, group_mean - grand_mean)
    
    # Within-groups matrix
    for i in range(n_g):
        E += np.outer(group_data[i] - group_mean, group_data[i] - group_mean)

print("Hypothesis (Between-groups) matrix H:")
print(np.round(H, 3))
print("\nError (Within-groups) matrix E:")
print(np.round(E, 3))

# Compute eigenvalues of E^{-1}H
E_inv = np.linalg.inv(E)
E_inv_H = E_inv @ H
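# E^{-1}H is not symmetric, so eigvals may return a complex dtype; the eigenvalues are real in theory
eigenvalues = np.linalg.eigvals(E_inv_H).real
eigenvalues = np.sort(eigenvalues)[::-1]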

print(f"\nEigenvalues of E^{{-1}}H: {np.round(eigenvalues, 4)}")

# MANOVA test statistics
s = min(n_vars, n_groups - 1)

# Wilks' Lambda
Lambda = np.prod(1 / (1 + eigenvalues[:s]))
print(f"\nWilks' Lambda: {Lambda:.4f}")

# Pillai's Trace
Pillai = np.sum(eigenvalues[:s] / (1 + eigenvalues[:s]))
print(f"Pillai's Trace: {Pillai:.4f}")

# Hotelling-Lawley Trace
HL_trace = np.sum(eigenvalues[:s])
print(f"Hotelling-Lawley Trace: {HL_trace:.4f}")

# Roy's Largest Root
Roy = eigenvalues[0] / (1 + eigenvalues[0])
print(f"Roy's Largest Root: {Roy:.4f}")

# Rao's approximation for Wilks' Lambda
p = n_vars
g = n_groups
n = len(data)

q = g - 1  # hypothesis degrees of freedom
denom = p**2 + q**2 - 5
# t is undefined (0/0) when denom == 0 and equals 1 when p = q = 1
t = np.sqrt((p**2 * q**2 - 4) / denom) if denom > 0 else 1.0
df1 = p * q
w = n - 1 - (p + g) / 2
df2 = w * t - (p * q - 2) / 2

F_approx = ((1 - Lambda**(1/t)) / Lambda**(1/t)) * (df2 / df1)
p_value = stats.f.sf(F_approx, df1, df2)

print(f"\nRao's F approximation: F({df1:.1f}, {df2:.1f}) = {F_approx:.3f}, p = {p_value:.4f}")

# Box's M test for homogeneity of covariance matrices
def box_m_test(data, group_col, dep_vars):
    """Perform Box's M test for equality of covariance matrices."""
    groups = data[group_col].unique()
    g = len(groups)
    p = len(dep_vars)
    n = len(data)
    
    # Compute pooled and group covariance matrices
    S_pooled = np.zeros((p, p))
    df_total = 0
    
    log_dets = []
    df_groups = []
    
    for group in groups:
        group_data = data[data[group_col] == group][dep_vars].values
        n_g = len(group_data)
        S_g = np.cov(group_data, rowvar=False) * (n_g - 1)
        
        S_pooled += S_g
        df_total += n_g - 1
        
        log_dets.append(np.log(np.linalg.det(S_g / (n_g - 1))))
        df_groups.append(n_g - 1)
    
    S_pooled /= df_total
    log_det_pooled = np.log(np.linalg.det(S_pooled))
    
    # Box's M statistic
    M = (df_total) * log_det_pooled - sum([(df_groups[i]) * log_dets[i] for i in range(g)])
    
    # Correction factor
    sum_inv_df = sum([1/df_groups[i] for i in range(g)])
    C = (2 * p**2 + 3 * p - 1) / (6 * (p + 1) * (g - 1)) * (sum_inv_df - 1/df_total)
    
    # Chi-square approximation
    chi2 = M * (1 - C)
    df_chi2 = p * (p + 1) * (g - 1) / 2
    p_value = 1 - stats.chi2.cdf(chi2, df_chi2)
    
    return M, chi2, df_chi2, p_value

M_stat, chi2_stat, df_m, p_m = box_m_test(data, 'Group', ['Anxiety', 'Depression'])
print(f"\nBox's M test: M = {M_stat:.3f}, Chi2({df_m}) = {chi2_stat:.3f}, p = {p_m:.4f}")

# Discriminant Analysis
lda = LinearDiscriminantAnalysis()
X = data[['Anxiety', 'Depression']].values
y = data['Group'].values
lda.fit(X, y)

# Structure coefficients (correlations with discriminant functions)
disc_scores = lda.transform(X)
structure_corr = np.corrcoef(X.T, disc_scores.T)[:p, p:]
print(f"\nStructure coefficients:\n{np.round(structure_corr, 3)}")

# Visualize discriminant space
fig, axes = plt.subplots(1, 2, figsize=(14, 5))

# Scatter plot in original space
colors = ['blue', 'red', 'green']
for g, group in enumerate(data['Group'].unique()):
    mask = data['Group'] == group
    axes[0].scatter(data[mask]['Anxiety'], data[mask]['Depression'], 
                   c=colors[g], alpha=0.6, label=group, s=50)

axes[0].set_xlabel('Anxiety')
axes[0].set_ylabel('Depression')
axes[0].set_title('Original Variable Space')
axes[0].legend()
axes[0].grid(True, alpha=0.3)

# Discriminant scores
for g, group in enumerate(data['Group'].unique()):
    mask = data['Group'] == group
    axes[1].hist(disc_scores[mask, 0], bins=10, alpha=0.5, 
                color=colors[g], label=group, density=True)

axes[1].set_xlabel('Discriminant Score (Function 1)')
axes[1].set_ylabel('Density')
axes[1].set_title('Distribution of Discriminant Scores')
axes[1].legend()
axes[1].grid(True, alpha=0.3)

plt.tight_layout()
plt.savefig('manova_analysis.png', dpi=150)
plt.show()

# Effect sizes
# Partial eta-squared from Wilks' Lambda
eta2_partial = 1 - Lambda**(1/s)
print(f"\nPartial eta-squared (from Wilks' Lambda): {eta2_partial:.4f}")

# Compare with separate ANOVAs (for illustration)
print("\nSeparate one-way ANOVAs (for comparison):")
for var in ['Anxiety', 'Depression']:
    groups_data = [data[data['Group'] == g][var].values 
                   for g in data['Group'].unique()]
    f_stat, p_val = stats.f_oneway(*groups_data)
    print(f"  {var}: F = {f_stat:.3f}, p = {p_val:.4f}")

Summary: Multivariate Analysis of Variance (MANOVA)

MANOVA Advantage: Tests multiple dependent variables simultaneously, controlling overall $\alpha$ and leveraging inter-variable correlations for increased power
Test Statistics: Wilks' Lambda ( $\Lambda$ ), Pillai's Trace ( $V$ ), Hotelling-Lawley Trace ( $T_0$ ), Roy's Largest Root ( $\theta$ ) — all functions of eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$
Wilks' Lambda: $\Lambda = \prod(1+\lambda_i)^{-1}$; smaller values indicate greater group differences; exact F-test for $p \leq 2$ or $g \leq 3$
Pillai's Trace: Most robust to assumption violations; $V = \sum \lambda_i/(1+\lambda_i)$ ; recommended when assumptions are questionable
Key Matrices: $\mathbf{H}$ (hypothesis) captures between-group variation; $\mathbf{E}$ (error) captures within-group variation; all four test statistics are functions of the eigenvalues of $\mathbf{E}^{-1}\mathbf{H}$
