Properties of Estimators — Unbiasedness, Efficiency, Consistency
Foundations of Statistics
What Makes an Estimator Good?
Estimator properties determine whether statistical procedures produce reliable, accurate, and trustworthy results. Unbiasedness, efficiency, and consistency are the holy trinity of estimator quality.
- Method Selection — Choosing between competing estimators based on theoretical properties
- Study Design — Ensuring data collection methods support estimation quality
- Policy Analysis — Justifying estimator choices for consequential decisions
Understanding estimator properties separates rigorous statistics from mere calculation.
What Are Properties of Estimators?
DfProperties of Estimators
Good estimators share key properties: unbiasedness (the expected value equals the true parameter), efficiency (minimum variance among unbiased estimators), and consistency (converges to the true value as grows). These properties provide a framework for comparing competing estimators.
1. Unbiasedness
Unbiasedness
Here,
- =Expected value of the estimator
- =True population parameter
Bias Decomposition
For any estimator , the mean squared error decomposes as:
where . An unbiased estimator has , so .
Worked Example: Bias of
For , the MLE is .
Compute the expectation:
Bias: .
Unbiased version: has .
Unbiasedness is Not Everything
The MLE is biased but has lower MSE than the unbiased for finite :
Bias alone does not determine quality — the variance reduction from dividing by instead of outweighs the small bias.
2. Efficiency and the Cramér-Rao Lower Bound
ThCramér-Rao Lower Bound (CRLB)
Let be any unbiased estimator of . Under regularity conditions:
where is the Fisher information per observation.
Proof of the Cramér-Rao bound:
Let . For an unbiased estimator, so . Define the score function .
By the Cauchy-Schwarz inequality:
The left side equals (by the identity ). The right side has . Therefore .
Fisher Information
Here,
- =Fisher information per observation
- =Probability density (or mass) function
Efficiency
DfEfficiency
An estimator is efficient if it achieves the Cramér-Rao lower bound: . An efficient estimator has the smallest possible variance among all unbiased estimators.
Relative Efficiency
When comparing two unbiased estimators and :
If , then is more efficient. The Rao-Blackwell theorem guarantees that conditioning a crude unbiased estimator on a sufficient statistic always improves (or maintains) efficiency.
3. Consistency
ThConsistency
An estimator is consistent if as , i.e., for every :
Consistency via MSE
A sufficient condition for consistency is as . Since , this holds if both and .
Consistency of the Sample Mean (Weak Law of Large Numbers)
ThWeak Law of Large Numbers (WLLN)
If are i.i.d. with , then .
Proof (Chebyshev's inequality): . By Chebyshev: .
4. Sufficiency and the Rao-Blackwell Theorem
DfSufficient Statistic
A statistic is sufficient for if the conditional distribution of the data given does not depend on . Equivalently, by the Fisher-Neyman factorization theorem: .
ThRao-Blackwell Theorem
Let be any unbiased estimator of and a sufficient statistic. Then is unbiased and:
with equality only if is already a function of .
Proof sketch: By the law of total variance: . The non-negative term vanishes if and only if is a function of alone.
5. Lehmann-Scheffé Theorem
ThLehmann-Scheffé Theorem
If is a complete sufficient statistic and is an unbiased estimator that is a function of , then is the unique MVUE (minimum variance unbiased estimator).
Example: For , is complete sufficient. is unbiased for , so is the MVUE. (Note: is the MLE but is not unbiased.)
Python Simulation: Comparing Estimators
import numpy as np
from scipy import stats
np.random.seed(42)
n_values = [5, 10, 25, 50, 100, 500]
reps = 10000
true_mu = 5.0
true_sigma = 3.0
print("Comparing estimators for σ² (true = 9.0):")
print(f"{'n':>6} {'MLE (÷n)':>12} {'Unbiased (÷(n-1))':>18} {'MLE Bias':>10} {'Unbiased Bias':>15}")
for n in n_values:
mle_vars = []
unbiased_vars = []
for _ in range(reps):
data = np.random.normal(true_mu, true_sigma, n)
mu_hat = np.mean(data)
mle_vars.append(np.mean((data - mu_hat)**2))
unbiased_vars.append(np.var(data, ddof=1))
mle_mean = np.mean(mle_vars)
unbiased_mean = np.mean(unbiased_vars)
mle_bias = mle_mean - true_sigma**2
unbiased_bias = unbiased_mean - true_sigma**2
print(f"{n:6d} {mle_mean:12.4f} {unbiased_mean:18.4f} {mle_bias:10.4f} {unbiased_bias:15.4f}")
# Demonstrate WLLN
print("\nWLLN demonstration (convergence of X̄ to μ):")
for n in [10, 100, 1000, 10000]:
data = np.random.normal(true_mu, true_sigma, n)
xbar = np.mean(data)
print(f" n={n:5d}: X̄ = {xbar:.4f} (error = {abs(xbar - true_mu):.4f})")
Key Takeaways
Summary: Properties of Estimators
- Unbiasedness: . Necessary but not sufficient — also need low variance
- Efficiency: achieve the CRLB. The MLE is asymptotically efficient
- Consistency: . Guaranteed when
- Sufficiency: a statistic that captures all information about in the data
- Rao-Blackwell: conditioning on a sufficient statistic always reduces variance
- Lehmann-Scheffé: the unique function of a complete sufficient statistic that is unbiased is the MVUE
- The bias-variance tradeoff means unbiasedness is not always optimal; MSE is often a better criterion