Gamma Distribution — Sum of Exponential Variables
Foundations of Statistics
Flexible Modeling of Positive Data
The gamma distribution extends the exponential to model waiting times for multiple events and right-skewed positive data. Its flexibility makes it ideal for insurance claims, rainfall amounts, and survival analysis.
- Insurance — Modeling claim sizes and aggregate losses in actuarial science
- Meteorology — Predicting rainfall amounts and drought durations
- Healthcare — Survival times in clinical trials and time-to-event data
When data is positive and skewed, the gamma distribution provides the natural framework.
Core Concepts
The gamma distribution generalizes the exponential distribution. It models the waiting time until the -th event in a Poisson process and serves as a flexible model for right-skewed, positive-valued data.
DfGamma Distribution
A continuous random variable has a gamma distribution with shape parameter and rate parameter if its pdf is:
Written . It represents the sum of independent variables (when is integer).
The Gamma Function
DfThe Gamma Function
The Gamma function is defined for by:
Key properties:
- for positive integers
- (functional equation)
ThProof that Γ(n) = (n-1)! for Integer n
Base case:
Inductive step: Assume . Then:
Integration by parts with , :
PDF of Gamma Distribution
Here,
- =Shape parameter
- =Rate parameter
- =Gamma function (generalization of factorial)
Derivation of Mean and Variance
ThDerivation of E[X] and Var(X)
Mean:
Substitute , :
Second moment:
Variance:
Gamma Mean and Variance
Here,
- =Shape parameter
- =Rate parameter
MGF and Additivity
MGF of Gamma Distribution
Here,
- =Transform variable
- =Shape parameter
- =Rate parameter
ThSum of Independent Gammas with Same Rate
If and are independent, then:
Proof:
This is the MGF of .
Important: This additivity property requires the same rate . For different rates, the sum does not have a gamma distribution.
Special Cases
Special Cases of the Gamma
- :
- : (chi-squared with degrees of freedom)
- : — sum of i.i.d. variables
- As : by the CLT (sum of i.i.d. exponentials)
Chi-Squared Connection
ThChi-Squared as a Special Case of Gamma
The chi-squared distribution with degrees of freedom is:
Proof: If are i.i.d. , then .
Each has pdf:
This is . By additivity: sum of such variables is .
Log-Gamma Distribution
The Log-Gamma
If , then follows a log-gamma distribution. As , approaches a normal distribution. For (exponential), follows the Gumbel distribution.
Worked Example
Example: Claim Processing Time
An insurance company finds that claim processing times follow (in hours).
Mean processing time: hours
Variance: hours
SD: hours
Probability that a claim takes more than 10 hours:
where is the gamma CDF. Using the CDF integral or tables:
(The relationship where holds for integer .)
Specific Applications
- Insurance and finance — Aggregate claim amounts, ruin probability, and loss modeling.
- Hydrology — Rainfall amounts, flood frequency analysis (often as gamma or Pearson Type III).
- Bayesian statistics — Conjugate prior for the Poisson rate parameter.
- Queueing theory — Erlang distributions model total service time for multiple sequential tasks.
Key Takeaways
Summary: Gamma Distribution
- Generalizes the exponential; models waiting time for -th event
- PDF:
- Mean: , Variance:
- MGF: ; sum of same-rate gammas is gamma
- gives Exponential; gives Chi-squared
- CLT: approaches as
- The Gamma function generalizes factorial: