Geometric Distribution
Probability Distributions
How Long Until First Success?
The geometric distribution answers a natural question: how long must we wait for the first success? It is the discrete analogue of the exponential distribution and the only discrete distribution possessing the memoryless property.
- Quality control — how many items until first defect
- Recruiting — how many interviews until first hire
- Sales — how many calls until first sale
- Sports — how many games until first win
The geometric distribution is the mathematics of waiting — and its memoryless property makes it unique.
Core Concepts
The geometric distribution answers a natural question: how long must we wait for the first success in a sequence of independent Bernoulli trials? It is the discrete analogue of the exponential distribution and the only discrete distribution possessing the memoryless property.
DfGeometric Distribution (Number of Trials)
A random variable follows a geometric distribution with parameter , written , if its PMF is:
Here counts the trial number on which the first success occurs.
Alternative Parametrization
Some authors define as the number of failures before the first success, with PMF for The two conventions differ by a shift of 1. We use the "number of trials" convention throughout.
PMF Derivation and Verification
Why This PMF Is Correct
The event means the first trials are failures and the -th trial is a success. By independence:
Verification that it sums to 1:
(Using the geometric series formula with for .)
CDF
CDF of Geometric Distribution
Here,
- =Number of trials
- =Failure probability
Derivation
This is elegant: the probability that we haven't succeeded after trials is , the complementary cumulative probability.
Mean and Variance: Derivation
Geometric Mean and Variance
Here,
- =Probability of success
- =Expected trials until first success
Derivation of the Mean
Method 1: Direct summation
where and we used (derivative of geometric series).
Method 2: Tail sum formula (for non-negative integer-valued ):
Derivation of the Variance
Use plus:
Therefore , and:
The Memoryless Property
ThMemoryless Property
The geometric distribution is the only discrete distribution satisfying the memoryless property: for all non-negative integers ,
Proof
Forward direction:
Uniqueness: Suppose a non-negative integer-valued satisfies memorylessness. Let . Then , so is exponential: for some . Setting recovers the geometric distribution.
Intuition
If you've already failed times, the distribution of remaining trials is the same as starting fresh. Each trial is independent, so past failures carry no information about future success. This is the discrete analogue of the exponential distribution's memoryless property in continuous time.
Hazard Function
Hazard Rate
Here,
- =Hazard rate at trial k
- =Constant hazard rate
The geometric distribution has a constant hazard rate — the probability of success on any trial, given that we haven't succeeded yet, is always . This is another manifestation of memorylessness and distinguishes the geometric from distributions with increasing or decreasing hazard rates.
Relationship to Other Distributions
ThDistributional Connections
(i) Sum of Geometrics = Negative Binomial: If , then (counting trials to -th success).
(ii) Geometric ⊂ Geometric: The geometric is a special case of the negative binomial with .
(iii) Discrete Analogue of Exponential: The geometric is to the discrete case what the exponential distribution is to the continuous case. Both are memoryless and characterized by constant hazard rates.
(iv) Geometric as sum of Bernoulli indicators: where .
Worked Example: Quality Control
Example: Defective Items on Assembly Line
A machine produces items with defect probability . An inspector checks items sequentially until finding the first defective one.
Expected items inspected: .
Variance: .
Standard deviation: .
Probability the first defect appears on or before trial 10:
Probability it takes more than 50 trials:
Memoryless property in action: If the inspector has already checked 20 items with no defect found, the probability the next item is defective is still — exactly the same as for a fresh start.
Python Implementation
import numpy as np
from scipy import stats
np.random.seed(42)
# Simulate geometric random variables
p = 0.3
n = 10000
samples = np.random.geometric(p, size=n)
# Verify mean and variance
print(f"Geometric(p={p})")
print(f" Empirical mean: {np.mean(samples):.4f} (theoretical: {1/p:.4f})")
print(f" Empirical variance: {np.var(samples, ddof=0):.4f} (theoretical: {(1-p)/p**2:.4f})")
# Verify memoryless property
for s in [5, 10, 20]:
given_gt_s = samples[samples > s]
empirical = np.mean(given_gt_s > s + 5) # P(X > s+5 | X > s) ≈ P(X > 5)
theoretical = (1-p)**5
print(f" P(X > {s+5} | X > {s}): empirical={empirical:.4f}, theoretical P(X>5)={theoretical:.4f}")
Python Implementation: Hazard Rate Verification
import numpy as np
np.random.seed(42)
# Verify constant hazard rate for geometric distribution
p = 0.25
n = 50000
samples = np.random.geometric(p, size=n)
print(f"Geometric(p={p}) — Hazard Rate Verification")
print(f"{'k':>4} {'P(X=k | X>=k)':>14} {'p (theoretical)':>16}")
print("-" * 36)
for k in [1, 2, 3, 5, 10, 20]:
given_ge_k = samples[samples >= k]
if len(given_ge_k) > 0:
hazard = np.mean(given_ge_k == k)
print(f"{k:>4} {hazard:>14.4f} {p:>16.4f}")
# Show that geometric inter-arrival times in Bernoulli process are geometric
print(f"\nBernoulli process inter-arrival times:")
bernoulli = np.random.binomial(1, p, size=10000)
successes = np.where(bernoulli == 1)[0]
inter_arrival = np.diff(np.concatenate([[-1], successes]))
print(f" Mean inter-arrival: {np.mean(inter_arrival):.4f} (theoretical: {1/p:.4f})")
Key Takeaways
Summary: Geometric Distribution
- Counts trials until first success: for
- Mean: ; Variance:
- Memoryless property: — the only discrete distribution with this property
- Constant hazard rate: for all
- CDF:
- Foundation for the Negative Binomial (sum of independent geometrics)
- Discrete analogue of the exponential distribution