Generative Models
Diffusion Models Deep Dive — The Math Behind Image Generation
Diffusion models generate data by learning to reverse a gradual noising process. By training a neural network to predict and remove noise at each step, they produce state-of-the-art images that surpass GANs in both quality and diversity.
- Key point 1 — Forward process adds noise; reverse process learns to denoise step by step
- Key point 2 — Simple noise prediction loss achieves remarkable generation quality
- Key point 3 — Classifier-free guidance and latent diffusion enable text-to-image generation
"From noise, beauty emerges — one denoising step at a time."
Diffusion Models Deep Dive
Diffusion models generate data by learning to reverse a gradual noising process. They have achieved state-of-the-art image generation quality, surpassing GANs in both quality and diversity.
Forward Process (Diffusion)
DfForward Diffusion Process
The forward process gradually adds Gaussian noise to data over timesteps:
After steps, is approximately isotropic Gaussian noise. The process is fixed (no learned parameters) and defined by noise schedule .
Forward Process Marginal
Here,
- =1 - \beta_t
- =\prod_{s=1}^{t} \alpha_s (cumulative product)
- =Noisy version of x_0 at timestep t
- =Noise level at step t