Generalized Interpolating Discrete Diffusion (GIDD)
Reference: arXiv:2503.04482
GIDD generalizes masked diffusion by deriving a new family of interpolating discrete diffusion processes that offer greater flexibility in designing noising processes. By leveraging a novel diffusion ELBO and combining masking with uniform noise, it enables the model to correct its own mistakes and improves sample quality.
Usage
Train on OpenWebText: