Scales diffusion language models to 100B by converting pretrained AR models. LLaDA 2.1 introduces token-to-token editing for real self-correction, achieving 892 tokens/s on HumanEval+.

Outputs (2)

LLaDA 2.0 (model). Scales diffusion language models to 100B by converting pretrained AR models. arXiv: 2512.15745

LLaDA 2.1 (paper). Introduces token-to-token editing on top of mask-to-token denoising, achieving 892 tokens/s on HumanEval+. arXiv: 2602.08676
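The two mechanism names above suggest a two-phase decode loop: a mask-to-token pass that fills masked positions, followed by a token-to-token pass that may overwrite already-committed tokens (self-correction). A minimal toy sketch of that loop; the function names, the 0.5 threshold, and the confidence heuristic are illustrative assumptions, not the paper's actual method:

```python
MASK = "<mask>"

def denoise_step(tokens, predict):
    # Mask-to-token: fill every masked position with a model prediction.
    out = list(tokens)
    for i, t in enumerate(out):
        if t == MASK:
            out[i] = predict(out, i)
    return out

def edit_step(tokens, predict, confidence):
    # Token-to-token: re-predict low-confidence committed tokens,
    # letting the model overwrite (self-correct) earlier choices.
    out = list(tokens)
    for i, t in enumerate(out):
        if t != MASK and confidence(out, i) < 0.5:
            out[i] = predict(out, i)
    return out

# Toy stand-ins for a real model's sampler and confidence scores.
def predict(tokens, i):
    return "cat"

def confidence(tokens, i):
    return 0.1 if tokens[i] == "dog" else 0.9

seq = [MASK, "dog", MASK]
seq = denoise_step(seq, predict)           # fills both masks with "cat"
seq = edit_step(seq, predict, confidence)  # rewrites low-confidence "dog"
print(seq)  # ['cat', 'cat', 'cat']
```

In a real diffusion LM, both passes would be driven by the model's per-position logits over several denoising steps; the point of the edit pass is that decoded tokens are not frozen, which is what enables self-correction.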

Tags: generation, architecture, moe, scaling, research
