Introduces "Relay Diffusion," a strategy that breaks image generation into a low-resolution stage followed by a super-resolution process. CogView3 outperformed SDXL in human evaluations while reducing inference time. Capable of high-quality generation at resolutions up to 2048x2048.

Paper

arXiv: 2403.05121

generationvisionresearch

Related