ByteDance's video generation model. Ranked first on Artificial Analysis leaderboards for both text-to-video and image-to-video. Generates 5-second 1080p video in 41.4 seconds on L20.

Paper

arXiv: 2506.09113

generationvideo

Related