Unified image generation model supporting text-to-image, subject-driven generation, identity-preserving generation, image editing, and image-conditioned generation without additional plugins such as ControlNet or IP-Adapter. Consists of only a VAE and a transformer. Accepts arbitrarily interleaved text and image inputs as conditions. Transfers knowledge across tasks and generalizes to unseen domains.
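
To make "arbitrarily interleaved text and image inputs" concrete, here is a minimal sketch of how such a condition could be flattened into one prompt string plus an ordered image list. Everything in it is an assumption for illustration: `ImageRef`, `flatten_interleaved`, the file names, and the `<img><|image_N|></img>` placeholder syntax are hypothetical and are not confirmed by this page; check the model's own documentation for the exact input format.

```python
from dataclasses import dataclass
from typing import List, Tuple, Union


@dataclass(frozen=True)
class ImageRef:
    """Hypothetical stand-in for a conditioning image (here just a file path)."""
    path: str


Segment = Union[str, ImageRef]


def flatten_interleaved(segments: List[Segment]) -> Tuple[str, List[str]]:
    """Turn interleaved text/image segments into (prompt, image_paths).

    Each image is replaced by a numbered placeholder. The placeholder syntax
    is an assumption for illustration, not taken from this page.
    """
    parts: List[str] = []
    image_paths: List[str] = []
    for seg in segments:
        if isinstance(seg, ImageRef):
            # Images are numbered in order of appearance, starting at 1.
            image_paths.append(seg.path)
            parts.append(f"<img><|image_{len(image_paths)}|></img>")
        else:
            parts.append(seg.strip())
    # Drop empty text segments and join the rest with single spaces.
    return " ".join(p for p in parts if p), image_paths


prompt, images = flatten_interleaved([
    "The woman in",
    ImageRef("person.png"),
    "waves her hand, in the style of",
    ImageRef("style.png"),
])
print(prompt)
print(images)
```

Because both the subject image and the style image travel in the same sequence as the text, a single call can express what would otherwise need separate adapters for identity and for conditioning.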

Model Details

Variants

Name Parameters Notes
OmniGen-v1
Tags: generation, multimodal, open-weight