BAAI's first large-scale multimodal foundation model for any-to-any generation. Pioneered multimodal generation at BAAI, combining vision and language in a unified architecture.

Model Details

Variants

Name Parameters Notes
Emu
multimodalgenerationopen-weight

Related