Falcon-H1
Hybrid-head architecture: Transformer attention and Mamba-2 SSM heads run in parallel within each block (their outputs are concatenated), rather than interleaved across layers; a rough sketch follows below. The family spans 0.5B to 34B parameters, covers 18 languages, and supports a 256K context. Up to 4x input throughput and 8x output throughput compared with same-size Transformer models.
Falcon-H1-34B performs on par with models up to the 70B class (Qwen3-32B, Llama-3.3-70B), and Falcon-H1-1.5B-Deep rivals 7B-10B models. Released under CC BY 4.0.
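To make the parallel-hybrid idea concrete, below is a minimal, illustrative PyTorch sketch of a block in which attention heads and SSM heads read the same normalized input and their outputs are concatenated before the output projection. This is not the released Falcon-H1 code: the module names, dimensions, and especially the GRU stand-in for the Mamba-2 branch are assumptions for illustration only.

```python
# Illustrative sketch of a parallel hybrid block (attention + SSM heads, concatenated).
# All sizes and the GRU placeholder for the Mamba-2 branch are assumptions, not the
# actual Falcon-H1 implementation.
import torch
import torch.nn as nn


class ParallelHybridBlock(nn.Module):
    """Attention and SSM branches process the same input in parallel; their
    outputs are concatenated and projected back to the model width."""

    def __init__(self, d_model: int, n_attn_heads: int, d_ssm: int):
        super().__init__()
        self.norm = nn.LayerNorm(d_model)
        # Standard multi-head self-attention branch.
        self.attn = nn.MultiheadAttention(d_model, n_attn_heads, batch_first=True)
        # Placeholder recurrence standing in for a Mamba-2 selective SSM branch.
        self.ssm = nn.GRU(d_model, d_ssm, batch_first=True)
        # Project concatenated [attention | SSM] features back to d_model.
        self.out_proj = nn.Linear(d_model + d_ssm, d_model)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = self.norm(x)
        attn_out, _ = self.attn(h, h, h, need_weights=False)
        ssm_out, _ = self.ssm(h)
        mixed = torch.cat([attn_out, ssm_out], dim=-1)  # parallel, not interleaved
        return x + self.out_proj(mixed)  # residual connection


if __name__ == "__main__":
    block = ParallelHybridBlock(d_model=256, n_attn_heads=4, d_ssm=128)
    y = block(torch.randn(2, 16, 256))
    print(y.shape)  # torch.Size([2, 16, 256])
```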
Model Details
Architecture: Dense (hybrid attention + Mamba-2 SSM)
Parameters: 34B (largest variant)
Context window: 256K tokens
Variants
| Name | Parameters | Notes |
|---|---|---|
| Falcon-H1-0.5B | 0.5B | — |
| Falcon-H1-1.5B | 1.5B | — |
| Falcon-H1-1.5B-Deep | 1.5B | Deeper variant; rivals 7B-10B models |
| Falcon-H1-3B | 3B | — |
| Falcon-H1-7B | 7B | — |
| Falcon-H1-34B | 34B | — |
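A hedged usage sketch with Hugging Face transformers follows. The repo id shown, the need for a recent transformers release with Falcon-H1 support, and the generation settings are assumptions to verify against the official model cards.

```python
# Hypothetical usage sketch; the repo id below is an assumption, check the
# official Falcon-H1 model cards for the exact names and requirements.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "tiiuae/Falcon-H1-1.5B-Deep-Instruct"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

prompt = "Explain the difference between attention and state-space models."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```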
Paper
arXiv: 2507.22448