Four-variant family released under Apache 2.0 (a notable license upgrade from prior Gemma releases). Includes a 31B dense flagship, 26B-A4B MoE (4B active), and edge variants E4B and E2B. 256K token context for the larger models. Multimodal: text, image, video, audio.

Architecture features alternating attention, dual RoPE, Per-Layer Embeddings (PLE), and shared KV cache. Gemma 4 31B achieves 85.2% MMLU Pro, 89.2% AIME 2026, 80.0% LiveCodeBench v6, and Codeforces Elo 2,150. Ranked #3 open model on Arena AI text leaderboard. Crossed 2M downloads shortly after release.

June 5, 2026: deployment-ready QAT (Quantization-Aware Training) 4-bit checkpoints shipped across the family on HuggingFace — including GGUF + unquantized + CT variants for E2B, E4B, 12B, 26B-A4B, and 31B (~20 repos total).

Model Details

Architecture DENSE
Parameters 31B
Context window 256,000
AA Intelligence 29

Variants

Name Parameters Notes
Gemma 4 31B 31B Dense flagship, 256K context; AA Intelligence Index v4.1 = 29
Gemma 4 26B-A4B 26B MoE, 256K context; AA Intelligence Index v4.1 = 26
Gemma 4 E4B 8B Effective 4.5B, 128K context; AA Intelligence Index v4.1 = 12
Gemma 4 E2B 5.1B Effective 2.3B, 128K context; AA Intelligence Index v4.1 = 9
open-weightopen-sourcemultimodalmoe

Related