Grok-4
modelMoE architecture with sparse + dilated attention. 256K token context. Trained with 100x more compute than Grok-2 and 10x more RL compute than Grok-3. Native tool-calling, real-time search, and extended reasoning.
Grok-4 Fast variant offers 2M context at $0.20/$0.50 per million tokens (AA index 35). AA Intelligence Index: 42. Proprietary.
Model Details
Architecture MOE
Context window 256,000
Variants
| Name | Parameters | Notes |
|---|---|---|
| Grok-4 | — | 256K context |
| Grok-4 Fast | — | 2M context, AA index 35 |