Poolside's flagship agentic-coding model: a 225.8B-total / 23.4B-active Mixture-of-Experts transformer (256 experts, top-8 plus one shared expert; first layer dense; pre-norm RMSNorm) with a 256K-token context. Trained from scratch in-house on >30T tokens using 6,144 NVIDIA H200 GPUs with the Muon (Moonlight-variant) optimizer, then post-trained via mid-training, SFT, and an asynchronous online-RL stack using a CISPO objective on verifiable software-engineering rewards (the team compared CISPO against GRPO and GSPO and chose it).

Originally launched April 2026 as an API-only model, Laguna M.1 was released as open weights under Apache 2.0 in June 2026 — both base and post-trained checkpoints, plus FP8 and NVFP4 quantizations, on Hugging Face — as Poolside made open releases its default. Serves on vLLM and SGLang; full BF16 inference requires multi-GPU memory.

Reported benchmarks (technical report, May 2026): SWE-bench Verified 74.6, SWE-bench Multilingual 73.3, SWE-bench Pro 49.2, Terminal-Bench 2.0 56.9 — frontier-tier on agentic coding, ahead of the Devstral 2, GLM-4.7, DeepSeek-V4-Flash, Qwen3.5, and Claude Sonnet 4.6 comparators in the report's SWE-bench Verified figure. Not currently scored on Artificial Analysis — numbers above are self-reported.

Model Details

Architecture MOE
Parameters 225.8B
Active params 23.4B
Experts 256 (top-8)
Context window 262,144
Training tokens 30T
Training hardware 6,144 NVIDIA H200
Optimizer Muon (Moonlight variant)
License Apache 2.0

Benchmark Scores

Benchmark Score Mode
SWE-bench Verified 74.6
SWE-bench Multilingual 73.3
SWE-bench Pro 49.2
Terminal-Bench 2.0 56.9
open-weightcodingagenticmoe

Related