Trinity Mini / Nano

Smaller Trinity variants sharing the same architecture. Mini: 26B/3B active (128 experts, top-8, 131K context). Nano: 6B/1B active (128 experts, 128K context). Both trained on 10T tokens using 512 H200 GPUs. Relicensed to OpenMDW-1.1 (Linux Foundation) on 2026-05-29, from Apache 2.0.

Paper (arXiv)HuggingFace (Mini)OpenRouter (Mini)

Model Details

Architecture MOE

Parameters 26B

Active params 3B

Context window 131,000

Variants

Name	Parameters	Notes
Trinity Mini	26B	—
Trinity Nano	6B	—

Paper

arXiv HTML

moeopen-weightefficiency

Trinity Mini / Nano

Your notes

Model Details

Variants

Paper

Related