First publicly reported 1.58-bit ternary quantization-aware training (QAT) at 8B scale on Huawei Ascend NPUs (CANN stack). Four sizes are released: 0.5B, 1B, 3B, and 8B. The 8B variant retains 95.7% of full-precision accuracy, achieves a ~6× memory reduction, and incurs only ~5% throughput overhead relative to full-precision training.
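For readers unfamiliar with the term, the following is a minimal sketch of what "1.58-bit ternary" quantization-aware training usually means. It assumes the absmean scheme popularized by BitNet b1.58; the entry above does not state which quantizer BitCPM-CANN actually uses, and the function names (`ternary_quantize`, `qat_forward`, `ste_backward`) are hypothetical, not part of any released code.

```python
import math

# A ternary weight takes one of three values, so it carries log2(3) bits,
# which is where the "1.58-bit" label comes from.
BITS_PER_TERNARY_WEIGHT = math.log2(3)


def ternary_quantize(weights, eps=1e-8):
    """Map full-precision weights to codes in {-1, 0, +1} plus one scale.

    Assumption: absmean scaling as in BitNet b1.58, not confirmed by the source.
    """
    scale = max(sum(abs(w) for w in weights) / len(weights), eps)
    # Round to the nearest integer, then clip into the ternary range.
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def qat_forward(inputs, latent_weights):
    """Forward pass of one linear unit using the quantized weights."""
    codes, scale = ternary_quantize(latent_weights)
    return sum(x * c * scale for x, c in zip(inputs, codes))


def ste_backward(inputs, upstream_grad):
    """Straight-through estimator: treat rounding as the identity, so the
    gradient w.r.t. the latent full-precision weights is upstream_grad * x."""
    return [upstream_grad * x for x in inputs]


if __name__ == "__main__":
    latent = [0.9, -0.05, -1.2, 0.3]
    codes, scale = ternary_quantize(latent)
    print(codes, scale)
    print(qat_forward([1.0, 1.0, 1.0, 1.0], latent))
```

During training the optimizer keeps updating the latent full-precision weights, while every forward pass sees only their ternary projection; at inference time only the codes and the scale need to be stored.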

A significant domestic-chip co-design milestone: it demonstrates a fully China-domestic training stack for BitNet-class models, paralleling ByteDance's recently disclosed custom-CPU effort and DeepSeek-V4-Pro's Ascend 950PR pre-training. Developed by the MiniCPM team.

Model Details

Parameters: 8B

Variants

Name               Parameters   Notes
BitCPM-CANN-0.5B   0.5B
BitCPM-CANN-1B     1B
BitCPM-CANN-3B     3B
BitCPM-CANN-8B     8B
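As a rough back-of-the-envelope check on the memory figure, the sketch below estimates weight-only storage for each listed variant. It assumes a 16-bit full-precision baseline and ideal ternary packing at log2(3) bits per weight, and it treats the nominal sizes as exact parameter counts; none of these assumptions is stated in this entry, and `weight_gib` is a hypothetical helper.

```python
import math

BITS_FP16 = 16                 # assumed full-precision baseline
BITS_TERNARY = math.log2(3)    # ideal packing, about 1.58 bits per weight

# Nominal parameter counts taken from the variant names (assumed exact).
VARIANTS = {
    "BitCPM-CANN-0.5B": 0.5e9,
    "BitCPM-CANN-1B": 1e9,
    "BitCPM-CANN-3B": 3e9,
    "BitCPM-CANN-8B": 8e9,
}


def weight_gib(num_params, bits_per_weight):
    """Weight-only storage in GiB; ignores activations and optimizer state."""
    return num_params * bits_per_weight / 8 / 2**30


if __name__ == "__main__":
    for name, n in VARIANTS.items():
        fp = weight_gib(n, BITS_FP16)
        tern = weight_gib(n, BITS_TERNARY)
        print(f"{name}: {fp:.2f} GiB at 16-bit, {tern:.2f} GiB ternary")
    print(f"ideal ratio: {BITS_FP16 / BITS_TERNARY:.2f}x")
```

Under these assumptions the ideal weight-only ratio is about 10×, higher than the ~6× reported above. The entry does not break the figure down, so the gap may reflect a different baseline, a coarser packing such as 2 bits per weight, or tensors kept in higher precision; that reading is a guess rather than a documented fact.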
Tags: efficiency, open-weight, training, foundational

Related