BitCPM-CANN

First publicly reported 1.58-bit ternary quantization-aware training at 8B scale on Huawei Ascend NPUs (CANN stack). Four sizes released: 0.5B, 1B, 3B, 8B. The 8B variant retains 95.7% of the full-precision (FP) baseline's accuracy, achieves a ~6× memory reduction, and incurs only ~5% throughput overhead relative to full-precision training.
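For readers unfamiliar with the "1.58-bit" label: a ternary weight takes one of three values {-1, 0, +1}, which carries log2(3) ≈ 1.58 bits of information. The sketch below illustrates that arithmetic along with one common ternary quantizer, the absmean scheme from the BitNet b1.58 line of work. Whether BitCPM-CANN uses this exact scheme is an assumption, not something stated here, and the function names `ternary_quantize` and `dequantize` are hypothetical helpers written for this illustration.

```python
import math

# Information content of one weight restricted to {-1, 0, +1}.
BITS_PER_TERNARY_WEIGHT = math.log2(3)  # about 1.585, hence "1.58-bit"


def ternary_quantize(weights):
    """Map float weights to codes in {-1, 0, +1} plus one shared scale.

    Assumed scheme (absmean): scale by the mean absolute weight,
    round to the nearest integer, then clip to [-1, 1].
    """
    scale = sum(abs(w) for w in weights) / len(weights)
    if scale == 0.0:
        # All-zero input: every code is 0 and the scale is irrelevant.
        return [0] * len(weights), 0.0
    codes = [max(-1, min(1, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Reconstruct approximate float weights from ternary codes."""
    return [c * scale for c in codes]


if __name__ == "__main__":
    w = [0.9, -0.05, -1.2, 0.4, 0.0]
    codes, scale = ternary_quantize(w)
    print(round(BITS_PER_TERNARY_WEIGHT, 3), codes, dequantize(codes, scale))
```

In quantization-aware training generally, a quantizer like this runs in the forward pass while gradients update a latent full-precision copy of the weights. The sketch covers only the quantizer and does not attempt to reproduce the memory or throughput figures reported above.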

A significant domestic-chip co-design milestone: it demonstrates a fully China-domestic training stack for BitNet-class models, paralleling ByteDance's recently disclosed custom-CPU effort and DeepSeek-V4-Pro's Ascend 950PR pre-training. Developed by the MiniCPM team.

HuggingFace (8B) | Technical doc (PDF)

Model Details

Parameters 8B

Variants

Name	Parameters	Notes
BitCPM-CANN-0.5B	0.5B	—
BitCPM-CANN-1B	1B	—
BitCPM-CANN-3B	3B	—
BitCPM-CANN-8B	8B	—

efficiency, open-weight, training, foundational
