4B dense model optimized for reasoning and RAG, matching performance of 7B-9B models like Llama 3.1-8B while being significantly smaller and more efficient for on-device deployment.

Model Details

Architecture DENSE
Parameters 4B
on-deviceefficiencyopen-weightreasoning

Related