DeepSeek-Coder

Coding-specialized models (1.3B to 33B parameters) trained on 2 trillion tokens, with an accompanying technical report.

Outputs 2

model

Architecture DENSE

paper 2024-01-25

Technical report: "DeepSeek-Coder: When the Large Language Model Meets Programming -- The Rise of Code Intelligence".

Citations 107

coding, open-weight