Open reasoning models (7B and 32B) trained with RL on top of DeepSeek-R1-Distill bases, producing long chain-of-thought outputs. Fully open: weights, training code, and datasets.

OR1-32B: AIME24 82.2, AIME25 73.3, LiveCodeBench 63.0 — outperforms DeepSeek-R1 and Qwen3-32B on math benchmarks. OR1-7B: AIME24 70.2, AIME25 54.6.

Model Details

Architecture: dense
Parameters: 7B / 32B

Variants

Name             Parameters  Notes
Skywork-OR1-7B   7B          trained from a DeepSeek-R1-Distill 7B base
Skywork-OR1-32B  32B         trained from a DeepSeek-R1-Distill 32B base
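Since the weights are open, inference can be sketched with the Hugging Face `transformers` API. This is a minimal sketch, not an official usage snippet: the repo id `Skywork/Skywork-OR1-7B` and the helper names `build_prompt` / `generate_answer` are assumptions to verify against the actual release, and generation settings (e.g. token budget) should follow the model card's recommendations.

```python
MODEL_ID = "Skywork/Skywork-OR1-7B"  # assumed Hugging Face repo id; verify before use


def build_prompt(question: str) -> str:
    """Normalize a user question; the tokenizer's chat template adds special tokens."""
    return question.strip()


def generate_answer(question: str, max_new_tokens: int = 8192) -> str:
    """Run one chat turn through the model (requires a GPU and the model weights)."""
    # Imported lazily so build_prompt is usable without transformers installed.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForCausalLM.from_pretrained(
        MODEL_ID, torch_dtype="auto", device_map="auto"
    )
    messages = [{"role": "user", "content": build_prompt(question)}]
    input_ids = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    # Long chain-of-thought models need a generous new-token budget.
    output = model.generate(input_ids, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output[0][input_ids.shape[-1]:], skip_special_tokens=True)
```

A large `max_new_tokens` matters here: long chain-of-thought models emit their reasoning before the final answer, and a tight budget truncates it.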

Paper

arXiv: 2505.22312

reasoning · open-weight · open-source