Kimi k1.5 | Lab Index

Reasoning-focused model trained with reinforcement learning, claimed to match OpenAI's o1-preview on math and coding tasks. The technical report details scaling RL with LLMs.
