AI Lab Tracker
Labs
Timeline
DeepSeek-GRM: Inference-Time Scaling for Generalist Reward Modeling
paper
2025-04-03
DeepSeek
Generalist reward model with inference-time scaling.
Paper (arXiv)
Paper
arXiv:
2504.02495
reasoning
reward-model