AI Lab Tracker
Labs
Timeline
DeepPlanning
dataset
2026-01-26
Alibaba
Benchmark for evaluating complex agentic planning.
Paper (arXiv)
HuggingFace
Documentation
benchmark
agentic