AI Lab Tracker
Labs
Timeline
BrowseComp & WideSearch
dataset
2025-06-01
Moonshot AI
Benchmarks developed to evaluate an agent's ability to browse the web and synthesize information from multiple sources.
Project Page
benchmark
agentic
Notes
Date approximate.