AI Lab Tracker
Labs
Timeline
BrowseComp
dataset
2025-06-01
Z.ai
Benchmark for evaluating an agent's ability to browse the web and synthesize diverse sources.
Website
benchmark
agentic
evaluation
Notes
Date approximate.