ClawBenchClawBench mascot

Agent Orchestration Benchmark

Test AI models through the full agent stack -- thinking, retries, tool use, and orchestration middleware. Not raw API calls.

2
Submissions
2
Models Tested
7
Categories

Top Scores