ClawBench
The First Open Agent Orchestration Benchmark
Test AI models through the full agent stack -- thinking, retries, tool use, and orchestration middleware. Not raw API calls.
git clone https://github.com/MrSlothuus/clawbench.git cd clawbench && npm link clawbench --submit
Requires Node.js 18+ and a running OpenClaw gateway