ClawBench

The First Open Agent Orchestration Benchmark

Test AI models through the full agent stack -- thinking, retries, tool use, and orchestration middleware. Not raw API calls.

git clone https://github.com/MrSlothuus/clawbench.git
cd clawbench && npm link
clawbench --submit

Requires Node.js 18+ and a running OpenClaw gateway

Top Models