A benchmarking harness for coding agents.
A small, powerful, open-source CLI coding agent that works…