Mercor

Terminal-Bench

Public benchmark / task-submission framework published by Mercor (terminal-bench-3 on GitHub) for evaluating AI agents on terminal-based engineering tasks.

Terminal-Bench is one of 7 APIs that Mercor publishes on the APIs.io network.

Tagged areas include Benchmarks, Agents, and Open Source. The published artifact set on APIs.io includes a GitHub repository.

Documentation GitHub

SDKs

📦

GitHubRepository

https://github.com/Mercor-io/terminal-bench-3

Terminal-Bench

SDKs

API entry from apis.yml