Bench MCP for AI Agents

Securely connect your AI agents and chatbots (Claude, ChatGPT, Cursor, etc) with Bench MCP or direct API to run benchmarks, fetch performance metrics, generate comparative reports, and track historical results through natural language.

Bench logoBench
No Auth

Bench is a benchmarking tool for automated performance measurement and analysis. It helps you quickly evaluate, compare, and track your systems or workflows.

1 Tools

Try Bench now

Type what you want done — sign in and watch it run live in the Tool Router playground.

TOOL ROUTER PLAYGROUND
Bench
Try asking
TOOLS

Supported Tools

Every Bench action and event your agent gets out of the box.

Sleep

Sleep

SETUP GUIDE

Connect Bench MCP Tool with your Agent

1

Install Composio

typescript
npm install @composio/core ai @ai-sdk/openai @ai-sdk/mcp
Install the Composio SDK and Claude Agent SDK
2

Create Tool Router Session

typescript
import { Composio } from '@composio/core';

const composio = new Composio({ apiKey: 'your-api-key' });

console.log("Creating Tool Router session...");
const { mcp } = await composio.create('your-user-id');
console.log(`Tool Router session created: ${mcp.url}`);
Initialize the Composio client and create a Tool Router session
3

Connect to AI Agent

typescript
import { openai } from '@ai-sdk/openai';
import { experimental_createMCPClient as createMCPClient } from '@ai-sdk/mcp';
import { generateText, stepCountIs } from 'ai';

const client = await createMCPClient({
  transport: {
    type: 'http',
    url: mcp.url,
    headers: { 'x-api-key': 'your-composio-api-key' }
  }
});

const tools = await client.tools();

const { text } = await generateText({
  model: openai('gpt-4o'),
  tools,
  messages: [{ role: 'user', content: 'Pause agent execution for 5 seconds' }],
  stopWhen: stepCountIs(5)
});

console.log(`Agent: ${text}`);
Use the MCP server with your AI agent
SETUP GUIDE

Connect Bench API Tool with your Agent

1

Install Composio

typescript
npm install @composio/openai
Install the Composio SDK
2

Initialize Composio and Create Tool Router Session

typescript
import OpenAI from 'openai';
import { Composio } from '@composio/core';
import { OpenAIResponsesProvider } from '@composio/openai';

const composio = new Composio({
  provider: new OpenAIResponsesProvider(),
});
const openai = new OpenAI({});
const session = await composio.create('your-user-id');
Import and initialize Composio client, then create a Tool Router session
3

Execute Bench Tools via Tool Router with Your Agent

typescript
const tools = session.tools;
const response = await openai.responses.create({
  model: 'gpt-4.1',
  tools: tools,
  input: [{
    role: 'user',
    content: 'Run a benchmark sleep test for 5 seconds'
  }],
});
const result = await composio.provider.handleToolCalls(
  'your-user-id',
  response.output
);
console.log(result);
Get tools from Tool Router session and execute Bench actions with your Agent

Why Use Composio?

AI Native Bench Integration

  • Supports both Bench MCP and direct API based integrations
  • Structured, LLM-friendly schemas for reliable tool execution
  • Rich coverage for running, tracking, and analyzing your Bench benchmarks

Managed Auth

  • No credentials required—Bench supports NO_AUTH for fast, frictionless setup
  • Central place to manage and scope Bench access if needed
  • Per user and per environment context for ultimate flexibility

Agent Optimized Design

  • Bench tools are tuned for LLM agents for reliable execution
  • Comprehensive logs so you always know what benchmarks ran, when, and why

Enterprise Grade Security

  • Fine-grained RBAC so you control which agents and users can access Bench
  • Scoped, least privilege access to benchmarking resources
  • Full audit trail of agent actions to support review and compliance
FAQ

Frequently asked questions

No developer credentials are needed for Bench. You can get started right away—no setup required!

Yes! Composio's Tool Router enables agents to use multiple toolkits. Learn more.

Composio is SOC 2 and ISO 27001 compliant with all data encrypted in transit and at rest. Learn more.

Composio maintains and updates all toolkit integrations automatically, so your agents always work with the latest API versions.

Start with Bench.It takes 30 seconds.

Managed auth, hosted MCP servers, and every Bench tool your agent needs.Free to start.

Start building