How to integrate Scrape do MCP with Codex

Codex is one of the most popular coding harnesses out there. And MCP makes the experience even better. With Scrape do MCP integration, you can draft, triage, summarise emails, and much more, all without leaving the terminal or the app, whichever you prefer.

Scrape do logoScrape do
Api Key

Scrape.do is a web scraping API offering rotating proxies, headless browser support, and session management to bypass anti-bot protections. Get reliable, scalable extraction of structured web data in JSON or HTML formats.

16 Tools

Introduction

Codex is one of the most popular coding harnesses out there. And MCP makes the experience even better. With Scrape do MCP integration, you can draft, triage, summarise emails, and much more, all without leaving the terminal or the app, whichever you prefer.

Also integrate Scrape do with

Why use Composio?

Apart from a managed and hosted MCP server, you will get:

  • CodeAct: A dedicated workbench that allows GPT to write its code to handle complex tool chaining. Reduces to-and-fro with LLMs for frequent tool calling.
  • Large tool responses: Handle them to minimise context rot.
  • Dynamic just-in-time access to 20,000 tools across 1000+ other Apps for cross-app workflows. It loads the tools you need, so GPTs aren't overwhelmed by tools you don't need.

How to install Scrape do MCP in Codex

Run the setup command

Run this command in your terminal to add the Composio MCP server to Codex.

Terminal

It will initiate the authentication in a browser window, authorize Codex to access your Composio account.

Composio authentication page

(Optional) Authenticate with OAuth

To authenticate manually, run the login command to open a browser window and authorize Codex to access your Composio account.

bash
codex mcp login composio

Verify the connection

Run codex mcp list to confirm Composio appears as a registered MCP server.

bash
codex mcp list

Codex App

Codex App follows the same approach as VS Code.

  1. Click ⚙️ on the bottom left → MCP Servers → + Add servers → Streamable HTTP:
  2. Fill the header and Key fields with { "x-consumer-api-key" = "ck_*******" }.
  3. The Key is the Composio API key, that you can find on dashboard.composio.dev
  4. Click on Authenticate and authorize Codex to your Composio account and you're all set.
Codex App MCP setup
  1. Restart and verify if it's there in .codex/config.toml
bash
[mcp_servers.composio]
url = "https://connect.composio.dev/mcp"
http_headers = { "x-consumer-api-key" = "ck_*******" }

What is the Scrape do MCP server, and what's possible with it?

The Scrape do MCP server is an implementation of the Model Context Protocol that connects your AI agent and assistants like Claude, Cursor, etc directly to your Scrape do account. It provides structured and secure access to robust web scraping tools, so your agent can perform actions like scraping dynamic pages, managing sessions, setting custom headers or proxies, and extracting structured data from any website on your behalf.

  • Dynamic page scraping with headless browsers: Retrieve fully rendered HTML content from JavaScript-heavy or protected websites by leveraging advanced browser emulation and proxy rotation.
  • Custom scraping session management: Set device type, cookies, wait times, and custom headers to imitate different users, maintain sessions, or access device-specific content for tailored data extraction.
  • Proxy and anti-bot bypass control: Enable super or proxy modes to utilize residential, mobile, or datacenter proxies, helping your agent bypass strict anti-bot systems and geo-restrictions seamlessly.
  • Targeted resource filtering: Block specific URLs like ads or analytics scripts during scraping to increase speed, avoid distractions, and improve privacy.
  • Account usage and statistics retrieval: Access real-time usage stats, subscription status, and remaining request limits so your agent can monitor scraping quotas and avoid interruptions.

Conclusion

You've successfully integrated Scrape do with Codex using Composio's MCP server. Now you can interact with Scrape do directly from your terminal, VS Code, or the Codex App using natural language commands.

Key benefits of this setup:

  • Seamless integration across CLI, VS Code, and standalone app
  • Natural language commands for Scrape do operations
  • Managed authentication through Composio
  • Access to 20,000+ tools across 1000+ apps for cross-app workflows
  • CodeAct workbench for complex tool chaining

Next steps:

  • Try asking Codex to perform various Scrape do operations
  • Explore cross-app workflows by connecting more toolkits
  • Build automation scripts that leverage Codex's AI capabilities
TOOLS

Supported Tools

Every Scrape do action and event your agent gets out of the box.

Cancel Async Job

Tool to cancel an asynchronous scraping job.

Create Async Scraping Job

Tool to create an asynchronous scraping job with specified targets and options.

Get Account Information

Retrieves account information and usage statistics from Scrape.

Get Amazon Product Offers

Get all seller offers for any Amazon product.

Get Amazon product details

Extract structured product data from Amazon product detail pages (PDP).

Get Amazon raw HTML

Tool to get raw HTML from any Amazon page with ZIP code geo-targeting.

Get Async API Account Information

Tool to get account information for the Async API including concurrency limits and usage statistics.

Get Async Job Details

Tool to retrieve details and status of a specific asynchronous scraping job.

Get Async Task Result

Tool to retrieve the result of a specific task within an asynchronous job.

Scrape webpage using scrape.do

A tool to scrape web pages using scrape.

List Asynchronous Scraping Jobs

Tool to list all asynchronous scraping jobs.

Use Scrape.do Proxy Mode

This tool implements the Proxy Mode functionality of scrape.

Scrape URL using POST method

Tool to scrape web pages using POST method via scrape.

Search Amazon products

Tool to search Amazon and scrape product listings with structured results.

Block specific URLs during scraping

This tool allows users to block specific URLs during the scraping process.

Set Regional Geolocation for Scraping

This tool allows users to set a broader geographical targeting by specifying a region code instead of a specific country code.

FAQ

Frequently asked questions

With a standalone Scrape do MCP server, the agents and LLMs can only access a fixed set of Scrape do tools tied to that server. However, with the Composio Tool Router, agents can dynamically load tools from Scrape do and many other apps based on the task at hand, all through a single MCP endpoint.

Yes, you can. Codex fully supports MCP integration. You get structured tool calling, message history handling, and model orchestration while Tool Router takes care of discovering and serving the right Scrape do tools.

Yes, absolutely. You can configure which Scrape do scopes and actions are allowed when connecting your account to Composio. You can also bring your own OAuth credentials or API configuration so you keep full control over what the agent can do.

All sensitive data such as tokens, keys, and configuration is fully encrypted at rest and in transit. Composio is SOC 2 Type 2 compliant and follows strict security practices so your Scrape do data and credentials are handled as safely as possible.

Start with Scrape do.It takes 30 seconds.

Managed auth, hosted MCP servers, and every Scrape do tool your agent needs.Free to start.

Start building