Web Tools

An MCP server that provides eight tools: a fast web search powered by SearXNG, five Crawl4AI-powered tools (web_fetch, web_screenshot, web_pdf, web_execute_js, web_crawl), and two Wayback Machine tools (web_snapshots, web_archive).

Architecture

graph LR
    Client["MCP Client<br/>(Claude, Cursor, etc.)"] -->|web_search| Server["MCP Server"]
    Client -->|web_fetch| Server
    Client -->|web_screenshot| Server
    Client -->|web_pdf| Server
    Client -->|web_execute_js| Server
    Client -->|web_crawl| Server
    Client -->|web_snapshots| Server
    Client -->|web_archive| Server
    Server --> SearXNG
    SearXNG --> Redis
    Server --> Crawl4AI
    Server --> Wayback["Wayback Machine"]

The web_search tool queries SearXNG for search results. The Crawl4AI tools handle content extraction, screenshots, PDFs, JS execution, and multi-URL crawling. The Wayback Machine tools list and retrieve archived pages.

The full stack deploys as four services: Redis, SearXNG, Crawl4AI, and this MCP server.

Tools

The server exposes eight MCP tools:

`web_search`

Lightweight web search via SearXNG. Returns structured results.

Parameter	Type	Description
`query`	string (required)	The search query
`limit`	number (optional)	Max results to return (default: 10, max: 20)

Returns a JSON array of { url, title, description } results.
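The result shape and the `limit` bounds can be captured in a few lines of TypeScript. This is a minimal sketch for client code: `SearchResult`, `clampLimit`, and `parseSearchResults` are hypothetical names, not part of this server. Only the field names and the default/max values come from the table above. Whether the server clamps or rejects out-of-range limits is not documented, so the clamping here is an assumption.

```typescript
// Shape of one web_search result, as documented above.
interface SearchResult {
  url: string;
  title: string;
  description: string;
}

// Hypothetical helper: apply the documented default (10) and maximum (20).
// Assumption: out-of-range values are clamped rather than rejected.
function clampLimit(limit?: number): number {
  if (limit === undefined || isNaN(limit)) return 10;
  return Math.min(20, Math.max(1, Math.floor(limit)));
}

// Hypothetical helper: parse the tool's JSON text and keep only
// entries that have all three documented string fields.
function parseSearchResults(json: string): SearchResult[] {
  const data: unknown = JSON.parse(json);
  if (!Array.isArray(data)) throw new Error("expected a JSON array");
  return data.filter(
    (r): r is SearchResult =>
      typeof r === "object" &&
      r !== null &&
      typeof r.url === "string" &&
      typeof r.title === "string" &&
      typeof r.description === "string",
  );
}
```

Filtering instead of throwing on a malformed entry keeps one bad result from discarding the rest of the page.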

`web_fetch`

Fetch a single URL and return its content as clean markdown via Crawl4AI.

Parameter	Type	Description
`url`	string (required)	URL to fetch
`f`	enum (optional)	Content-filter strategy: `raw`, `fit`, `bm25`, or `llm` (default: `fit`)
`q`	string (optional)	Query string for BM25/LLM filters

Returns the page content as markdown.
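As a rough illustration of how the three parameters fit together, the sketch below builds a `web_fetch` argument object. `ContentFilter`, `WebFetchArgs`, and `buildFetchArgs` are hypothetical client-side names. Treating `q` as mandatory for `bm25` and `llm` is an assumption drawn from the table's description, not a documented server rule.

```typescript
// The four documented content-filter strategies.
type ContentFilter = "raw" | "fit" | "bm25" | "llm";

interface WebFetchArgs {
  url: string;
  f?: ContentFilter;
  q?: string;
}

// Hypothetical helper: build arguments for a web_fetch call.
// Assumption: the query-based filters (bm25, llm) need a non-empty q.
function buildFetchArgs(
  url: string,
  f: ContentFilter = "fit", // documented default
  q?: string,
): WebFetchArgs {
  const needsQuery = f === "bm25" || f === "llm";
  if (needsQuery && !q) {
    throw new Error(`filter "${f}" needs a query (q)`);
  }
  // Only send q when the chosen filter actually uses it.
  return needsQuery ? { url, f, q } : { url, f };
}
```

For example, `buildFetchArgs("https://example.com/docs", "bm25", "rate limits")` yields an object that can be passed as the tool's arguments.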

`web_screenshot`

Capture a full-page PNG screenshot of a URL via Crawl4AI.

Parameter	Type	Description
`url`	string (required)	URL to screenshot
`screenshot_wait_for`	number (optional)	Seconds to wait before capture (default: 2)

Returns a base64-encoded PNG image.
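A client usually needs to turn the base64 string back into bytes before saving or displaying it. The sketch below decodes the payload and checks the standard eight-byte PNG signature. `decodeBase64` and `isPng` are hypothetical helper names, and the code assumes the tool returns plain base64 without a `data:` URI prefix.

```typescript
// Every PNG file starts with these eight bytes.
const PNG_SIGNATURE = [0x89, 0x50, 0x4e, 0x47, 0x0d, 0x0a, 0x1a, 0x0a];

// Hypothetical helper: decode a base64 string into raw bytes.
// atob is available in browsers and in current Node.js releases.
function decodeBase64(b64: string): Uint8Array {
  const binary = atob(b64);
  const bytes = new Uint8Array(binary.length);
  for (let i = 0; i < binary.length; i++) {
    bytes[i] = binary.charCodeAt(i);
  }
  return bytes;
}

// Hypothetical helper: true when the bytes begin with the PNG signature.
function isPng(bytes: Uint8Array): boolean {
  return (
    bytes.length >= PNG_SIGNATURE.length &&
    PNG_SIGNATURE.every((b, i) => bytes[i] === b)
  );
}
```

Checking the signature before writing the file is a cheap way to catch an error message that was returned in place of an image.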

`web_pdf`

Generate a PDF document of a URL via Crawl4AI.

Parameter	Type	Description
`url`	string (required)	URL to convert to PDF

Returns a base64-encoded PDF.

`web_execute_js`

Execute JavaScript snippets on a URL via Crawl4AI and return the full crawl result.

Parameter	Type	Description
`url`	string (required)	URL to execute scripts on
`scripts`	string[] (required)	List of JavaScript snippets to execute in order

Returns the full CrawlResult JSON including markdown, links, media, and JS execution results.

`web_crawl`

Crawl one or more URLs and extract their content using Crawl4AI.

Parameter	Type	Description
`urls`	string[] (required)	List of URLs to crawl
`browser_config`	object (optional)	Crawl4AI browser configuration
`crawler_config`	object (optional)	Crawl4AI crawler configuration

Returns the extracted content from each URL.

`web_snapshots`

List Wayback Machine snapshots for a URL.

Parameter	Type	Description
`url`	string (required)	URL to check for snapshots
`from`	string (optional)	Start date in YYYYMMDD format
`to`	string (optional)	End date in YYYYMMDD format
`limit`	number (optional)	Max number of snapshots to return (default: 100)
`match_type`	enum (optional)	URL matching: `exact`, `prefix`, `host`, or `domain` (default: `exact`)
`filter`	string[] (optional)	CDX API filters (e.g. `["statuscode:200", "mimetype:text/html"]`)

Returns a JSON array of snapshots with timestamps, status codes, and archive URLs.
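The parameters above line up closely with the query parameters of the Wayback Machine CDX API. The sketch below shows one plausible mapping. `SnapshotArgs` and `buildCdxUrl` are hypothetical names, and the mapping itself is an assumption about how such a tool could be implemented, not a description of this server's code. The CDX parameter names (`url`, `from`, `to`, `limit`, `matchType`, `filter`, `output`) are those of the public CDX API.

```typescript
interface SnapshotArgs {
  url: string;
  from?: string; // YYYYMMDD
  to?: string; // YYYYMMDD
  limit?: number;
  match_type?: "exact" | "prefix" | "host" | "domain";
  filter?: string[];
}

// Hypothetical helper: translate tool arguments into a CDX query URL.
function buildCdxUrl(args: SnapshotArgs): string {
  const params = new URLSearchParams();
  params.set("url", args.url);
  params.set("output", "json");
  params.set("limit", String(args.limit ?? 100)); // documented default
  params.set("matchType", args.match_type ?? "exact"); // documented default
  if (args.from) params.set("from", args.from);
  if (args.to) params.set("to", args.to);
  // Each filter becomes its own repeated "filter" parameter.
  for (const f of args.filter ?? []) {
    params.append("filter", f);
  }
  return `https://web.archive.org/cdx/search/cdx?${params.toString()}`;
}
```

Repeating the `filter` parameter, rather than joining the values, keeps each filter expression separate.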

`web_archive`

Retrieve an archived page from the Wayback Machine.

Parameter	Type	Description
`url`	string (required)	URL of the page to retrieve
`timestamp`	string (required)	Timestamp in YYYYMMDDHHMMSS format
`original`	boolean (optional)	Get original content without Wayback Machine banner (default: false)

Returns the archived page content.
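To make the `timestamp` format and the `original` flag concrete, the sketch below formats a date as YYYYMMDDHHMMSS and builds the matching Wayback Machine URL. `toWaybackTimestamp` and `buildArchiveUrl` are hypothetical helpers. The `id_` suffix is the Wayback Machine's own convention for serving a capture without its banner or link rewriting. That this server uses it for `original: true` is an assumption.

```typescript
// Hypothetical helper: format a Date as a 14-digit UTC timestamp.
function toWaybackTimestamp(date: Date): string {
  // toISOString() is always UTC, e.g. "2024-03-05T14:07:09.000Z".
  return date.toISOString().replace(/[-:T]/g, "").slice(0, 14);
}

// Hypothetical helper: build the URL of an archived capture.
function buildArchiveUrl(
  url: string,
  timestamp: string,
  original = false, // documented default
): string {
  if (!/^\d{14}$/.test(timestamp)) {
    throw new Error("timestamp must be YYYYMMDDHHMMSS");
  }
  // "id_" asks the Wayback Machine for the unmodified capture.
  const modifier = original ? "id_" : "";
  return `https://web.archive.org/web/${timestamp}${modifier}/${url}`;
}
```

With `original` set, `buildArchiveUrl("https://example.com/", "20240305140709", true)` produces `https://web.archive.org/web/20240305140709id_/https://example.com/`.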

Connecting to the Server

All examples below assume your server is reachable at https://your-server.up.railway.app/mcp and protected by an API key. Replace the URL and key with your own values.

Claude Code (CLI)

claude mcp add web_tools \
  --transport http \
  https://your-server.up.railway.app/mcp \
  --header "Authorization: Bearer your-api-key"

Project-level config (`.mcp.json`)

Add the following to .mcp.json at the root of a project to make the tools available to all collaborators:

{
  "mcpServers": {
    "web_tools": {
      "type": "http",
      "url": "https://your-server.up.railway.app/mcp",
      "headers": {
        "Authorization": "Bearer your-api-key"
      }
    }
  }
}

Claude Desktop (`claude_desktop_config.json`)

{
  "mcpServers": {
    "web_tools": {
      "type": "http",
      "url": "https://your-server.up.railway.app/mcp",
      "headers": {
        "Authorization": "Bearer your-api-key"
      }
    }
  }
}
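For clients other than those listed above, it can help to see what a tool call looks like on the wire. MCP uses JSON-RPC 2.0, and a tool is invoked with the `tools/call` method. The sketch below only builds the request; `buildToolCall` and `ToolCallRequest` are hypothetical names. It assumes the server accepts a plain HTTP POST at the `/mcp` endpoint. A server may require an `initialize` handshake first, so an MCP SDK client is usually the more robust choice.

```typescript
interface ToolCallRequest {
  url: string;
  init: {
    method: "POST";
    headers: Record<string, string>;
    body: string;
  };
}

// Hypothetical helper: build (but do not send) a JSON-RPC tools/call request.
function buildToolCall(
  serverUrl: string,
  apiKey: string,
  name: string,
  args: Record<string, unknown>,
  id = 1,
): ToolCallRequest {
  return {
    url: serverUrl,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        // MCP's HTTP transport may answer with JSON or an event stream.
        Accept: "application/json, text/event-stream",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({
        jsonrpc: "2.0",
        id,
        method: "tools/call",
        params: { name, arguments: args },
      }),
    },
  };
}
```

The result can be handed to `fetch(req.url, req.init)`. Keeping construction separate from sending makes the request easy to inspect and log.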

Replace Claude Code's Built-in Web Search & Web Fetch (Optional)

By default, Claude Code uses its own WebSearch and WebFetch tools. You can replace them with this server's web_search and web_fetch tools for privacy-respecting, self-hosted results.

1. Add the MCP server globally:

claude mcp add web_tools --scope user \
  --transport http \
  https://your-server.up.railway.app/mcp \
  --header "Authorization: Bearer your-api-key"

2. Disable the built-in tools by editing ~/.claude/settings.json:

{
  "permissions": {
    "deny": ["WebSearch", "WebFetch"]
  }
}

3. Guide Claude via ~/.claude/CLAUDE.md so it uses your tools:

## Search & Fetch

- Use the web_search MCP tool for all web searches
- Use the web_fetch MCP tool to fetch and read web pages
- Do not attempt to use the built-in WebSearch or WebFetch tools

4. Verify by running /mcp inside Claude Code to check the server is connected, then ask Claude to search for something or fetch a URL.

Deployment (Railway)

1. Click Deploy on Railway: you'll see all 4 services listed (Redis, SearXNG, Crawl4AI, MCP Server).
2. Click Deploy: Railway provisions everything and wires the services together automatically.
3. An API_KEY is auto-generated during deployment. Find it in your MCP Server service's Variables tab and use it as your Bearer token.

Quick Start (Local)

1. Clone and install

git clone https://github.com/arnaudjnn/web-tools
cd web-tools
pnpm install

2. Configure environment

cp .env.example .env.local

3. Start the local stack

docker compose up -d redis searxng crawl4ai

This starts Redis, SearXNG, and Crawl4AI. Then run the MCP server:

SEARXNG_URL=http://localhost:8080 CRAWL4AI_URL=http://localhost:11235 pnpm run start

The server is available at http://localhost:3000/mcp.

4. Or run everything in Docker

docker compose up

Authentication

The API_KEY environment variable is required.

On Railway, the key is auto-generated at deploy time (via ${{secret()}}). For local development, set it in your .env.local file.

Clients provide the key as a Bearer token in the Authorization header (shown in the examples above) or as an ?api_key= query parameter. The /health endpoint is unauthenticated.
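The rules above can be sketched as a small pair of functions. This is an illustration only, not the server's actual middleware: `extractApiKey` and `isAuthorized` are hypothetical names, and preferring the header when both a header and a query parameter are present is an assumption.

```typescript
// Hypothetical helper: read the key from the Authorization header,
// falling back to the ?api_key= query parameter.
function extractApiKey(
  authorization: string | undefined,
  requestUrl: string,
): string | undefined {
  const match = /^Bearer\s+(.+)$/i.exec(authorization ?? "");
  if (match) return match[1].trim();
  // The base URL only lets relative request paths be parsed.
  const fromQuery = new URL(requestUrl, "http://localhost").searchParams.get(
    "api_key",
  );
  return fromQuery ?? undefined;
}

// Hypothetical helper: decide whether a request may proceed.
function isAuthorized(
  path: string,
  key: string | undefined,
  expected: string,
): boolean {
  if (path === "/health") return true; // documented as unauthenticated
  // A production server should prefer a constant-time comparison.
  return key !== undefined && key === expected;
}
```

A request handler would call `extractApiKey` with the incoming header and URL, then pass the result to `isAuthorized` along with the configured `API_KEY`.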

License

MIT
