ScrapeAlchemist/web-execution-layer
Web Execution Layer Workshop

Session 4: Hands-On Building Web Execution Pipelines
Format: Guided implementation with live support

Workshop Overview

Learn to build intelligent web execution pipelines on Bright Data's infrastructure. This hands-on workshop covers the full spectrum, from plain HTTP requests to AI-native browser automation, using Bright Data's products.

The Scenario: Market Intelligence Across Varied Web Environments

Business Intent:

"Monitor competitor pricing across multiple markets, refresh weekly, detect significant changes, feed into BI dashboard"

Target Environments

| Target | Structure | Protection | Complexity | Bright Data Solution |
|---|---|---|---|---|
| Competitor Site B | Static HTML | Minimal | Low | Web Unlocker (HTTP) |
| Amazon | Semi-structured JSON in HTML | Moderate anti-bot | Medium | Scraping Browser |
| Competitor Site A | React SPA, dynamic loading | Heavy fingerprinting | High | Scraping Browser (Advanced) |

Execution Requirements

  • Frequency: Weekly batch + on-demand
  • Volume: 500 products × 3 sources × 3 geos = 4,500 data points
  • Latency: Batch acceptable (not real-time)
  • Failure handling: Retry with escalation
  • Output: Structured JSON to Snowflake/Databricks
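The "retry with escalation" requirement can be sketched as a ladder that walks the execution levels in cost order until one succeeds. This is a minimal sketch, not the workshop's implementation; `attempt` is a hypothetical stand-in for whatever per-level executor you wire in.

```javascript
// Walk the execution levels in cost order until one succeeds.
// `attempt(level, url)` is a placeholder for the real per-level executor.
async function executeWithEscalation(url, attempt) {
  const levels = ['http', 'browserLight', 'browserAdvanced'];
  const errors = [];
  for (const level of levels) {
    try {
      // Return on the first level that works, recording which one it was
      return { level, result: await attempt(level, url) };
    } catch (err) {
      errors.push(`${level}: ${err.message}`);
    }
  }
  throw new Error(`All levels failed for ${url}: ${errors.join('; ')}`);
}
```

Recording the winning level per URL lets later runs start at the cheapest level known to work instead of always retrying from HTTP.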

Repository Structure

Web-Execution-Layer/
├── src/
│   ├── part1-http-execution/      # Web Unlocker for basic environments
│   ├── part2-browser-execution/   # Scraping Browser for complex sites
│   ├── part3-mcp-execution/       # Bright Data MCP for AI agents
│   ├── part4-resilience/          # Monitoring and observability
│   └── shared/                    # Utilities, types, and helpers
├── examples/                      # Complete working examples
└── docs/                          # Additional documentation

Quick Start

Prerequisites

  • Node.js 18+
  • Bright Data account (Sign up free)
  • API credentials configured

Installation

# Clone the repository
git clone https://github.com/ScrapeAlchemist/web-execution-layer.git
cd web-execution-layer

# Install dependencies
npm install

# Copy environment template
cp .env.example .env

# Add your Bright Data credentials to .env

Getting Bright Data Credentials

  1. Sign up at Bright Data
  2. Go to Web Unlocker → Get Web Unlocker credentials
  3. Go to Scraping Browser → Create Scraping Browser zone
  4. Go to SERP API → Get SERP credentials
  5. Go to Zones → Create proxy zone (if needed)
  6. Copy your credentials to .env

Running the Examples

# Part 1: HTTP Execution with Web Unlocker
npm run part1

# Part 2: Browser Execution with Scraping Browser
npm run part2

# Part 3: MCP Execution with Bright Data MCP
npm run part3

# Part 4: Resilience Demo
npm run part4

# Run complete pipeline
npm run full-demo

Workshop Parts

Part 1: HTTP Execution with Web Unlocker (15 min)

Target: Competitor Site B (minimal protection, static HTML)

Bright Data Products Used:

  • Web Unlocker - HTTP-level access with automatic unblocking

Learn to:

  • Use Web Unlocker for automatic CAPTCHA solving
  • Handle geo-targeting with Bright Data proxies
  • Implement automatic escalation triggers
  • Track costs per execution level

📁 Go to Part 1 →


Part 2: Browser Execution with Scraping Browser (20 min)

Target: Amazon & Competitor Site A (JavaScript, anti-bot, dynamic content)

Bright Data Products Used:

  • Scraping Browser - remote browsers with built-in unblocking and JavaScript rendering

Learn to:

  • Connect to Scraping Browser via Puppeteer
  • Handle JavaScript rendering automatically
  • Manage infinite scroll and dynamic content
  • Validate geographic consistency
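Once the Scraping Browser has rendered the page, extraction itself is ordinary string or DOM work. A minimal sketch for pulling prices out of rendered markup; in the workshop code you would feed it `page.content()`, and the regex approach here is an assumption for illustration rather than the workshop's selector strategy.

```javascript
// Extract numeric prices from rendered HTML.
// Pure function, so it runs the same on any markup source.
function extractPrices(html) {
  // Match $/€/£ amounts with optional thousands separators and cents
  const matches = html.match(/[$€£]\s?\d[\d,]*(?:\.\d{2})?/g) ?? [];
  return matches.map((m) => Number(m.replace(/[^0-9.]/g, '')));
}

console.log(extractPrices('<span class="price">$1,299.99</span> <span>$49.00</span>'));
// [1299.99, 49]
```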

📁 Go to Part 2 →


Part 3: MCP Execution for AI Agents (20 min)

AI-Native Web Execution for Unpredictable Environments

Bright Data Products Used:

  • Bright Data MCP Server - AI-native web tools
  • MCP tools: search_engine, scrape_as_markdown, scrape_as_html, session_*

Learn to:

  • Connect Claude/LLMs to Bright Data MCP
  • Use search and scraping tools in AI workflows
  • Build autonomous research agents
  • Calculate costs across execution levels
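The cost calculation in the last bullet can be as simple as weighting per-level request counts by per-level rates. The rates below are the workshop's ballpark $/1000 figures from the cost hierarchy, not quoted Bright Data pricing.

```javascript
// Rough $/1000 rates per execution level (ballpark, not official pricing)
const RATES = { http: 3, browser: 8, browserAdvanced: 15 };

// Total cost for a batch that mixes execution levels,
// e.g. { http: 3000, browser: 1000, browserAdvanced: 500 }
function batchCost(counts) {
  return Object.entries(counts).reduce(
    (sum, [level, n]) => sum + (n / 1000) * RATES[level],
    0
  );
}

console.log(batchCost({ http: 3000, browser: 1000, browserAdvanced: 500 }));
// 24.5
```

Tracking these numbers per run makes the savings from keeping traffic at the HTTP level visible.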

📁 Go to Part 3 →


Part 4: Environmental Resilience & Observability (10 min)

Learn to:

  • Monitor environments over time
  • Detect structural drift
  • Validate geographic consistency
  • Set up automated alerting on environmental changes
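Structural drift detection can start as simply as checking that the markers your extractors depend on still appear in the fetched markup. The marker list below is illustrative, not taken from the workshop code.

```javascript
// Report which expected markers disappeared from a page between runs.
// A non-empty result signals that the site's structure drifted.
function detectDrift(html, expectedMarkers) {
  return expectedMarkers.filter((marker) => !html.includes(marker));
}

const markers = ['class="price"', 'data-product-id', 'class="availability"'];
console.log(detectDrift('<span class="price">$9</span>', markers));
// ['data-product-id', 'class="availability"']
```

Wiring the non-empty result into the Slack webhook from the environment variables below turns this into the automated alerting described above.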

📁 Go to Part 4 →


Bright Data Products Overview

Web Unlocker (HTTP Execution)

// Simple HTTP request with automatic unblocking.
// Node's built-in fetch has no `proxy` option; route the request
// through the Web Unlocker proxy with undici's ProxyAgent instead.
import { ProxyAgent } from 'undici';

const response = await fetch(url, {
  dispatcher: new ProxyAgent(
    `http://${username}:${password}@brd.superproxy.io:22225`
  )
});

Best for: Static HTML, APIs, simple pages
Cost: ~$3/1000 requests

Scraping Browser (Browser Execution)

// Connect via Puppeteer with automatic unblocking
import puppeteer from 'puppeteer-core';

const browser = await puppeteer.connect({
  browserWSEndpoint: `wss://${auth}@brd.superproxy.io:9222`
});

Best for: JavaScript-heavy sites, SPAs, anti-bot protected sites
Cost: ~$8/1000 pages

SERP API (Search Execution)

// Get Google search results
// (`serpApi` stands in for a thin client wrapping Bright Data's SERP API)
const results = await serpApi.search({
  query: 'competitor analysis',
  country: 'us'
});

Best for: Search engine results, market research
Cost: ~$3/1000 searches

MCP Server (AI Execution)

{
  "mcpServers": {
    "brightdata": {
      "command": "npx",
      "args": ["@brightdata/mcp"],
      "env": { "API_TOKEN": "your_api_token" }
    }
  }
}

Best for: AI agents, autonomous workflows
Cost: Variable based on operations


Cost Hierarchy

| Level | Product | Cost/1000 | Speed | Use Case |
|---|---|---|---|---|
| HTTP | Web Unlocker | ~$3 | Fastest | Static HTML, APIs |
| Browser Light | Scraping Browser | ~$8 | Fast | JavaScript rendering |
| Browser Advanced | Scraping Browser | ~$15 | Medium | Anti-bot, fingerprinting |
| MCP Adaptive | MCP Server | Variable | Variable | AI workflows |

Environment Variables

# Bright Data Credentials
BRIGHTDATA_CUSTOMER_ID=your_customer_id
BRIGHTDATA_ZONE_WEB_UNLOCKER=web_unlocker_zone
BRIGHTDATA_ZONE_SCRAPING_BROWSER=scraping_browser_zone
BRIGHTDATA_PASSWORD=your_zone_password

# Optional: MCP API Token
BRIGHTDATA_API_TOKEN=your_api_token

# Optional: Database Output
SNOWFLAKE_ACCOUNT=your_account
SNOWFLAKE_USER=your_user
SNOWFLAKE_PASSWORD=your_password

# Optional: Monitoring
SLACK_WEBHOOK_URL=your_webhook

Resources


Support

  • Workshop issues: Raise your hand or use the workshop Slack channel
  • Post-workshop: Open an issue in this repository
  • Bright Data support: support@brightdata.com

License

MIT License - See LICENSE for details.

About

Bright Data as a web execution layer
