BrowserOS MCP Server

A unified server that provides browser automation tools via Model Context Protocol (MCP) and direct Agent access. Built as a monorepo with shared tools and single binary output.

Architecture

browseros-server/
├── packages/
│   ├── common/     # Shared utilities (context, mutex, browser connection)
│   ├── tools/      # Browser automation tools (CDP + controller-based)
│   ├── mcp/        # MCP HTTP server implementation
│   ├── agent/      # Agent WebSocket server with Claude SDK
│   └── server/     # Unified entry point with shared WebSocketManager

Unified Server Architecture

┌─────────────────────────────────────────┐
│         Unified Server Process          │
│                                         │
│  ┌───────────────────────────────────┐ │
│  │   WebSocketManager (Port 9224)    │ │  ← Shared by all
│  │   Browser Extension Connection    │ │
│  └───────────────┬───────────────────┘ │
│                  │                      │
│        ┌─────────┴─────────┐           │
│        │                   │           │
│  ┌─────▼──────┐    ┌──────▼──────┐    │
│  │ HTTP MCP   │    │   Agent     │    │
│  │ (Port 9223)│    │ (Port 3000) │    │
│  │            │    │             │    │
│  │ External   │    │ Claude SDK  │    │
│  │ MCP        │    │ In-process  │    │
│  │ Clients    │    │ SDK MCP     │    │
│  └────────────┘    └─────────────┘    │
│                                         │
└─────────────────────────────────────────┘

Quick Start

Prerequisites

Bun v1.0.0 or newer
Chrome stable version
macOS, Linux, or Windows

Installation

# Clone the repository
git clone https://github.com/browseros-ai/BrowserOS-server.git
cd BrowserOS-server

# Install dependencies
bun install

# Run tests
bun test:all

# Start the server
bun start

Development

Common Commands

Command	Description
`bun start`	Start the server (port 9223)
`bun test:all`	Run all tests across packages
`bun test:common`	Test common package
`bun test:tools`	Test tools package
`bun test:mcp`	Test MCP package
`bun test:server`	Test server package
`bun run build:binary`	Compile to single binary
`bun run format`	Format code (ESLint + Prettier)
`bun run check-format`	Check code formatting
`bun run clean`	Clean build artifacts

Running with Chrome

The server needs Chrome to run. Set the executable path if Chrome is not in the default location:

PUPPETEER_EXECUTABLE_PATH="/path/to/chrome" bun start

Building

Server Binary

Development builds (single platform):

bun run dev:server           # Current platform
bun run dev:server:linux     # Linux x64
bun run dev:server:macos     # macOS ARM64
bun run dev:server:windows   # Windows x64

# Output: dist/server/browseros-server

Production builds (all platforms):

bun run dist:server

# Output: dist/server/
#   - browseros-server-linux-x64
#   - browseros-server-linux-arm64
#   - browseros-server-darwin-x64
#   - browseros-server-darwin-arm64
#   - browseros-server-windows-x64.exe

Extension

Development build (with source maps):

bun run dev:ext

# Output: dist/ext/
#   - background.js (unminified)
#   - background.js.map
#   - manifest.json

Production build (minified):

bun run dist:ext

# Output: dist/ext/
#   - background.js (minified)
#   - manifest.json

Package Structure

`@browseros/common`

Shared utilities used by all packages:

McpContext - Browser context management
Mutex - Tool execution synchronization
Browser connection management
Logging utilities

`@browseros/tools`

Browser automation tools (26 total):

Pure functions, no server logic
Zod schemas for validation
Response formatters
Direct CDP integration

`@browseros/mcp`

MCP protocol implementation:

HTTP server with SSE transport
Tool registration with MCP SDK
Handles JSON-RPC protocol

`@browseros/agent`

Agent WebSocket server with Claude SDK integration:

Multi-session WebSocket server for concurrent agents
Direct tool execution via shared WebSocketManager (no HTTP overhead)
Claude SDK integration with in-process MCP
Automatic session management and cleanup

`@browseros/server`

Unified server orchestrating all components:

Starts HTTP MCP server and Agent WebSocket server
Single WebSocketManager shared by both MCP and Agent
CLI argument parsing with enable/disable flags
Graceful shutdown and resource cleanup

Testing

Tests use Puppeteer with a real Chrome browser. All tests are in packages/*/tests/:

# Run all tests
bun test:all

# Run with coverage
bun test --coverage

# Run specific package tests
bun test packages/tools

# Run specific test file
bun test packages/tools/tests/tools/pages.test.ts

Test Coverage

Current coverage across 16 test files with 26 tests passing.

Tools Reference

The server provides 26 browser automation tools:

Input automation (7 tools)
- click
- drag
- fill
- fill_form
- handle_dialog
- hover
- upload_file
Navigation automation (7 tools)
Emulation (3 tools)
Performance (3 tools)
Network (2 tools)
- get_network_request
- list_network_requests
Debugging (4 tools)

Server Configuration

Command Line Arguments

# Default startup (MCP + Agent on default ports)
bun start

# Connect to existing Chrome instance (optional)
bun start --cdp-port=9222

# Specify server ports
bun start --http-mcp-port=9223 --agent-port=3000 --extension-port=9224

# Disable MCP HTTP server (Agent-only mode)
bun start --disable-mcp-server

# Disable Agent server (MCP-only mode)
bun start --disable-agent-server

# Full example
bun start --cdp-port=9222 --http-mcp-port=9223 --agent-port=3000 --extension-port=9224

Agent Testing

node packages/agent/scripts/tests/test-client.ts

Environment Variables

Required for Agent:

ANTHROPIC_API_KEY - Anthropic API key for Claude SDK (required for agent functionality)

Agent Configuration:

MAX_SESSIONS - Maximum concurrent agent sessions (default: 5)
SESSION_IDLE_TIMEOUT_MS - Session idle timeout in milliseconds (default: 90000)
EVENT_GAP_TIMEOUT_MS - Max time between agent events in milliseconds (default: 60000)

Optional:

PUPPETEER_EXECUTABLE_PATH - Path to Chrome executable
DEBUG=mcp:* - Enable debug logging
LOG_LEVEL - Logging level (debug, info, warn, error)
NODE_ENV - Environment (development, production)

Architecture Benefits

Unified Process: MCP and Agent run in single server process
Shared WebSocket: Single WebSocketManager for extension connection
Zero Overhead: Agent uses in-process SDK MCP (direct function calls)
Single Binary: Everything compiles to one executable
Shared Tools: Both MCP and Agent use same tool implementations
No Duplication: One source of truth for browser automation
Type Safety: Full TypeScript with Zod validation
Modular: Each package has single responsibility

Contributing

See CONTRIBUTING.md for development guidelines.

License

AGPL-3.0 or later

Troubleshooting

See docs/troubleshooting.md for common issues.

Name		Name	Last commit message	Last commit date
Latest commit History 113 Commits
.github		.github
docs		docs
packages		packages
scripts		scripts
tests		tests
third_party/bin		third_party/bin
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.nvmrc		.nvmrc
.prettierignore		.prettierignore
.prettierrc.cjs		.prettierrc.cjs
CLAUDE.md		CLAUDE.md
README.md		README.md
bun.lock		bun.lock
bunfig.toml		bunfig.toml
eslint.config.mjs		eslint.config.mjs
package-lock.json		package-lock.json
package.json		package.json
server.json		server.json
tsconfig.json		tsconfig.json

browseros-ai/BrowserOS-server

Folders and files

Latest commit

History

Repository files navigation

BrowserOS MCP Server

Architecture

Unified Server Architecture

Quick Start

Prerequisites

Installation

Development

Common Commands

Running with Chrome

Building

Server Binary

Extension

Package Structure

@browseros/common

@browseros/tools

@browseros/mcp

@browseros/agent

@browseros/server

Testing

Test Coverage

Tools Reference

Server Configuration

Command Line Arguments

Agent Testing

Environment Variables

Architecture Benefits

Contributing

License

Troubleshooting

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4