From agent-browse
This skill should be used when the user asks to "browse", "automate browser", "control chrome", "real browser", "anti-bot", "open website", "scrape page", "take screenshot", "click button", or wants to interact with their actual Chrome browser via the agent-browse relay. Provides the tool reference, decision matrix, and workflow patterns for remote browser automation.
How this skill is triggered — by the user, by Claude, or both
Slash command
/agent-browse:browser-relayThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Control the user's actual Chrome browser through the agent-browse relay server. The Chrome extension bridges Claude Code to the user's real browser — with logged-in sessions, natural fingerprints, and anti-bot bypass.
Control the user's actual Chrome browser through the agent-browse relay server. The Chrome extension bridges Claude Code to the user's real browser — with logged-in sessions, natural fingerprints, and anti-bot bypass.
| Scenario | Use agent-browse | Use chrome-devtools-mcp | Use Playwright |
|---|---|---|---|
| Anti-bot protected sites (Cloudflare, etc.) | YES | No | No |
| Pages requiring login sessions | YES | No | No |
| Sites with device fingerprinting | YES | No | No |
| User wants to watch automation live | YES | No | No |
| Quick prototyping / clean browser | Either | YES | YES |
| Automated testing pipelines | No | YES | YES |
Claude Code → MCP (HTTPS) → Relay Server → WebSocket → Chrome Extension → Real Browser
The user's Chrome extension connects to the relay server. Claude Code sends MCP tool calls that route to the user's specific extension via per-user auth tokens.
Before using browser tools, the user must have:
Check connection status first:
tabs_list — if it returns tabs, the extension is connectedAll tools are available as mcp__agent-browse__<tool_name>.
tabs_list — List all open browser tabs with id, url, titletab_attach — Attach debugger to a tab (required before most actions)tab_detach — Detach debugger from a tabnavigate — Navigate a tab to a URLclick — Click at coordinates (x, y)click_selector — Click element by CSS selector (auto-finds coordinates)click_text — Click element by visible text contenttype — Type text into focused elementpress_key — Press a key or key combo (e.g., "Enter", "Control+A")screenshot — Capture visible page as PNGsnapshot — Get accessibility tree (structured DOM) with element IDs (e1, e2, ...)evaluate — Execute JavaScript in page contextnetwork_enable — Start capturing network requests (call BEFORE navigating)network_requests — List captured network requests (with optional URL filter)network_request_detail — Get response body of a specific requestcookies_get — Get cookies for a URLcookies_set — Set a cookiestorage_get — Read localStorage (key or all)storage_set — Write to localStorageextract_table — Extract table data from page by CSS selectorextract_links — Extract all links from pagewait_for — Wait for selector, text, or network idlecdp_raw — Send any Chrome DevTools Protocol command directlytabs_list to find the tab to work withtab_attach with the tab IDnavigate to the target URLsnapshot to see page structure, or screenshot for visualclick_selector, type, press_key to interactevaluate for data, extract_table for tables, screenshot for visual prooftab_detach when donetab_attach before other actions on a tabsnapshot over screenshot for structured interaction — it gives element IDs you can referenceclick_selector or click_text over raw click (x, y) — more reliablenetwork_enable BEFORE navigating if you need to capture XHR/fetch requestswait_for tool is essential after navigation — pages may still be loadingsnapshot (structure) + screenshot (visual) for best understandingGuides creation, editing, and verification of skills for AI coding agents using test-driven development with subagent scenarios. Use when authoring or debugging skills.
npx claudepluginhub smarticeai/smartice-plugins --plugin agent-browse