From amplify
Automates browser interactions with agent-browser CLI: navigate URLs, click/fill elements, take screenshots, extract content, multi-step workflows. For web scraping, form automation, site tasks.
How this skill is triggered — by the user, by Claude, or both
Slash command
/amplify:browse-siteThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Browse and interact with websites using the `agent-browser` CLI.
Browse and interact with websites using the agent-browser CLI.
Before first use, verify agent-browser is available:
agent-browser --version
If the command is not found, install it:
npm install -g agent-browser
agent-browser install # Download Chromium
If installation fails, stop and report the error to the user. Do not proceed.
Navigate — open the target URL:
agent-browser open <url>
If the command exits with a non-zero code or outputs an error, report it and stop.
Snapshot — get interactive elements with stable refs:
agent-browser snapshot -i -c --json
The JSON output contains an elements array. Each element has a ref field (e.g., "@e2") and a role/name describing it. Use these refs for all subsequent interactions.
Interact — act on elements using refs from the latest snapshot:
agent-browser click @e2
agent-browser fill @e5 "search term"
agent-browser type @e7 "typed text"
Re-snapshot — after any interaction that changes the page, take a new snapshot before acting again. Refs from a previous snapshot are stale after page changes.
Repeat until the task is complete, then close the browser:
agent-browser close
| Action | Command |
|---|---|
| Navigate | agent-browser open <url> |
| Get elements | agent-browser snapshot -i -c --json |
| Click | agent-browser click @e<n> |
| Type text | agent-browser type @e<n> "text" |
| Fill field | agent-browser fill @e<n> "text" |
| Screenshot | agent-browser screenshot <path> |
| Get element text | agent-browser get text @e<n> |
| Get page title | agent-browser get title |
| Get current URL | agent-browser get url |
| Wait (milliseconds) | agent-browser wait 2000 |
| Wait (element) | agent-browser wait "#submit-button" |
| Close browser | agent-browser close |
Always provide a path for screenshots (e.g., agent-browser screenshot /tmp/page.png). The command may fail silently without one.
Use --session <name> to persist browser state (cookies, local storage) across commands:
agent-browser open https://example.com --session mysite
agent-browser snapshot -i -c --json --session mysite
agent-browser click @e3 --session mysite
agent-browser close --session mysite
Use sessions for login flows so authenticated state is preserved between steps.
| Option | Description |
|---|---|
-i, --interactive | Only interactive elements (buttons, links, inputs) |
-c, --compact | Remove empty structural elements |
-d, --depth <n> | Limit tree depth |
-s, --selector <sel> | Scope to CSS selector |
--json | Machine-readable output with refs |
Combine options: agent-browser snapshot -i -c -d 5 --json
| Situation | Action |
|---|---|
agent-browser not found | Install it (see Installation Check), then retry. |
open fails (bad URL, network error) | Report the error and URL to the user. Stop. |
| Snapshot returns empty elements | The page may require a wait; run agent-browser wait 2000 then re-snapshot. |
| Login blocked by captcha or MFA | Stop and ask the user to complete the manual step, then continue. |
| Ref not found (stale ref) | Re-snapshot the page; refs change after DOM updates. |
| Screenshot path missing | Always supply an explicit output path to avoid silent failures. |
--headed on any command to watch the browser visually during debugging: agent-browser open <url> --headed--session for any workflow that requires login-i -c on snapshots to keep output small and refs easy to readnpx claudepluginhub wunki/amplify --plugin ask-questions-if-underspecifiedAutomates headless browser tasks with Vercel's agent-browser CLI: navigate URLs, snapshot interactive elements with refs (@e1), click/fill/type, scroll, test web pages.
Automates headless browser tasks via Vercel's agent-browser CLI: navigate URLs, snapshot interactive elements with refs, click/fill forms, scrape data using Bash commands.
Automates browser interactions via the agent-browser CLI: navigation, form filling, clicking, screenshotting, and UI state checking. Use AI snapshot workflow to interact with elements by reference.