agent-browser-stagehand
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.
Install via ClawdBot CLI:
clawdbot install peytoncasper/agent-browser-stagehand
Automate browser interactions using Stagehand CLI with Claude.
The skill automatically selects between a local browser and a remote Browserbase environment.
Check setup.json in this directory. If setupComplete: false:
npm install # Install dependencies
npm link # Create global 'browser' command
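The setup check above can be sketched as a small shell helper. This is a minimal sketch, not the skill's own logic: the function name `needs_setup` is hypothetical, it assumes `setup.json` stores a top-level boolean `setupComplete` as described above, and it treats a missing file as "setup needed", which is also an assumption.

```shell
# needs_setup FILE
# Returns 0 (true) when setup still has to run, 1 otherwise.
# Hypothetical helper; the only grounded detail is the "setupComplete" key.
needs_setup() {
  file="${1:-setup.json}"
  # Assumption: no setup.json at all means setup has not been done.
  [ -f "$file" ] || return 0
  # Match "setupComplete": false, tolerating whitespace around the colon.
  grep -Eq '"setupComplete"[[:space:]]*:[[:space:]]*false' "$file"
}
```

From the skill directory, `needs_setup setup.json && npm install && npm link` would then run the two setup commands only when they are still required.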
All commands work identically in both modes:
browser navigate <url> # Go to URL
browser act "<action>" # Natural language action
browser extract "<instruction>" ['{}'] # Extract data (optional schema)
browser observe "<query>" # Discover elements
browser screenshot # Take screenshot
browser close # Close browser
browser navigate https://example.com
browser act "click the Sign In button"
browser extract "get the page title"
browser close
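When the sequence above runs from a script, a failed step could leave the browser open. The sketch below chains the same four commands and closes the browser whether or not an earlier step fails. The function name `sign_in_and_title` and the `BROWSER_CMD` variable are hypothetical: `BROWSER_CMD` is not part of the skill and exists only so the flow can be dry-run against a stub command.

```shell
# sign_in_and_title URL
# Runs navigate -> act -> extract, then always closes the browser.
# Returns the exit status of the first failing step, or 0 if all succeed.
sign_in_and_title() {
  url="$1"
  cmd="${BROWSER_CMD:-browser}"   # hypothetical override, defaults to 'browser'
  status=0
  "$cmd" navigate "$url" \
    && "$cmd" act "click the Sign In button" \
    && "$cmd" extract "get the page title" \
    || status=$?
  "$cmd" close                    # release the session even after a failure
  return "$status"
}
```

Calling `sign_in_and_title https://example.com` reproduces the example, with later steps skipped as soon as one fails.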
| Feature | Local | Browserbase |
|---------|-------|-------------|
| Speed | Faster | Slightly slower |
| Setup | Chrome required | API key required |
| Stealth mode | No | Yes |
| Proxy/CAPTCHA | No | Yes |
| Best for | Development | Production/scraping |
Use browser observe to discover available elements.
For detailed examples, see EXAMPLES.md.
For API reference, see REFERENCE.md.
Generated Mar 1, 2026
Automate daily price checks across competitor websites to track market trends and adjust pricing strategies. Extract product details and prices using natural language commands, enabling real-time competitive analysis without manual browsing.
Navigate to financial news sites and extract key metrics like stock prices or economic indicators for automated reporting. Use screenshots to verify data accuracy and schedule regular updates to support investment decision-making.
Automate property searches on listing websites to collect details such as prices, locations, and amenities. Fill search forms and click through pagination to aggregate data for market analysis or lead generation.
Browse academic journals or databases to extract publication metadata, abstracts, and citations for literature reviews. Use natural language actions to navigate complex sites and capture structured data for research projects.
Automate visits to social media platforms to observe trends, extract post analytics, or monitor brand mentions. Take screenshots for visual verification and use stealth mode to avoid detection during frequent checks.
Offer a subscription-based service where businesses pay monthly for automated web scraping and interaction tasks. Use Browserbase's stealth mode for reliable production scraping, targeting industries like e-commerce and finance with scalable pricing tiers.
Sell curated datasets extracted from websites using this skill, such as competitor prices or market trends. Automate data collection with natural language commands and provide updates via API or reports, focusing on high-demand verticals like real estate.
Provide consulting services to help companies integrate browser automation into their workflows, such as for form filling or monitoring. Charge for setup, training, and ongoing support, leveraging the skill's beginner-friendly commands for quick deployment.
💬 Integration Tip
Start with local mode for testing, then switch to Browserbase for production to handle proxies and CAPTCHAs; always use screenshots to debug actions.
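One way to apply the screenshot advice is to capture the page only when an action fails. The helper below is a sketch under assumptions: `act_or_screenshot` and `BROWSER_CMD` are hypothetical names, and the sketch assumes the CLI exits with a non-zero status when an action fails, which the text above does not confirm.

```shell
# act_or_screenshot "ACTION"
# Runs a natural-language action; on failure, takes a screenshot for debugging
# and returns the action's original exit status.
act_or_screenshot() {
  cmd="${BROWSER_CMD:-browser}"   # hypothetical override, defaults to 'browser'
  "$cmd" act "$1" && return 0
  status=$?                       # exit status of the failed 'act'
  "$cmd" screenshot               # capture the page state at the point of failure
  return "$status"
}
```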