abby-browser基于 OpenClaw 的浏览器工具,支持自然语言控制网页打开、截图、点击、输入、表单填写、数据提取和页面操作。
Install via ClawdBot CLI:
clawdbot install earnabitmore365/abby-browser_基于 OpenClaw 内置浏览器工具封装_
让爸爸可以用自然语言控制浏览器,不需要记住复杂命令。
| 功能 | 命令 | 示例 |
|------|------|------|
| 打开网页 | open | 打开 Google |
| 截图 | screenshot | 截取当前页面 |
| 点击 | click | 点击按钮 |
| 输入 | type | 输入文字 |
| 填表单 | fill | 填写表单 |
| 提取数据 | snapshot | 获取页面内容 |
| 等待 | wait | 等待加载 |
| 滚动 | scroll | 滚动页面 |
爸爸:帮我打开 Google
Abby:好的爸爸!让我打开 Google...
[执行 openclaw browser open https://google.com]
✅ 已打开 Google
爸爸:帮我截图
Abby:好的,截个图...
[执行 openclaw browser screenshot]
📸 截图已保存
爸爸:帮我点击登录按钮
Abby:好的...
[执行 openclaw browser click 12]
✅ 已点击
# 打开网页
openclaw browser open https://example.com
# 截图
openclaw browser screenshot
openclaw browser screenshot --full-page
# 点击元素 (需要先 snapshot 获取 ref)
openclaw browser click 12
openclaw browser click 12 --double
# 输入文字
openclaw browser type 12 "hello world"
# 填写表单
openclaw browser fill --fields '[{"ref":"1","value":"xxx"}]'
# 获取页面快照
openclaw browser snapshot
openclaw browser snapshot --format aria
# 等待
openclaw browser wait --text "Done"
openclaw browser wait --selector ".content"
# 滚动
openclaw browser evaluate --fn 'window.scrollTo(0, document.body.scrollHeight)'
打开网页的封装脚本。
截图的封装脚本。
点击元素的封装脚本。
表单填写的封装脚本。
数据提取的封装脚本。
每次执行 snapshot 后,元素会有一个编号 (ref):
<button ref="12">登录</button>
<input ref="23" />
点击 ref=12:openclaw browser click 12
--format ai (默认) - AI 理解--format aria - 辅助功能树操作前最好等待:
openclaw browser wait --text "加载完成"
_创建于 2026-02-15_
Generated Mar 1, 2026
QA engineers can use Abby Browser to automate repetitive testing tasks like opening URLs, clicking buttons, and filling forms, reducing manual effort and improving test coverage. It integrates with existing workflows through command-line scripts, making it suitable for regression testing and user acceptance testing scenarios.
Market researchers can leverage the snapshot and extract features to gather data from websites, such as product prices or competitor information, without manual copying. The natural language interface allows non-technical users to request data pulls easily, streamlining research processes and enhancing data accuracy.
Support teams can automate browser interactions to handle common customer issues, like resetting passwords or filling out support forms, based on user requests. This reduces response times and frees up agents for more complex tasks, improving overall service efficiency.
Educators can use Abby Browser to teach students or trainees how to control browsers programmatically through simple commands, making learning web automation accessible. It helps demonstrate concepts like element referencing and waiting for page loads in a hands-on, interactive manner.
Offer Abby Browser as a cloud service with tiered subscriptions for individuals, teams, and enterprises, providing features like enhanced logging and priority support. Revenue is generated through monthly or annual fees, with potential upsells for advanced analytics and integration capabilities.
Provide a free version with basic browser control features to attract users, then monetize through premium add-ons such as advanced data extraction tools, custom scripting, or enterprise-grade security features. This model encourages adoption while driving revenue from power users and businesses.
Sell enterprise licenses that include Abby Browser as part of larger automation suites, coupled with professional services for customization and integration into existing systems. Revenue comes from licensing fees and consulting contracts, targeting large organizations with specific workflow needs.
💬 Integration Tip
Integrate Abby Browser by using its command-line scripts within existing automation pipelines or chatbots, ensuring compatibility with OpenClaw tools and Chrome/Chromium browsers for seamless operation.
A fast Rust-based headless browser automation CLI with Node.js fallback that enables AI agents to navigate, click, type, and snapshot pages via structured commands.
Automate web browser interactions using natural language via CLI commands. Use when the user asks to browse websites, navigate web pages, extract data from websites, take screenshots, fill forms, click buttons, or interact with web applications.
Advanced desktop automation with mouse, keyboard, and screen control
Manage n8n workflows and automations via API. Use when working with n8n workflows, executions, or automation tasks - listing workflows, activating/deactivating, checking execution status, manually triggering workflows, or debugging automation issues.
Design and implement automation workflows to save time and scale operations as a solopreneur. Use when identifying repetitive tasks to automate, building workflows across tools, setting up triggers and actions, or optimizing existing automations. Covers automation opportunity identification, workflow design, tool selection (Zapier, Make, n8n), testing, and maintenance. Trigger on "automate", "automation", "workflow automation", "save time", "reduce manual work", "automate my business", "no-code automation".
Browser automation via Playwright MCP server. Navigate websites, click elements, fill forms, extract data, take screenshots, and perform full browser automation workflows.