selenium-browser
Start a Selenium‑controlled Chrome browser, open a URL, take a screenshot, and report progress. Supports headless mode and optional proxy.
Install via ClawdBot CLI:
clawdbot install andreybespalov89/selenium-browser
The skill triggers on any message that contains Chrome, browser, Selenium, screenshot, or open.
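The matching itself is done by ClawdBot and is not shown on this page. As a rough illustration only, the sketch below assumes case-insensitive, whole-word matching against the five listed keywords; `TRIGGER_WORDS` and `should_trigger` are hypothetical names, and the real rules may differ (for example, plain substring matching).

```python
import re

# Keywords taken from the skill description; the matching rules are an assumption.
TRIGGER_WORDS = ("chrome", "browser", "selenium", "screenshot", "open")
_TRIGGER_RE = re.compile(r"\b(" + "|".join(TRIGGER_WORDS) + r")\b", re.IGNORECASE)


def should_trigger(message: str) -> bool:
    """Return True if the message contains any trigger keyword as a whole word."""
    return _TRIGGER_RE.search(message) is not None
```

Under these assumptions, "Please take a screenshot of example.com" would trigger the skill, while "I reopened the ticket" would not, because "open" only appears inside a longer word.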
selenium-browser <URL> [--headless] [--proxy=<url>]
The skill saves the screenshot under /home/main/clawd/diffusion_pdfs/ and reports the path back to the chat.
#!/usr/bin/env python3
import argparse
import json
import os
import sys
import time

from selenium import webdriver
from selenium.webdriver.chrome.options import Options
from selenium.webdriver.chrome.service import Service

# CLI parsing
parser = argparse.ArgumentParser(description="Launch Selenium Chrome and take a screenshot.")
parser.add_argument("url", help="URL to open")
parser.add_argument("--headless", action="store_true", help="Run Chrome headless")
parser.add_argument("--proxy", help="Proxy URL (e.g., http://proxy:3128)")
args = parser.parse_args()

# Prepare Chrome options
chrome_options = Options()
if args.headless:
    chrome_options.add_argument("--headless")
chrome_options.add_argument("--disable-gpu")
chrome_options.add_argument("--no-sandbox")
chrome_options.add_argument("--disable-dev-shm-usage")
if args.proxy:
    chrome_options.add_argument(f"--proxy-server={args.proxy}")

# Locate binaries
chrome_bin = os.getenv("CHROME_BIN", "/usr/bin/google-chrome")
chromedriver_path = os.getenv("CHROMEDRIVER_PATH", "/usr/local/bin/chromedriver")
# Use the configured Chrome binary if it exists; otherwise let Selenium locate Chrome itself
if os.path.isfile(chrome_bin):
    chrome_options.binary_location = chrome_bin
service = Service(executable_path=chromedriver_path)

# Start browser
try:
    driver = webdriver.Chrome(service=service, options=chrome_options)
except Exception as e:
    print(f"❌ Failed to start Chrome: {e}", file=sys.stderr)
    sys.exit(1)

# Navigate and wait for page load
try:
    driver.get(args.url)
    time.sleep(5)  # simple wait; can replace with WebDriverWait for better reliability
except Exception as e:
    print(f"❌ Navigation error: {e}", file=sys.stderr)
    driver.quit()
    sys.exit(1)

# Take screenshot
screenshot_path = os.path.join(os.getenv("HOME", "/tmp"), "screenshot.png")
try:
    driver.save_screenshot(screenshot_path)
except Exception as e:
    print(f"❌ Screenshot error: {e}", file=sys.stderr)
    driver.quit()
    sys.exit(1)

# Clean up
driver.quit()

# Output a JSON object that OpenClaw can parse for the reply
# (json.dumps emits valid JSON; printing the dict directly would emit a Python repr)
print(json.dumps({"status": "ok", "screenshot": screenshot_path}))
# Optional: set paths to Chrome/Chromedriver if not in standard locations
# export CHROME_BIN="/opt/google/chrome/google-chrome"
# export CHROMEDRIVER_PATH="/usr/local/bin/chromedriver"
The skill runs the Python script and captures its stdout as a JSON payload. OpenClaw parses the JSON and sends a message back:
✅ Screenshot saved: /home/main/clawd/diffusion_pdfs/screenshot.png
If the script prints an error, the skill forwards the error text.
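How OpenClaw actually runs the script and builds the reply is not shown here. The sketch below is one way the described flow could look, assuming the script exits non-zero and writes its error to stderr on failure, and prints valid JSON (double-quoted keys, as `json.dumps` produces) on success. `format_reply`, `run_skill`, and the script path passed to it are hypothetical.

```python
import json
import subprocess
import sys


def format_reply(returncode: int, stdout: str, stderr: str) -> str:
    """Turn the script's exit code and output into a chat message."""
    if returncode != 0:
        # Forward the script's own error text
        return stderr.strip() or "❌ Script failed without an error message"
    try:
        payload = json.loads(stdout)
    except json.JSONDecodeError:
        return f"❌ Could not parse script output: {stdout.strip()!r}"
    if isinstance(payload, dict) and payload.get("status") == "ok" and "screenshot" in payload:
        return f"✅ Screenshot saved: {payload['screenshot']}"
    return f"❌ Unexpected script output: {stdout.strip()!r}"


def run_skill(script_path: str, url: str, headless: bool = True) -> str:
    """Run the screenshot script as a subprocess and return the reply text."""
    cmd = [sys.executable, script_path, url]
    if headless:
        cmd.append("--headless")
    result = subprocess.run(cmd, capture_output=True, text=True)
    return format_reply(result.returncode, result.stdout, result.stderr)
```

Keeping the formatting logic in `format_reply` separate from the subprocess call makes it possible to check the reply text without launching a browser.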
Make sure that:
chromedriver is in /usr/local/bin/chromedriver, or CHROMEDRIVER_PATH is set.
google-chrome (or chromium) is in /usr/bin/google-chrome, or CHROME_BIN is set.
selenium is installed with pip install selenium (inside the virtual env you use for the skill).
The script uses a 5‑second static wait after navigation; replace with Selenium's WebDriverWait for dynamic waits.
If you encounter timeouts, adjust the time.sleep(5) value or use WebDriverWait(driver, 20).until(...).
Feel free to tweak the script to fit your environment (proxy, authentication, etc.).
Generated Mar 1, 2026
Automates visual regression testing by taking screenshots of web pages after updates or deployments, comparing them to baseline images to detect unintended changes. Useful for development teams to ensure UI consistency across releases without manual checks.
Scrapes e-commerce product pages to capture pricing and availability data via screenshots, enabling businesses to track competitor pricing strategies in real-time. Helps in dynamic pricing adjustments and market analysis.
Captures screenshots of web content such as social media posts or news articles for legal or regulatory compliance, providing timestamped visual records. Essential for industries like finance or media to meet audit requirements.
Verifies the correct display of online ads or landing pages across different regions by taking screenshots through proxies, ensuring marketing materials are rendered properly. Supports A/B testing and campaign optimization.
Automates the capture of screenshots from educational websites or online tutorials for creating study materials or presentations. Useful for educators and trainers to compile visual aids efficiently.
Offers a cloud-based platform where users can schedule automated screenshot captures via API, with features like headless mode and proxy support for scalability. Revenue is generated through monthly or annual subscriptions based on usage tiers.
Provides tailored solutions for businesses needing custom browser automation, such as integrating this skill into existing workflows or adding advanced features like dynamic waits. Revenue comes from project-based fees and ongoing support contracts.
Distributes the skill as an open-source tool with basic functionality, while offering premium add-ons like enhanced logging, priority support, or batch processing capabilities. Revenue is driven by upgrades and enterprise licenses.
💬 Integration Tip
Ensure CHROME_BIN and CHROMEDRIVER_PATH environment variables are correctly set in your deployment environment to avoid launch failures.
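One way to catch a misconfiguration before launching the browser is a small preflight check. The sketch below resolves both variables with the same defaults the script uses and reports any path that is missing or not executable; `check_binaries` and `DEFAULTS` are hypothetical names.

```python
import os

# Same defaults as the script's os.getenv calls
DEFAULTS = {
    "CHROME_BIN": "/usr/bin/google-chrome",
    "CHROMEDRIVER_PATH": "/usr/local/bin/chromedriver",
}


def check_binaries(env=None):
    """Return a list of problems; an empty list means both paths look usable."""
    env = os.environ if env is None else env
    problems = []
    for var, default in DEFAULTS.items():
        path = env.get(var, default)
        if not os.path.isfile(path):
            problems.append(f"{var}: {path} does not exist")
        elif not os.access(path, os.X_OK):
            problems.append(f"{var}: {path} is not executable")
    return problems
```

Calling `check_binaries()` at deployment time and failing fast on a non-empty result gives a clearer message than a Chrome startup error.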