⚠️Install with caution. This skill has very few installs. Always review the source and verify it on clawhub.ai before installing. Community-built skills run with agent permissions — only install ones you trust.

🌐 Browser Automation

Web Scraperv0.1.0

Name: Web Scraper
Author: guifav

web-scraper

guifav

Web scraping and content comprehension agent — multi-strategy extraction with cascade fallback, news detection, boilerplate removal, structured metadata, and...

browser-automationweb-scraping

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

8.2K

Stars

CreatedFeb 23, 2026

UpdatedMar 16, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install guifav/web-scraper

Skill Package3 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

A74/100

Grade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation21/35

· 56 installs (good)
· 1610 downloads (moderate demand)

Documentation20/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Contains usage examples or trigger description
· Detailed summary

Package Completeness8/15

· skillAssets present (2 files)

Security Analysis

💙 Low Risk

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://openrouter.ai/api/v1/chat/completions

Audited Apr 16, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 1, 2026

Data EngineersBusiness AnalystsResearchersDigital Marketersintermediate

💡 Application Scenarios

Competitive Intelligence for Marketing AgenciesMarketing and Advertising

Marketing agencies can use this skill to scrape competitor websites, news articles, and blog posts to analyze content strategies, track industry trends, and identify keywords. The multi-strategy extraction ensures reliable data collection even from JavaScript-heavy sites, while LLM entity extraction helps identify key players and topics.

Academic Research Data CollectionEducation and Research

Researchers in social sciences or digital humanities can scrape news archives, academic blogs, and online publications to gather datasets for analysis. The pipeline's cleaning and metadata extraction stages produce structured JSON suitable for quantitative analysis, and the planning protocol helps manage ethical and technical risks like paywalls.

Financial News Monitoring for Investment FirmsFinance and Investment

Investment firms can automate scraping of financial news sites, press releases, and regulatory updates to monitor market-moving events. The news detection and entity extraction stages filter relevant articles and extract companies, dates, and relationships, enabling real-time alerts and trend analysis.

E-commerce Product and Review AggregationRetail and E-commerce

E-commerce businesses can scrape product pages, customer reviews, and competitor pricing from various online retailers. The cascade extraction handles dynamic content, while structured metadata extraction captures prices, ratings, and categories for competitive analysis and inventory management.

Media Monitoring for PR AgenciesPublic Relations and Media

Public relations agencies can track brand mentions, news coverage, and social media posts across the web. The skill's ability to detect articles and extract entities like organizations and locations helps in reputation management and reporting client impact metrics efficiently.

💼 Business Models

SaaS Platform for Data-as-a-ServiceSubscription fees, usage-based pricing

Offer a subscription-based platform where clients input URLs or domains to receive structured scraped data via API or dashboard. Revenue comes from tiered plans based on volume, with premium features like real-time monitoring and custom entity extraction using the LLM stage.

Consulting and Custom Integration ServicesProject fees, retainer contracts

Provide bespoke web scraping solutions for enterprises, integrating this skill into their existing data pipelines or workflows. Revenue is generated through project-based fees, ongoing maintenance contracts, and training sessions on using the planning protocol and pipeline effectively.

Data Resale and Market Research ReportsData sales, report subscriptions

Scrape public web data at scale, clean and enrich it using the pipeline's stages, and sell aggregated datasets or insights reports to businesses in specific industries. Revenue streams include one-time sales of datasets and subscription access to updated reports.

💬 Integration Tip

Ensure the OPENROUTER_API_KEY is set in the environment for LLM entity extraction, and install required Python packages like playwright and trafilatura before execution to avoid pipeline failures.