tra-extract-text
Extract readable text, Markdown, HTML, JSON, or XML content from web pages using the trafilatura CLI tool, with optional metadata and output formatting.
Install via ClawdBot CLI:
clawdbot install goog/tra-extract-text
Grade: Fair, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://medium.com/example/article
Audited Apr 17, 2026 · audit v1.0
Generated Mar 22, 2026
News aggregators can use this skill to extract article text from various sources and compile them into a unified feed. It supports markdown output for easy formatting and integration into content management systems, enabling automated updates without manual copying.
Researchers can extract text from online articles or papers for analysis, such as sentiment analysis or topic modeling. The ability to output in plain text or JSON simplifies data preprocessing, saving time compared to manual extraction methods.
Marketing teams can scrape competitor web pages to analyze content structure, keywords, and metadata. This helps in benchmarking SEO strategies and improving content creation, with options like HTML or markdown output for detailed reviews.
Law firms can extract text from online legal documents, such as court rulings or regulations, for archiving and reference. The skill supports output to files, ensuring organized storage and easy retrieval for case preparation.
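For the research use case above, JSON output can be loaded directly into an analysis script. The sketch below assumes the JSON produced by trafilatura carries the extracted body under a "text" key; check that against the output of your installed version. The sample record and the word_counts helper are hypothetical and only illustrate the preprocessing step.

```python
import json
from collections import Counter


def word_counts(trafilatura_json, top_n=3):
    """Return the most frequent words in one extracted document.

    Assumes the JSON record stores the article body under "text".
    """
    record = json.loads(trafilatura_json)
    text = record.get("text") or ""
    # Lowercase each token and strip surrounding punctuation.
    words = [w.strip(".,;:!?\"'()").lower() for w in text.split()]
    return Counter(w for w in words if w).most_common(top_n)


# Hypothetical record standing in for real CLI output.
sample = '{"title": "Example", "text": "Text extraction saves time. Extraction is fast."}'
print(word_counts(sample, top_n=1))
```

The same function could feed a sentiment or topic-modeling pipeline by returning the cleaned word list instead of counts.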
Offer a subscription-based service where users input URLs to receive extracted text via an API, with features like batch processing and custom output formats. Revenue is generated through tiered pricing based on usage volume and advanced options like metadata inclusion.
Collect and clean text data from web sources using this skill, then sell aggregated datasets to businesses for market research or competitive analysis. Revenue comes from one-time sales or licensing agreements for specific industry datasets.
Provide consulting and integration services to embed this skill into existing workflows, such as CRM or content management systems, automating text extraction for clients. Revenue is earned through project-based fees and ongoing support contracts.
💬 Integration Tip
Use the CLI commands directly in scripts or automate with cron jobs for scheduled extractions; consider wrapping in a simple web interface for non-technical users.
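As a minimal sketch of the scripting approach, the Python wrapper below builds a trafilatura command line and writes the result to a file. It assumes the trafilatura CLI is installed and on PATH, and that the -u, --output-format, and --with-metadata options behave as described in trafilatura's help output for your version; confirm with trafilatura --help. The function names and file paths are hypothetical.

```python
import subprocess


def build_command(url, output_format="markdown", with_metadata=False):
    """Assemble the argument list for one trafilatura CLI call."""
    cmd = ["trafilatura", "-u", url, "--output-format", output_format]
    if with_metadata:
        cmd.append("--with-metadata")
    return cmd


def extract_to_file(url, path, **options):
    """Run the CLI and save its standard output to `path`.

    Raises CalledProcessError if the CLI exits with a non-zero status.
    """
    result = subprocess.run(
        build_command(url, **options),
        capture_output=True,
        text=True,
        check=True,
        timeout=60,
    )
    with open(path, "w", encoding="utf-8") as fh:
        fh.write(result.stdout)
    return len(result.stdout)


if __name__ == "__main__":
    # Prints the command only; call extract_to_file(...) to actually fetch.
    print(build_command("https://medium.com/example/article", with_metadata=True))
```

Saved as a script, this could be scheduled with a crontab entry such as 0 6 * * * python3 /path/to/extract.py (path hypothetical) to run once a day at 06:00.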
Scored Apr 19, 2026
Remote-control tmux sessions for interactive CLIs by sending keystrokes and scraping pane output.
Terminal Spotify playback/search via spogo (preferred) or spotify_player.
Advanced desktop automation with mouse, keyboard, and screen control
Runs shell commands inside a dedicated tmux session named claw, then captures and returns the output, with safety checks for destructive commands.
Capture, inspect, and compare screenshots of screens, windows, regions, web pages, simulators, and CI runs with the right tool, wait strategy, viewport, and...
Manage Feishu (Lark) calendars by listing, searching, checking schedules, syncing events, and marking tasks with automated date extraction.