docx-toolkit
Extract text, tables, and images from .docx and legacy .doc files. Handles large documents, CJK text, and complex table structures. Includes deduplication an...
Install via ClawdBot CLI:
clawdbot install zacjiang/docx-toolkit
Grade: Limited, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
http://schemas.openxmlformats.org/drawingml/2006/main}blip
Audited Apr 17, 2026 · audit v1.0
Generated Mar 22, 2026
Extract text and tables from legal contracts and briefs for AI-powered review, summarization, and compliance checking. This automates manual data entry and reduces errors in document processing.
Process research papers and theses in Word format to extract text, tables, and images for literature reviews or data aggregation. CJK text is supported, which suits international academic collaborations.
Migrate content from legacy Word documents to modern CMS or digital platforms by extracting structured text and images. Handles large files and preserves formatting for efficient content transfer.
Extract and deduplicate images from marketing documents to audit visual assets, ensure brand consistency, and optimize storage. Image compression reduces costs for further AI vision analysis.
Batch process internal reports and presentations to extract text for AI summarization and compress images to lower API costs. Supports both .docx and .doc formats for legacy file handling.
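The use cases above all rest on two operations: pulling paragraph text out of a .docx and spotting duplicate images. The sketch below shows one way to do both with only the Python standard library, relying on the fact that a .docx file is a ZIP archive whose main body lives in word/document.xml and whose embedded images live under word/media/. It is not the toolkit's own code: the function names extract_paragraphs and unique_images are hypothetical, and legacy .doc files are not covered.

```python
import hashlib
import zipfile
import xml.etree.ElementTree as ET

# WordprocessingML namespace used by word/document.xml inside a .docx archive.
W_NS = "{http://schemas.openxmlformats.org/wordprocessingml/2006/main}"


def extract_paragraphs(docx_file):
    """Return non-empty paragraph texts in document order (hypothetical helper).

    Paragraphs inside table cells are included, because table cells
    contain ordinary w:p elements.
    """
    with zipfile.ZipFile(docx_file) as archive:
        root = ET.fromstring(archive.read("word/document.xml"))
    paragraphs = []
    for para in root.iter(W_NS + "p"):
        # A paragraph's text is split across runs; join every w:t node.
        text = "".join(node.text or "" for node in para.iter(W_NS + "t"))
        if text.strip():
            paragraphs.append(text)
    return paragraphs


def unique_images(docx_file):
    """Map SHA-256 digest -> first archive member with that content (hypothetical helper)."""
    seen = {}
    with zipfile.ZipFile(docx_file) as archive:
        for name in archive.namelist():
            if name.startswith("word/media/") and not name.endswith("/"):
                digest = hashlib.sha256(archive.read(name)).hexdigest()
                # Keep only the first file seen for each distinct content hash.
                seen.setdefault(digest, name)
    return seen
```

Both functions accept a file path or a binary file object. Hashing file contents catches byte-identical images only; visually similar images with different encodings would still count as distinct.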
Offer a cloud-based API service for document extraction, charging per document or monthly based on usage volume. Targets businesses needing scalable, automated processing without local setup.
Provide custom integration and support for companies to embed this toolkit into their existing workflows, such as legal or research platforms. Includes training and maintenance contracts.
Release a free open-source version for basic extraction, with paid upgrades for advanced features like batch processing, priority support, or enhanced image compression. Appeals to individual users and small teams.
💬 Integration Tip
Install dependencies via pip and test with sample documents first. For batch processing, schedule the scripts with cron jobs or a workflow tool.
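As a rough illustration of the batch step, the sketch below walks a folder of .docx files and applies a caller-supplied function to each one, recording failures instead of stopping at the first bad file. The name batch_process and the handler argument are assumptions made for this example, not part of the toolkit.

```python
import zipfile
from pathlib import Path


def batch_process(input_dir, handler):
    """Apply handler(path) to every .docx under input_dir (hypothetical helper).

    Returns (results, failures), both keyed by the path relative to input_dir.
    """
    base = Path(input_dir)
    results, failures = {}, {}
    for path in sorted(base.rglob("*.docx")):
        key = str(path.relative_to(base))
        try:
            results[key] = handler(path)
        except (zipfile.BadZipFile, KeyError, OSError) as exc:
            # Corrupt archives or missing parts are logged, not fatal.
            failures[key] = repr(exc)
    return results, failures
```

A script built around a function like this can be run by hand against sample documents first, then scheduled once the output looks right.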
Scored Apr 19, 2026