pdf-tools
View, extract, edit, and manipulate PDF files. Supports text extraction, text editing (overlay and replacement), merging, splitting, rotating pages, and reading PDF metadata. Use it when working with PDF documents: reading content, adding or editing text, reorganizing pages, combining files, or extracting information.
Install via ClawdBot CLI:
clawdbot install cmpdchtr/pdf-tools
Tools for viewing, extracting, and editing PDF files using Python libraries (pdfplumber and PyPDF2).
All scripts require these dependencies:
pip3 install pdfplumber PyPDF2
Extract text from PDF (all pages or specific pages):
scripts/extract_text.py document.pdf
scripts/extract_text.py document.pdf -p 1 3 5
scripts/extract_text.py document.pdf -o output.txt
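The source of extract_text.py is not shown here, so the following is only a minimal sketch of how page-selective extraction could be done with pdfplumber. The function names `select_indices` and `extract_text` are hypothetical, and the sketch assumes that the `-p` page numbers are 1-based.

```python
def select_indices(page_count, pages=None):
    """Convert 1-based page numbers (assumed convention) to 0-based indices."""
    if not pages:
        # No selection given: use every page in document order.
        return list(range(page_count))
    bad = [p for p in pages if p < 1 or p > page_count]
    if bad:
        raise ValueError(f"page numbers out of range: {bad}")
    return [p - 1 for p in pages]


def extract_text(path, pages=None):
    """Return the text of the selected pages, separated by blank lines."""
    import pdfplumber  # imported lazily so select_indices works without it

    with pdfplumber.open(path) as pdf:
        indices = select_indices(len(pdf.pages), pages)
        # extract_text() can return None for pages without a text layer.
        return "\n\n".join(pdf.pages[i].extract_text() or "" for i in indices)
```

Calling `extract_text("document.pdf", [1, 3, 5])` would roughly correspond to the `-p 1 3 5` invocation above. Scanned pages without a text layer yield empty strings in this sketch, since no OCR is involved.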
View metadata and structure:
scripts/pdf_info.py document.pdf
scripts/pdf_info.py document.pdf -f json
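As a rough illustration of what a metadata report might involve, the sketch below reads the page count and document information with PyPDF2 and shapes them into a JSON-friendly dictionary. The names `summarize` and `pdf_info` and the output layout are assumptions, not the actual behaviour of pdf_info.py.

```python
import json


def summarize(page_count, raw_metadata):
    """Build a JSON-serializable summary from PyPDF2-style metadata."""
    # PDF info keys look like "/Title"; drop the leading slash for readability.
    info = {key.lstrip("/"): str(value) for key, value in (raw_metadata or {}).items()}
    return {"pages": page_count, "metadata": info}


def pdf_info(path):
    """Read page count and document metadata from a PDF file."""
    from PyPDF2 import PdfReader  # imported lazily; requires PyPDF2

    reader = PdfReader(path)
    # reader.metadata may be None when the file has no info dictionary.
    return summarize(len(reader.pages), reader.metadata)


if __name__ == "__main__":
    print(json.dumps(summarize(2, {"/Title": "Report"}), indent=2))
```

Keeping the formatting step separate from the file access makes it easy to emit either plain text or JSON from the same summary.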
Combine multiple PDFs into one:
scripts/merge_pdfs.py file1.pdf file2.pdf file3.pdf -o merged.pdf
Split into individual pages:
scripts/split_pdf.py document.pdf -o output_dir/
Split by page ranges:
scripts/split_pdf.py document.pdf -o output_dir/ -m ranges -r "1-3,5-7,10-12"
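The range string passed to `-r` has to be parsed before any pages are written. A minimal sketch of one way to do that follows. `parse_ranges`, `split_by_ranges`, and the output file naming are hypothetical, and the sketch assumes 1-based, inclusive ranges.

```python
from pathlib import Path


def parse_ranges(spec):
    """Parse "1-3,5-7,10" into [(1, 3), (5, 7), (10, 10)] (1-based, inclusive)."""
    ranges = []
    for chunk in spec.split(","):
        chunk = chunk.strip()
        if not chunk:
            continue
        start_text, sep, end_text = chunk.partition("-")
        start = int(start_text)
        end = int(end_text) if sep else start  # a bare "10" means a single page
        if start < 1 or end < start:
            raise ValueError(f"invalid page range: {chunk!r}")
        ranges.append((start, end))
    return ranges


def split_by_ranges(src, out_dir, spec):
    """Write one PDF per range; file names here are an illustrative choice."""
    from PyPDF2 import PdfReader, PdfWriter  # imported lazily; requires PyPDF2

    reader = PdfReader(src)
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    for start, end in parse_ranges(spec):
        writer = PdfWriter()
        # Clamp the end of the range to the last page of the document.
        for index in range(start - 1, min(end, len(reader.pages))):
            writer.add_page(reader.pages[index])
        with open(out / f"pages_{start}-{end}.pdf", "wb") as handle:
            writer.write(handle)
```

Validating the whole specification up front means a typo such as `7-5` fails before any output file is created.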
Rotate all pages or specific pages:
scripts/rotate_pdf.py document.pdf -o rotated.pdf -r 90
scripts/rotate_pdf.py document.pdf -o rotated.pdf -r 180 -p 1 3 5
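PDF page rotation is only defined in multiples of 90 degrees, so a rotation tool would normally validate the angle first. The sketch below shows one possible shape for this using PyPDF2 (version 2.x or later is assumed for `page.rotate`). `normalize_rotation` and `rotate_pdf` are hypothetical names, and page numbers are assumed to be 1-based.

```python
def normalize_rotation(angle):
    """Reduce an angle to 0, 90, 180, or 270; reject anything else."""
    if angle % 90 != 0:
        raise ValueError(f"rotation must be a multiple of 90, got {angle}")
    return angle % 360


def rotate_pdf(src, dst, angle, pages=None):
    """Rotate the given 1-based pages clockwise, or all pages when none are given."""
    from PyPDF2 import PdfReader, PdfWriter  # imported lazily; requires PyPDF2

    angle = normalize_rotation(angle)
    reader = PdfReader(src)
    writer = PdfWriter()
    targets = set(pages) if pages else None
    for number, page in enumerate(reader.pages, start=1):
        if targets is None or number in targets:
            page.rotate(angle)
        # Pages that are not selected are copied through unchanged.
        writer.add_page(page)
    with open(dst, "wb") as handle:
        writer.write(handle)
```

Under these assumptions, `rotate_pdf("document.pdf", "rotated.pdf", 180, [1, 3, 5])` would mirror the second command above.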
Add text overlay on a page:
scripts/edit_text.py document.pdf -o edited.pdf --overlay "New Text" --page 1 --x 100 --y 700
scripts/edit_text.py document.pdf -o edited.pdf --overlay "Watermark" --page 1 --x 200 --y 400 --font-size 20
Replace text (limited, works best for simple cases):
scripts/edit_text.py document.pdf -o edited.pdf --replace "Old Text" "New Text"
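Note: Editing text inside a PDF is unreliable because the format stores positioned glyphs rather than flowing text, so replaced text may not match the original font or layout. The overlay method is more reliable than replacement.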
scripts/pdf_info.py file.pdf
scripts/extract_text.py file.pdf -p 1
scripts/extract_text.py file.pdf -o content.txt
scripts/split_pdf.py input.pdf -o pages/
scripts/merge_pdfs.py pages/page_1.pdf pages/page_3.pdf -o reordered.pdf
scripts/pdf_info.py document.pdf
scripts/split_pdf.py document.pdf -o sections/ -m ranges -r "1-5,10-15"
For detailed library documentation and advanced patterns, see references/libraries.md.
Generated Mar 1, 2026
Law firms can use this skill to extract text from contracts and briefs for analysis, merge multiple legal documents into a single file for case preparation, and add annotations or watermarks to drafts. It streamlines document handling without specialized software.
Researchers and students can split PDFs of journal articles into individual sections for focused study, extract text for citation or summarization, and merge different papers into a curated collection. This aids in organizing literature reviews and study materials efficiently.
Companies can extract data from financial or operational PDF reports, rotate pages for proper alignment in presentations, and overlay text like disclaimers or dates on documents. It supports creating polished reports from existing PDF sources.
Libraries or archives can use this skill to get metadata from historical PDFs for cataloging, split large scanned documents into manageable pages, and rotate pages to correct orientation. It helps digitize and organize collections with basic editing tools.
Real estate agents can merge property documents like contracts and disclosures into a single PDF for clients, extract key terms for quick review, and add text overlays for signatures or notes. This simplifies document handling during sales and leases.
Offer basic PDF tools for free with usage limits, then charge for advanced features like batch processing, API access, or priority support. This attracts individual users and small businesses, converting them to paid plans as needs grow.
License the skill as an SDK or API for companies to embed PDF functionality into their own applications, such as document management systems or workflow tools. Provide customization and technical support as part of the package.
Offer tailored solutions for specific industries, like setting up automated PDF workflows for legal firms or educational institutions. Charge for implementation, training, and ongoing maintenance based on client requirements.
💬 Integration Tip
Integrate with cloud storage services like Google Drive or Dropbox for seamless file access, and use webhooks to trigger PDF processing in automated workflows.
Edit PDFs with natural-language instructions using the nano-pdf CLI.
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Parse PDF, Word, PPT, and image files into Markdown using the MinerU API; supports formulas, tables, and OCR. Suited to paper parsing and document extraction.
Generate hand-drawn style diagrams, flowcharts, and architecture diagrams as PNG images from Excalidraw JSON
The awesome PPT format generation tool provided by Baidu.