wiseocr
PDF OCR - Convert PDF to Markdown via WiseDiag cloud API (requires WISEDIAG_API_KEY). Files are uploaded to WiseDiag servers for processing. Supports table r...
Install via ClawdBot CLI:
clawdbot install wisediag/wiseocr
IMPORTANT - READ BEFORE INSTALLING:
This skill uploads your files to WiseDiag's cloud servers for OCR processing.
Do NOT use with sensitive or confidential documents unless:
For sensitive documents, use offline/local OCR tools instead.
Convert PDF files into Markdown format. The script handles API authentication, file upload, OCR processing, and saves the result automatically.
You MUST use the provided script to process files. Do NOT call any API or HTTP endpoint directly.
The script scripts/wiseocr.py handles everything:
- Authentication (WISEDIAG_API_KEY from environment)
- Saving the result to WiseOCR/{filename}.md
Get your API key:
https://console.wisediag.com/apiKeyManage
export WISEDIAG_API_KEY=your_api_key
pip install -r requirements.txt
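Before invoking the script, it can help to confirm that the key is actually visible to the process. The following is a minimal sketch, not part of the skill itself; the helper name `require_api_key` is hypothetical, and the only detail taken from this page is the `WISEDIAG_API_KEY` variable name.

```python
import os


def require_api_key(env=None):
    """Return the WISEDIAG_API_KEY value, or raise if it is missing or blank.

    `env` defaults to the process environment; passing a dict makes the
    check easy to exercise without touching real environment variables.
    """
    env = os.environ if env is None else env
    key = env.get("WISEDIAG_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "WISEDIAG_API_KEY is not set; export it before running wiseocr.py"
        )
    return key
```

Calling `require_api_key()` at the start of a wrapper fails early with a readable message instead of partway through processing.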
To process a PDF file, run:
cd scripts
python wiseocr.py -i /path/to/input.pdf -n original_filename
IMPORTANT: Always pass -n with the original filename (without extension) so the output file is named correctly. If the uploaded file has been renamed (e.g. to ocr_input.pdf), -n ensures the output uses the user's original filename.
The script will automatically save the result to WiseOCR/{name}.md.
Example:
python wiseocr.py -i /tmp/ocr_input.pdf -n medical_report
# Output saved to: WiseOCR/medical_report.md
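The value passed to -n is the original filename without its extension. A small sketch of deriving it, assuming the caller still knows the user's original filename; `name_for_output` is a hypothetical helper and not part of wiseocr.py:

```python
from pathlib import PurePath


def name_for_output(original_filename: str) -> str:
    """Drop any directory part and the final extension from a filename."""
    return PurePath(original_filename).stem


# The renamed upload may be /tmp/ocr_input.pdf, but -n should carry
# the name the user originally gave the file.
print(name_for_output("medical_report.pdf"))  # medical_report
```

Only the last extension is removed, so a name such as `report.v2.pdf` becomes `report.v2`.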
With custom output directory:
python wiseocr.py -i /path/to/input.pdf -n medical_report -o /custom/output/dir
| Flag | Description |
|------|-------------|
| -i, --input | Input PDF file path (required) |
| -n, --name | Original filename without extension for output (recommended) |
| -o, --output | Output directory (default: ./WiseOCR) |
| --dpi | PDF rendering DPI, 72-600 (default: 200) |
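The flags in the table can be assembled programmatically when the script is called from other tooling. This is a hedged sketch: `build_wiseocr_command` is a hypothetical helper, it uses only the flags listed above, and the DPI range check mirrors the documented 72-600 limit rather than anything known about the script's own validation.

```python
import sys


def build_wiseocr_command(input_path, name=None, output_dir=None, dpi=None):
    """Build an argument list for wiseocr.py from the documented flags."""
    cmd = [sys.executable, "wiseocr.py", "-i", str(input_path)]
    if name is not None:
        cmd += ["-n", name]
    if output_dir is not None:
        cmd += ["-o", str(output_dir)]
    if dpi is not None:
        # The table documents 72-600 as the accepted range.
        if not 72 <= dpi <= 600:
            raise ValueError(f"--dpi must be between 72 and 600, got {dpi}")
        cmd += ["--dpi", str(dpi)]
    return cmd


# To run it from the scripts directory (not executed here):
#   import subprocess
#   subprocess.run(build_wiseocr_command("/tmp/ocr_input.pdf", "medical_report"),
#                  cwd="scripts", check=True)
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces.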
After the script runs, the Markdown file is saved automatically:
WiseOCR/{name}.md
The filename is taken from the -n parameter (e.g. -n report → report.md), or falls back to the input filename.
What happens to your files:
Files are uploaded to WiseDiag servers (https://openapi.wisediag.com)
For sensitive documents, use offline/local OCR tools instead.
MIT
Generated Mar 1, 2026
Researchers can convert PDF medical papers and reports into Markdown for easier analysis and data extraction. This skill handles complex layouts and tables common in medical literature, streamlining literature reviews.
Law firms can use this to digitize legal briefs, contracts, and case files into Markdown format for editing and archiving. It supports multi-column layouts, making it suitable for standard legal documents.
Students can upload PDF academic papers to convert them into Markdown for note-taking, summarization, or citation management. The skill's table recognition helps preserve data from research articles.
Companies can process PDF financial reports and market analyses into Markdown to extract key metrics and insights. This aids in data aggregation and reporting for internal use.
Libraries and archives can digitize historical PDF documents into Markdown for preservation and searchability. The skill handles varied layouts, though sensitive documents should be avoided due to cloud processing.
Offer a free tier with limited monthly OCR conversions to attract users, then charge for higher volumes or advanced features like batch processing. This leverages WiseDiag's API infrastructure for scalability.
License the skill to businesses for integrating into their document management systems, providing custom support and enhanced security options. This targets industries like healthcare or legal with high document processing needs.
Market the skill to educational institutions as a tool for students and researchers, offering discounted bulk subscriptions. Include features like collaborative editing and export to academic formats.
Integration Tip
Ensure the WISEDIAG_API_KEY is securely stored as an environment variable and test with non-sensitive documents first to verify output quality.