mistral-ocrExtract text, tables, and images from PDFs or images using Mistral OCR API and output in Markdown, JSON, or HTML formats.
Install via ClawdBot CLI:
clawdbot install YZDame/mistral-ocrIMPORTANT - READ BEFORE INSTALLING:
This skill uploads your files to Mistral's cloud servers for OCR processing.
Do NOT use with sensitive or confidential documents unless:
For sensitive documents, use offline/local OCR tools instead.
A powerful OCR tool that converts PDF files and images into Markdown, JSON, or HTML formats using Mistral's state-of-the-art OCR API.
# Clone or download this repository
git clone https://github.com/YZDame/Mistral-OCR-SKILL.git
cd Mistral-OCR-SKILL
# Install dependencies
pip install -r requirements.txt
Get your API key:
š https://console.mistral.ai/home
Set the environment variable:
export MISTRAL_API_KEY=your_api_key
cd scripts
# Process PDF to Markdown
python3 mistral_ocr.py -i input.pdf
# Process PDF to JSON
python3 mistral_ocr.py -i input.pdf -f json
# Specify output directory
python3 mistral_ocr.py -i input.pdf -o ~/my_ocr_results
| Flag | Description |
|------|-------------|
| -i, --input | Input file path (required) |
| -f, --format | Output format: markdown/json/html (default: markdown) |
| -o, --output | Output directory |
What happens to your files:
For more details, see: https://mistral.ai/privacy-policy
MIT
Generated Mar 1, 2026
Researchers can convert scanned PDFs of historical documents or printed papers into searchable Markdown or JSON formats for easier analysis and citation extraction. This is useful for literature reviews and data mining in academic institutions.
Law firms can use this skill to convert scanned legal contracts or case files into structured formats like JSON for automated indexing and analysis, improving document management efficiency. Note: Only for non-sensitive documents due to cloud processing.
Companies can transform PDF reports, such as financial statements or marketing materials, into HTML or Markdown for web publishing or internal sharing, facilitating content reuse and collaboration.
Libraries and archives can digitize printed books or magazines into Markdown format to create accessible digital collections, supporting preservation efforts and online access for patrons.
Healthcare providers can convert scanned medical forms into JSON for data entry into electronic health records, streamlining administrative tasks. Must avoid sensitive patient data unless compliant with privacy policies.
Offer a free tier with limited OCR processing per month and charge for higher volumes or advanced features like table recognition. This attracts small users and scales with enterprise clients needing bulk conversions.
License the skill to software companies for embedding into their document management or workflow platforms, providing OCR capabilities as an add-on service. Revenue comes from licensing fees and support contracts.
Provide tailored OCR solutions for specific industries, such as legal or healthcare, with custom output formats and integration support. This model leverages expertise in niche markets for higher-value projects.
š¬ Integration Tip
Ensure the MISTRAL_API_KEY environment variable is securely set before use, and test with non-sensitive files first to verify output formats.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Research any topic from the last 30 days on Reddit + X + Web, synthesize findings, and write copy-paste-ready prompts. Use when the user wants recent social/web research on a topic, asks "what are people saying about X", or wants to learn current best practices. Requires OPENAI_API_KEY and/or XAI_API_KEY for full Reddit+X access, falls back to web search.
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.