formula-ocr
Formula OCR - recognize and extract mathematical formulas from PDFs or images using MinerU. Use for LaTeX formula extraction from documents.
Install via ClawdBot CLI:
clawdbot install mzlzyca/formula-ocr
Grade: Fair, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list: https://mineru.net
Audited Apr 18, 2026 · audit v1.0
Generated May 11, 2026
Researchers can quickly extract LaTeX formulas from academic PDFs for editing, referencing, or inclusion in other documents. This saves time compared to manual retyping and reduces errors.
Publishers digitizing STEM textbooks can use the OCR to extract mathematical expressions from scanned images or PDFs, enabling searchable and editable content. This facilitates conversion to digital formats like ePub or web.
EdTech platforms can integrate formula OCR to capture student-submitted problem images, recognize formulas, and then provide step-by-step solutions or link to relevant resources. This enhances personalized learning.
Financial analysts dealing with reports containing equations (e.g., risk metrics, valuation models) can extract formulas for automated valuation or compliance checks. This streamlines data extraction from hybrid text-equation documents.
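In each of the use cases above, the end result is a set of LaTeX strings taken from the recognized output. The following is a minimal sketch of that last step. It assumes the recognized document is available as Markdown text with inline formulas delimited by `$...$` and display formulas by `$$...$$`. This listing does not confirm that output format, and `extract_formulas` is a hypothetical helper, not part of the skill.

```python
import re

# Assumption: formulas appear in Markdown as $...$ (inline) or $$...$$ (display).
# Display math may span several lines; inline math is assumed to stay on one line.
DISPLAY_RE = re.compile(r"\$\$(.+?)\$\$", re.DOTALL)
INLINE_RE = re.compile(r"(?<![\\$])\$(?!\$)(.+?)(?<![\\$])\$(?!\$)")


def extract_formulas(markdown):
    """Return the LaTeX bodies of display and inline formulas found in the text."""
    display = [body.strip() for body in DISPLAY_RE.findall(markdown)]
    # Blank out display blocks first so their contents are not matched again as inline math.
    remainder = DISPLAY_RE.sub(" ", markdown)
    inline = [body.strip() for body in INLINE_RE.findall(remainder)]
    return {"display": display, "inline": inline}
```

Calling `extract_formulas(text)` returns a dictionary with two lists, `display` and `inline`. The inline pattern is deliberately simple and will also match plain dollar amounts such as "$5 and $10", so real documents may need stricter filtering.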
Offer the formula-ocr capability as a cloud API, charging per extract or subscription. Developers integrate the API into their apps, paying based on usage volume.
Provide a free basic version with limited extractions (e.g., 10 pages/month) and a premium version with unlimited access, batch processing, and a VLM model for higher accuracy.
Bundle the formula OCR as a premium feature within a larger document processing platform (e.g., cloud storage or note-taking app). Charge extra for access or include in higher-tier plans.
💬 Integration Tip
Set the MINERU_TOKEN environment variable for authentication. For quick testing, use the mineru-open-api CLI directly; for programmatic use, wrap the CLI calls in your application.
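One way to wrap the CLI from Python is sketched below. Only the `MINERU_TOKEN` variable and the `mineru-open-api` command name come from the tip. The helper names `build_env` and `run_cli` are hypothetical, and the sketch leaves the CLI's subcommands and flags to the caller, since they are not documented on this page.

```python
import os
import shutil
import subprocess

CLI_NAME = "mineru-open-api"  # command name taken from the tip above


def build_env(token=None):
    """Return a copy of the current environment with MINERU_TOKEN set."""
    env = dict(os.environ)
    if token is not None:
        env["MINERU_TOKEN"] = token
    if not env.get("MINERU_TOKEN"):
        raise RuntimeError("MINERU_TOKEN is not set")
    return env


def run_cli(args, token=None, timeout=300):
    """Run the CLI with caller-supplied arguments and return its standard output."""
    if shutil.which(CLI_NAME) is None:
        raise FileNotFoundError(f"{CLI_NAME} not found on PATH")
    result = subprocess.run(
        [CLI_NAME, *args],       # args come from the caller; no flags are assumed here
        env=build_env(token),
        capture_output=True,
        text=True,
        timeout=timeout,
        check=True,              # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout
```

Pass `run_cli` the same argument list you would type after `mineru-open-api` on the command line, taken from the CLI's own help output. Failing early on a missing token or a missing executable keeps authentication and installation problems separate from recognition errors.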
Scored Jun 20, 2026
Edit PDFs with natural-language instructions using the nano-pdf CLI.
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
Create, inspect, and edit Microsoft Word documents and DOCX files with reliable styles, numbering, tracked changes, tables, sections, and compatibility check...
Create, inspect, and edit Microsoft Excel workbooks and XLSX files with reliable formulas, dates, types, formatting, recalculation, and template preservation...
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Create, inspect, and edit Microsoft PowerPoint presentations and PPTX decks with reliable layouts, templates, placeholders, notes, charts, and visual QA. Use...