ppt-ocrOCR for PowerPoint (.ppt, .pptx) presentations with scanned or image-embedded slides. Uses MinerU to extract text from image-based presentation content. Feat...
Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://mineru.netAudited Apr 17, 2026 · audit v1.0
Generated May 11, 2026
Convert old PowerPoint files with scanned or image-based slides into editable Markdown, enabling search and reuse of archival content. Useful for corporate archives and libraries needing to digitize historical presentations.
Extract text from academic conference slides that are embedded as images, facilitating research summarization and citation. Researchers can quickly convert complex visual layouts into readable text.
OCR presentation files for compliance audits where slides contain policy details or meeting minutes as images. Enables automated text extraction for regulatory review.
Extract text from image-heavy training decks to create searchable online course materials or study guides. Enhances accessibility and learning management system integration.
Process slides in various languages by specifying language hints, enabling translation workflows. Useful for multinational companies localizing presentation content.
Offer OCR extraction as a paid feature via API, charging per slide or per presentation. Integrate with cloud storage platforms like Google Drive or SharePoint for automated processing.
Deploy on-premise or cloud-based solution for enterprises needing bulk extraction with security. Provide batch processing and integration with existing ECM systems.
Offer basic text extraction for free (e.g., 10 slides per day) while charging for unlimited usage, advanced VLM mode, or priority processing. Attract users with free tier and upsell.
💬 Integration Tip
Set the environment variable MINERU_TOKEN and use mineru-open-api extract with --ocr flag; for batch processing, iterate over files in a script.
Scored Jun 19, 2026
Edit PDFs with natural-language instructions using the nano-pdf CLI.
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
Create, inspect, and edit Microsoft Word documents and DOCX files with reliable styles, numbering, tracked changes, tables, sections, and compatibility check...
Create, inspect, and edit Microsoft Excel workbooks and XLSX files with reliable formulas, dates, types, formatting, recalculation, and template preservation...
Convert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Create, inspect, and edit Microsoft PowerPoint presentations and PPTX decks with reliable layouts, templates, placeholders, notes, charts, and visual QA. Use...