
Image OCR v0.4.0

Name: Image OCR
Author: mzlzyca

photo-ocr

OCR for photos and images using MinerU. Extract text from photographs, screenshots, camera captures, and image files with high accuracy. Features: image OCR...

latest


Downloads: 542

Created: Apr 1, 2026

Updated: May 11, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install mzlzyca/photo-ocr

https://mineru.net

Skill Package: 2 files

📋 SKILL.md (markdown)


Quality Score

B · 58/100

Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation: 7/35

· 542 downloads (moderate demand)
· No tracked installs (may still have real users via manual install)

Documentation: 20/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Contains usage examples or trigger description
· Detailed summary

Package Completeness: 6/15

· skillAssets present (1 file)

Security Analysis

💙 Low Risk

UNDOCUMENTED_EXTERNAL (low)

Calls external URL not in known-safe list

https://mineru.net

Audited Apr 17, 2026 · audit v1.0


Usage Guide

Generated May 22, 2026

· Small business owners needing receipt digitization
· Developers integrating OCR into applications
· Researchers and students extracting text from screenshots or documents
· Archivists digitizing historical documents
· Photographers documenting text-heavy scenes

beginner

💡 Application Scenarios

Receipt and Invoice Digitization · Finance and Accounting

Automatically extract text from photos of receipts and invoices, converting them into structured data for expense tracking or accounting. Ideal for small business owners and freelancers who need to digitize paper receipts quickly.
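
As a rough illustration of the "structured data" step, the sketch below turns already-extracted receipt text into line items and a total. It assumes the OCR output has one item per line ending in a two-decimal amount; the sample text, the `parse_receipt` helper, and the `TOTAL` label are hypothetical and not part of the skill, and real receipts will need more tolerant parsing.

```python
import re
from decimal import Decimal

# Hypothetical OCR output; text returned by the skill may be laid out differently.
SAMPLE_TEXT = """\
CORNER CAFE
2026-05-11
Latte        4.50
Bagel        3.25
TOTAL        7.75
"""

# A label, some whitespace, then an amount with exactly two decimals at end of line.
LINE_ITEM = re.compile(r"^(?P<label>.+?)\s+(?P<amount>\d+\.\d{2})$")


def parse_receipt(text):
    """Split receipt text into (label, amount) items and a total (assumed layout)."""
    items = []
    total = None
    for line in text.splitlines():
        match = LINE_ITEM.match(line.strip())
        if match is None:
            continue  # header, date, or anything that is not "label amount"
        label = match.group("label").strip()
        amount = Decimal(match.group("amount"))
        if label.upper() == "TOTAL":
            total = amount
        else:
            items.append((label, amount))
    return {"items": items, "total": total}


receipt = parse_receipt(SAMPLE_TEXT)
```

Using `Decimal` rather than `float` keeps the amounts exact, which matters once the values feed into expense tracking or accounting.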

Document Capture from Whiteboards and Signs · Education and Corporate

Use the skill to capture notes from whiteboard photos or text from signs and posters, converting them into editable Markdown or text. Useful for meeting notes, brainstorming sessions, or translating signage.

Screenshot Text Extraction · Technology and Research

Extract text from screenshots of web pages, presentations, or error messages. Developers and researchers can quickly capture code snippets, quotes, or data without manual typing.

Photo OCR for Document Archiving · Archiving and Legal

Digitize photos of paper documents, such as contracts, letters, or forms, for storage or search. Archives and libraries can use this to preserve and index physical documents.

Multilingual Text Extraction from Images · Travel and Hospitality

Extract text from images containing multiple languages, such as multilingual menus or international signs. The skill supports English, Chinese, and other languages, which makes it useful for travel and global business.

💼 Business Models

Free Tier with Premium OCR · Token sales for premium OCR features

Offer basic OCR via flash-extract for free: no token is required, but file size and page count are limited. Charge for premium extract, which offers higher accuracy, VLM mode, and no size limits, and monetize it through token purchases.

API-as-a-Service for Developers · Pay-per-call or monthly subscription fees

Provide the mineru-open-api as a service, charging per API call or subscription for developers integrating OCR into their apps. The CLI tool can be used directly or via API wrappers.

B2B Document Processing Solutions · Enterprise licensing and consultancy fees

License the skill to enterprises for automated document processing, such as invoice scanning or data entry. Offer custom integrations, on-premises deployment, and dedicated support.

💬 Integration Tip

Start with flash-extract for quick testing without a token. For production, authenticate with MINERU_TOKEN and use extract for higher accuracy on complex images.
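
One way to apply this tip in a wrapper script is to choose the subcommand based on whether MINERU_TOKEN is set. The sketch below only builds the argument list and does not run anything. The names mineru-open-api, flash-extract, extract, and MINERU_TOKEN come from this page; passing the image path as a single positional argument is an assumption, and the `build_command` helper is hypothetical, so check the CLI's own help output for the real syntax.

```python
import os


def build_command(image_path, env=None):
    """Return an argv list for the OCR CLI (argument layout is an assumption)."""
    env = os.environ if env is None else env
    # With a token, prefer the higher-accuracy extract; otherwise use the
    # token-free flash-extract.
    subcommand = "extract" if env.get("MINERU_TOKEN") else "flash-extract"
    # Assumption: the image path is passed as one positional argument.
    return ["mineru-open-api", subcommand, image_path]


quick_test = build_command("receipt.png", env={})
production = build_command("receipt.png", env={"MINERU_TOKEN": "example-token"})
```

The resulting list can be handed to `subprocess.run` once the argument layout has been confirmed against the installed CLI.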