image-ocr
Extract text from images using Tesseract OCR
Install via ClawdBot CLI:
clawdbot install Xejrax/image-ocr
Extract text from images using Tesseract OCR. Supports multiple languages and image formats including PNG, JPEG, TIFF, and BMP.
# Extract text from an image (default: English)
image-ocr "screenshot.png"
# Extract text with a specific language
image-ocr "document.jpg" --lang eng
Requires Tesseract to be installed. On Fedora and other dnf-based systems:
sudo dnf install tesseract
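The listing does not document how the skill calls Tesseract internally. As a rough sketch of what an equivalent direct call might look like, the Python below builds a `tesseract` command line and runs it. The helper names `build_tesseract_command` and `extract_text`, and the suffix check, are hypothetical and not part of the skill. The only external interface used is the standard Tesseract CLI form `tesseract <image> stdout -l <lang>`.

```python
import shutil
import subprocess
from pathlib import Path

# Formats named in the skill description (PNG, JPEG, TIFF, BMP).
SUPPORTED_SUFFIXES = {".png", ".jpg", ".jpeg", ".tif", ".tiff", ".bmp"}


def build_tesseract_command(image_path: str, lang: str = "eng") -> list[str]:
    """Build the argument list for a direct Tesseract CLI call (hypothetical helper)."""
    suffix = Path(image_path).suffix.lower()
    if suffix not in SUPPORTED_SUFFIXES:
        raise ValueError(f"unsupported image format: {suffix or '(none)'}")
    # Passing "stdout" as the output base makes Tesseract print the text
    # instead of writing it to a file.
    return ["tesseract", image_path, "stdout", "-l", lang]


def extract_text(image_path: str, lang: str = "eng") -> str:
    """Run Tesseract on one image and return the recognized text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract not found on PATH; install it first")
    result = subprocess.run(
        build_tesseract_command(image_path, lang),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if Tesseract exits with an error
    )
    return result.stdout
```

Calling `extract_text("screenshot.png")` would roughly correspond to the default English invocation shown above, assuming the matching Tesseract language data is installed.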
Generated Mar 1, 2026
Convert scanned historical documents or printed records into searchable digital text. This enables easy indexing and retrieval for libraries, museums, or government archives, preserving content while enhancing accessibility.
Automate the extraction of text from receipt images to streamline expense reporting. Businesses can integrate this into mobile apps or software to reduce manual data entry and improve accuracy in accounting workflows.
Use OCR to read license plates from surveillance or traffic camera images. This supports security monitoring, parking management, or law enforcement by automating vehicle identification and logging.
Extract patient information from scanned medical forms or prescription labels. Healthcare providers can use this to digitize records, reduce errors, and speed up administrative processes in clinics or hospitals.
Capture text from product label images to facilitate translation or content localization. E-commerce platforms can automate this to list international products, improving catalog management and customer reach.
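For the receipt scenario above, OCR output is plain text, so any structured field still has to be parsed out afterwards. The sketch below shows one possible way to pull a total amount from such text with a regular expression. The function `find_receipt_total` and the pattern are illustrative assumptions, not part of the skill, and real receipts vary enough that the pattern would need tuning.

```python
import re
from decimal import Decimal
from typing import Optional

# Matches lines such as "TOTAL 23.45", "Total: $1,204.00" or "Amount due 7.10".
# Anchoring at the line start keeps "Subtotal ..." lines from matching.
TOTAL_PATTERN = re.compile(
    r"^\s*(?:grand\s+)?(?:total|amount\s+due)\b[^\d\n]*"
    r"(\d{1,3}(?:,\d{3})*\.\d{2}|\d+\.\d{2})\s*$",
    re.IGNORECASE | re.MULTILINE,
)


def find_receipt_total(ocr_text: str) -> Optional[Decimal]:
    """Return the last total-like amount in the OCR text, or None if absent."""
    matches = TOTAL_PATTERN.findall(ocr_text)
    if not matches:
        return None
    # Receipts usually print the final total last; strip thousands separators.
    return Decimal(matches[-1].replace(",", ""))


sample = "CORNER CAFE\nCoffee 3.50\nBagel 2.25\nSubtotal 5.75\nTax 0.46\nTOTAL $6.21\nThank you!\n"
print(find_receipt_total(sample))  # 6.21
```

`Decimal` is used rather than `float` so that currency amounts are not subject to binary rounding.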
Offer an online OCR service via a web API, charging users based on usage tiers or monthly subscriptions. This model targets developers and businesses needing scalable text extraction without managing infrastructure.
Develop a mobile app that provides basic OCR for free, with premium features like batch processing, advanced language support, or ad removal for a one-time purchase or subscription. This appeals to individual users and small businesses.
License the OCR skill as a component for integration into larger enterprise systems, such as document management or workflow automation software. Charge per installation or based on the number of users or transactions.
💬 Integration Tip
Ensure Tesseract is installed on the system and test with various image qualities; preprocess images (e.g., enhance contrast) to improve OCR accuracy in noisy environments.
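To make the preprocessing idea concrete, here is a minimal sketch of contrast enhancement followed by binarization. It works on a grayscale image represented as nested lists of 0-255 values so it runs with no dependencies. In practice an imaging library would normally do this, and the function names `stretch_contrast` and `binarize` are hypothetical.

```python
from typing import List

Gray = List[List[int]]  # rows of 0-255 grayscale values


def stretch_contrast(pixels: Gray) -> Gray:
    """Linearly rescale a non-empty image so its darkest pixel is 0 and brightest is 255."""
    flat = [v for row in pixels for v in row]
    lo, hi = min(flat), max(flat)
    if hi == lo:
        # A flat image has no contrast to stretch; return a copy unchanged.
        return [row[:] for row in pixels]
    scale = 255 / (hi - lo)
    return [[round((v - lo) * scale) for v in row] for row in pixels]


def binarize(pixels: Gray, threshold: int = 128) -> Gray:
    """Map each pixel to pure black (0) or white (255) using a fixed threshold."""
    return [[255 if v >= threshold else 0 for v in row] for row in pixels]


faded = [[100, 110], [140, 150]]      # low-contrast scan: values span only 100-150
stretched = stretch_contrast(faded)   # [[0, 51], [204, 255]]
print(binarize(stretched))            # [[0, 0], [255, 255]]
```

A fixed threshold is the simplest choice; unevenly lit images often need an adaptive threshold instead.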
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Capture frames or clips from RTSP/ONVIF cameras.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
Generate images using the built-in image_generate.py script; prepare a clear, specific `prompt`.
AI image generation powered by CellCog. Create images, edit photos, consistent characters, product photography, reference-based images, sets of images, style transfer. Professional image creation with AI.