tesseract-ocr
Extract text from images using the Tesseract OCR engine directly via command line. Supports multiple languages including Chinese, English, and more. Use this...
Install via ClawdBot CLI:
clawdbot install WhaleFell/tesseract-ocr
Extract text content from images using the Tesseract engine directly via command line.
Install Tesseract OCR system package:
# Ubuntu/Debian:
sudo apt-get install tesseract-ocr tesseract-ocr-chi-sim
# macOS:
brew install tesseract tesseract-lang
# Use default language (English)
tesseract /path/to/image.png stdout
# Specify language (Chinese + English)
tesseract /path/to/image.png stdout -l chi_sim+eng
# Save to file (Tesseract appends .txt to the output base, so this creates output.txt)
tesseract /path/to/image.png output -l chi_sim+eng
# Multiple languages
tesseract /path/to/image.png stdout -l chi_sim+eng+jpn
| Language | Code |
|----------|------|
| Simplified Chinese | chi_sim |
| Traditional Chinese | chi_tra |
| English | eng |
| Japanese | jpn |
| Korean | kor |
| Chinese + English | chi_sim+eng |
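The combined entry in the table is simply individual codes joined with `+`. As a minimal sketch, a hypothetical helper named `join_langs` (not part of Tesseract) could build the value passed to `-l` from a list of codes:

```shell
# join_langs is a hypothetical helper, not a Tesseract command.
# It joins its arguments with "+" to form a value for the -l option.
join_langs() {
    # "$*" joins arguments using the first character of IFS;
    # the subshell keeps the IFS change from leaking out.
    ( IFS='+'; printf '%s\n' "$*" )
}
```

For example, `tesseract image.png stdout -l "$(join_langs chi_sim eng)"` would be equivalent to passing `-l chi_sim+eng`.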
# OCR with Chinese support
tesseract image.jpg stdout -l chi_sim
# OCR with mixed Chinese and English
tesseract image.png stdout -l chi_sim+eng
# Save to file instead of stdout
tesseract document.png result -l chi_sim+eng
# Creates result.txt
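Because Tesseract appends `.txt` to the output base, a batch run can derive each base name from the image path. The following is a rough sketch, assuming every image file name has an extension; `out_base` and `ocr_all` are hypothetical helper names introduced here for illustration:

```shell
# out_base and ocr_all are hypothetical helpers, not part of Tesseract.

out_base() {
    # Strip the final extension: scans/page1.png -> scans/page1
    printf '%s\n' "${1%.*}"
}

ocr_all() {
    # Usage: ocr_all LANGS image...
    # Writes <image-without-extension>.txt next to each image.
    langs=$1
    shift
    for img in "$@"; do
        tesseract "$img" "$(out_base "$img")" -l "$langs"
    done
}
```

Calling `ocr_all chi_sim+eng scans/*.png` would then write one `.txt` file beside each image in `scans/`.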
Generated Mar 1, 2026
Convert scanned historical documents, such as old books or manuscripts, into searchable digital text. This enables libraries and museums to preserve and index content efficiently, facilitating research and public access without manual transcription.
Extract text from receipts or invoices captured via mobile apps or scanners for expense tracking and accounting. Small businesses can automate data entry, reducing errors and saving time on manual input for financial records.
Recognize text in images containing multiple languages, such as signs or product labels, to support translation services. This aids travelers or global companies in quickly understanding foreign text without relying on human interpreters.
Convert text from images, like menus or printed materials, into audio or braille output through assistive technologies. This enhances accessibility in public spaces, allowing visually impaired individuals to access information independently.
Extract handwritten or printed text from forms, such as surveys or applications, for data analysis. Organizations can streamline workflows by automating the collection and digitization of form responses, improving efficiency.
Offer a basic free tier for limited OCR requests and charge for higher volumes or advanced features like multi-language support. This attracts small developers and scales with enterprise clients needing bulk processing, generating revenue through subscription plans.
License the OCR skill as a component for other software products, such as document management systems or mobile apps. Businesses pay a one-time or recurring fee to embed the functionality, creating a steady income stream from B2B partnerships.
Develop tailored OCR packages for specific sectors like healthcare or legal, offering specialized language packs and support. Charge for customization, training, and maintenance, targeting organizations with unique compliance or accuracy needs.
Integration Tip
Ensure Tesseract and required language packs are installed on the system before use; test with clear, high-quality images to maximize accuracy.
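One way to act on this tip is a small preflight check. The sketch below assumes the installed languages are supplied on standard input, one per line, which is the shape of `tesseract --list-langs` output; `missing_langs` is a hypothetical helper name, not a Tesseract feature:

```shell
# missing_langs is a hypothetical helper, not part of Tesseract.
# It reads installed language codes from stdin (one per line) and
# prints every requested code that is not in that list.
missing_langs() {
    installed=$(cat)
    for code in "$@"; do
        # -x matches whole lines only, -F treats the code as a literal string,
        # so the header line printed by --list-langs is never a match.
        if ! printf '%s\n' "$installed" | grep -qxF -- "$code"; then
            printf '%s\n' "$code"
        fi
    done
}

# Possible usage (requires Tesseract to be installed):
#   command -v tesseract >/dev/null || echo "tesseract not found"
#   tesseract --list-langs 2>&1 | missing_langs chi_sim eng
```

Empty output means every requested language pack is present. Any printed code names a pack that still needs installing.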