image-to-code将图片(含文字、公式、标题)转换为指定代码格式。自动识别标题级别(title1/title2/title3),文字行转为 $word->body("正文=".$F);,公式转为 $word->formula("");,图片标记为 ![image]
Install via ClawdBot CLI:
clawdbot install nidhov01/image-to-codeGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://tesseract-ocr.github.io/Uses known external API (expected, informational)
aip.baidubce.comAudited Apr 16, 2026 · audit v1.0
Generated Mar 22, 2026
Converts scanned academic papers with titles, text, formulas, and images into structured code format for easy integration into LaTeX or markdown editors. Useful for researchers digitizing handwritten notes or printed materials.
Processes screenshots of technical manuals or reports containing hierarchical headings, body text, and mathematical equations into a standardized code output. Helps technical writers automate formatting tasks.
Transforms textbook pages or lecture slides with titles, content, and formulas into code snippets for e-learning platforms or interactive tutorials. Assists educators in quickly repurposing materials.
Converts scanned business reports with sections, bullet points, and charts into a code format for data processing or presentation tools. Streamlines report generation from physical documents.
Extracts text and formulas from screenshots of code documentation or whiteboards, converting them into structured comments or documentation files. Aids developers in maintaining up-to-date docs.
Offer the tool as a cloud-based service with tiered pricing based on usage volume (e.g., pages per month). Includes API access for integration into other platforms, targeting enterprises and institutions.
Provide a free version with basic OCR and limited conversions, and a paid version with advanced features like batch processing, custom templates, and priority support. Targets individual users and small teams.
License the core technology to educational platforms, content management systems, or document processing companies for integration into their products. Includes customization and technical support.
💬 Integration Tip
Integrate with existing OCR pipelines or document management systems by using the provided Python class structure, ensuring proper image preprocessing for optimal accuracy.
Scored Apr 19, 2026
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Capture frames or clips from RTSP/ONVIF cameras.
Create, inspect, process, and optimize image files and visual assets with reliable format choice, resizing, compression, color-profile, metadata, and platfor...
使用内置 image_generate.py 脚本生成图片, 准备清晰具体的 `prompt`。
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.