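minimax-image-understanding
Uses multimodal large models to understand image content and generate descriptions of its business meaning. Supports multiple models: (1) MiniMax VLM, (2) OpenAI GPT-4V, (3) Claude Vision. Intended for screenshots, charts, photos of documents, and similar images, producing precise text descriptions.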
Install via ClawdBot CLI:
clawdbot install aidescend/minimax-image-understanding
Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list: https://api.minimaxi.com
Uses known external API (expected, informational): api.anthropic.com
Audited Apr 17, 2026 · audit v1.0
Generated Mar 20, 2026
Analyze product images from online stores to generate detailed descriptions for listings, including features, materials, and usage scenarios. This helps improve SEO and customer engagement by providing accurate, context-rich content.
Process medical scans like X-rays or MRIs to generate descriptive reports summarizing findings, such as anomalies or conditions, aiding healthcare professionals in documentation and preliminary analysis for faster decision-making.
Interpret screenshots of stock charts, graphs, or financial reports to extract trends, key data points, and insights, enabling analysts to quickly summarize market conditions or performance metrics for reports.
Analyze diagrams, illustrations, or textbook images to generate explanatory descriptions for learning materials, helping educators create accessible content for students with visual or textual summaries.
Review user-uploaded images on platforms to detect inappropriate content or generate captions for accessibility, enhancing user safety and compliance with community guidelines through automated analysis.
Offer the image understanding capability as a cloud-based API, charging per request or through subscription tiers. This model targets developers and businesses needing scalable, on-demand image analysis without infrastructure management.
Provide customized solutions for large organizations, such as healthcare or finance firms, with dedicated support, integration services, and compliance features. This includes one-time licensing fees and ongoing maintenance contracts.
Offer a free tier with limited usage or basic models, while charging for advanced features like higher accuracy models, batch processing, or priority support. This attracts small users and converts them to paid plans as needs grow.
💬 Integration Tip
Ensure API keys are securely stored as environment variables and test with sample images to verify model compatibility before full deployment.
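The tip above can be sketched in Python as a small pre-deployment check. This is a minimal illustration, not the skill's actual code: the environment variable names (`MINIMAX_API_KEY`, `OPENAI_API_KEY`, `ANTHROPIC_API_KEY`) and the helper functions are assumptions chosen for the example, so check the skill's own configuration for the names it really reads. The sketch only confirms which keys are present and prepares a sample image for a request body; it makes no network calls.

```python
import base64
import os

# Hypothetical variable names, assumed for illustration only.
PROVIDER_KEY_VARS = {
    "minimax": "MINIMAX_API_KEY",
    "openai": "OPENAI_API_KEY",
    "claude": "ANTHROPIC_API_KEY",
}


def available_providers(env=None):
    """Return the providers whose API key is set to a non-empty value."""
    env = os.environ if env is None else env
    return [name for name, var in PROVIDER_KEY_VARS.items() if env.get(var)]


def sniff_media_type(data: bytes) -> str:
    """Guess the media type from magic bytes (PNG and JPEG only)."""
    if data.startswith(b"\x89PNG\r\n\x1a\n"):
        return "image/png"
    if data.startswith(b"\xff\xd8\xff"):
        return "image/jpeg"
    raise ValueError("unsupported or unrecognized image format")


def encode_image(data: bytes) -> str:
    """Base64-encode raw image bytes for embedding in a JSON request body."""
    return base64.b64encode(data).decode("ascii")


if __name__ == "__main__":
    # Placeholder bytes with a PNG signature, not a real image.
    sample = b"\x89PNG\r\n\x1a\n" + b"\x00" * 8
    print("providers with keys set:", available_providers())
    print("sample media type:", sniff_media_type(sample))
    print("encoded length:", len(encode_image(sample)))
```

Reading keys from the environment at call time, rather than hard-coding them, keeps secrets out of source control and lets the same check run unchanged across development and production.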
Scored Apr 19, 2026
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.
Reduce OpenClaw AI costs by 97%. Haiku model routing, free Ollama heartbeats, prompt caching, and budget controls. Go from $1,500/month to $50/month in 5 min...
HTML-first PDF production skill for reports, papers, and structured documents. Must be applied before generating PDF deliverables from HTML.