minimax-understand-image使用 MiniMax MCP 进行图像理解和分析。触发条件:(1) 用户要求分析图片、理解图像、描述图片内容 (2) 需要识别图片中的物体、文字、场景 (3) 使用 MiniMax 的 understand_image 功能
Install via ClawdBot CLI:
clawdbot install Thincher/minimax-understand-image使用 MiniMax MCP 服务器进行图像理解和分析。
which uvx
如果不存在,安装 uv:
方法 1: 使用官方安装脚本(推荐)
curl -LsSf https://astral.sh/uv/install.sh | sh
方法 2: 使用国内镜像加速(如果官方脚本下载失败)
临时使用清华镜像源安装:
export UV_INDEX_URL="https://pypi.tuna.tsinghua.edu.cn/simple"
curl -LsSf https://astral.sh/uv/install.sh | sh
或者临时使用阿里云镜像源:
export UV_INDEX_URL="https://mirrors.aliyun.com/pypi/simple/"
curl -LsSf https://astral.sh/uv/install.sh | sh
uvx minimax-coding-plan-mcp --help
执行命令判断是否MCP服务器已安装, 如果安装了跳到步骤 2。
方法 1: 使用默认源安装
uvx install minimax-coding-plan-mcp
方法 2: 使用国内镜像加速(如果默认源下载失败)
临时使用清华镜像源:
export UV_INDEX_URL="https://pypi.tuna.tsinghua.edu.cn/simple"
uvx install minimax-coding-plan-mcp
或者临时使用阿里云镜像源:
export UV_INDEX_URL="https://mirrors.aliyun.com/pypi/simple/"
uvx install minimax-coding-plan-mcp
cat ~/.openclaw/config/minimax.json 2>/dev/null | python3 -c "import json,sys; d=json.load(sys.stdin); print(d.get('api_key', ''))"
如果返回非空的 API Key,跳到步骤 4。
根据返回的判断:
sk-cp- 开头找到匹配的 Key 后,询问用户确认是否使用。
直接询问用户提供 MiniMax API Key。
如果未购买MiniMax,购买地址为: https://platform.minimaxi.com/subscribe/coding-plan?code=GjuAjhGKqQ&source=link
mkdir -p ~/.openclaw/config
cat > ~/.openclaw/config/minimax.json << EOF
{
"api_key": "API密钥",
"output_path": "~/.openclaw/workspace/minimax-output"
}
EOF
将图片放到可访问路径,例如:
~/.openclaw/workspace/images/图片名.jpg使用脚本调用 MCP 服务:
python3 {curDir}/scripts/understand_image.py <图片路径或URL> "<对图片的提问>"
示例:
# 描述图片内容
python3 {curDir}/scripts/understand_image.py ~/image.jpg "详细描述这张图片的内容"
# 使用 URL
python3 {curDir}/scripts/understand_image.py "https://example.com/image.jpg "这张图片展示了什么?"
| 参数 | 说明 | 类型 |
|------|------|------|
| image | 图片路径或 URL | string (必填) |
| prompt | 对图片的提问 | string (必填) |
用户:帮我看看这张图片
AI:检查 uvx → 已安装
检查 MCP 服务器 → 已安装
检查 API Key → 已配置
直接调用脚本 → 返回结果
---
用户:帮我看看这张图片
AI:检查 uvx → 已安装
检查 MCP 服务器 → 未安装
安装 minimax-coding-plan-mcp
检查 API Key → 未配置
从 Gateway providers 找到 Key → 询问用户确认
保存 Key
调用脚本 → 返回结果
脚本位置:{curDir}/scripts/understand_image.py
功能:
MINIMAX_API_KEY 读取 API Key,如果没有则从 ~/.openclaw/config/minimax.json 读取understand_image 工具错误处理:
Generated Mar 1, 2026
Online retailers can use this skill to automatically analyze product images, extracting details like object categories, colors, and text descriptions. This helps in cataloging inventory, generating product descriptions, and improving search functionality for customers.
Social media platforms can integrate this skill to scan uploaded images for inappropriate content, such as violence or explicit material, by identifying objects and scenes. It assists in automating moderation workflows and ensuring community guidelines are followed.
Educational institutions can apply this skill to describe images in textbooks or online resources, providing audio descriptions for visually impaired students. It enhances accessibility by converting visual content into detailed textual summaries.
Real estate agencies can use this skill to analyze property photos, identifying features like room types, furniture, and outdoor spaces. This automates the creation of detailed listings and helps potential buyers understand properties better.
Healthcare providers can leverage this skill for preliminary analysis of medical images, such as X-rays or scans, by identifying anomalies or specific patterns. It supports radiologists in initial screenings and improves diagnostic efficiency.
Offer this skill as a cloud-based service where users pay a monthly or annual fee for API access to image analysis. It can include tiered pricing based on usage volume, such as number of images processed per month.
Provide a free tier with limited image analysis requests per month to attract individual users and small businesses. Upsell to premium plans with higher limits, advanced features like batch processing, and priority support.
Sell custom licenses to large organizations for on-premise or private cloud deployment, including integration support and dedicated maintenance. This model targets industries with high data privacy requirements, such as healthcare or finance.
💬 Integration Tip
Ensure API keys are securely stored and consider using environment variables for production deployments to avoid hardcoding sensitive information.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Research any topic from the last 30 days on Reddit + X + Web, synthesize findings, and write copy-paste-ready prompts. Use when the user wants recent social/web research on a topic, asks "what are people saying about X", or wants to learn current best practices. Requires OPENAI_API_KEY and/or XAI_API_KEY for full Reddit+X access, falls back to web search.
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.