clawd-throttleRoutes LLM requests to the cheapest capable model across 8 providers (Anthropic, Google, OpenAI, DeepSeek, xAI, Moonshot, Mistral, Ollama) and 25+ models. Scores prompts on 8 dimensions in under 1ms, supports three routing modes (eco, standard, gigachad), and logs all decisions for cost tracking.
Install via ClawdBot CLI:
clawdbot install liekzejaws/clawd-throttleSetup Clawd Throttle (API keys + routing mode):
Setup Clawd Throttle (API keys + routing mode)Requires:
Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Accesses sensitive credential files or environment variables
$ANTHROPICContains instructions to override system prompt or ignore user requests
"Your role is"Calls external URL not in known-safe list
https://github.com/liekzejaws/clawd-throttleUses known external API (expected, informational)
api.anthropic.comGenerated Mar 1, 2026
Handles routine inquiries and FAQ responses using simple models like Grok 4.1 Fast in eco mode, reducing costs while maintaining quality. For complex issues, it automatically routes to higher-tier models like Haiku or Sonnet, ensuring accurate resolutions without manual intervention.
Processes large volumes of articles by classifying prompts for simplicity and routing to cost-effective models like Gemini Flash. For in-depth analysis or multi-step reasoning, it escalates to models like Sonnet, optimizing speed and expense across varied content types.
Analyzes code snippets and bug reports using the classifier to detect complexity markers, routing simple checks to cheaper models and intricate logic problems to advanced ones. Supports overrides for specific models when developers need targeted expertise.
Routes student questions based on complexity, using eco mode for basic queries and gigachad mode for advanced topics. Logs decisions to track costs and model performance, enabling scalable, affordable personalized learning support.
Classifies patient inquiries to route simple informational requests to low-cost models and complex medical reasoning to higher-tier models. Ensures privacy by hashing prompts and keeping data local, suitable for sensitive health applications.
Offers tiered pricing based on routing modes (eco, standard, gigachad) and usage volume, with premium features like advanced logging and override capabilities. Targets businesses seeking to optimize LLM costs without sacrificing performance.
Provides custom setup, API key configuration, and mode optimization for enterprises integrating Clawd Throttle into existing workflows. Generates revenue through project-based fees and ongoing support contracts.
Licenses the routing technology to other AI platforms or developers, allowing them to rebrand and offer cost-efficient LLM access. Revenue comes from licensing fees and a share of savings passed to end-users.
💬 Integration Tip
Ensure required API keys are set up first, and use the setup script to configure routing modes based on your cost and performance needs.
Scored Apr 19, 2026
Audited Apr 17, 2026 · audit v1.0
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.
使用 MiniMax MCP 进行图像理解和分析。触发条件:(1) 用户要求分析图片、理解图像、描述图片内容 (2) 需要识别图片中的物体、文字、场景 (3) 使用 MiniMax 的 understand_image 功能