clawrouter
Smart LLM router: save 67% on inference costs. Routes every request to the cheapest capable model across 41 models from OpenAI, Anthropic, Google, DeepSeek,...
Install via ClawdBot CLI:
clawdbot install 1bcMax/clawrouter
Smart LLM router that saves 78% on inference costs by routing each request to the cheapest model that can handle it. 30+ models across 5 providers, all through one wallet.
openclaw plugins install @blockrun/clawrouter
# Enable smart routing (auto-picks cheapest model per request)
openclaw models set blockrun/auto
# Or pin a specific model
openclaw models set openai/gpt-4o
ClawRouter classifies each request into one of four tiers.
Rules handle ~80% of requests in <1ms. Only ambiguous queries hit the LLM classifier (~$0.00003 per classification).
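The rules-first flow described above can be sketched roughly as follows. This is a minimal illustration, not ClawRouter's actual implementation: the regex patterns, the length cutoff, the `NOT_SIMPLE` placeholder tier, and the `llm_classifier` callback are all assumptions made for the example. Only the `SIMPLE` tier name, the ~80% rule hit rate, and the ~$0.00003 classification cost come from the text.

```python
import re
from typing import Callable, Optional, Tuple

# Hypothetical rule patterns; the real rules are not shown in this document.
SIMPLE_PATTERN = re.compile(r"^(what is|who is|define|translate)\b", re.IGNORECASE)
HARD_PATTERN = re.compile(r"\b(prove|step by step|debug|refactor)\b", re.IGNORECASE)

Decision = Tuple[str, str]  # (tier, method), where method is "rules" or "llm"


def classify_by_rules(prompt: str) -> Optional[Decision]:
    """Return a decision when exactly one rule fires, else None (ambiguous)."""
    simple = bool(SIMPLE_PATTERN.search(prompt))
    hard = bool(HARD_PATTERN.search(prompt))
    if simple and not hard and len(prompt) < 200:
        return ("SIMPLE", "rules")
    if hard and not simple:
        return ("NOT_SIMPLE", "rules")  # placeholder name for the higher tiers
    return None


def route(prompt: str, llm_classifier: Callable[[str], str]) -> Decision:
    """Try the cheap local rules first; call the LLM classifier only on ambiguity."""
    decision = classify_by_rules(prompt)
    if decision is not None:
        return decision
    return (llm_classifier(prompt), "llm")


def expected_classifier_cost(rule_hit_rate: float, llm_cost: float) -> float:
    """Average classification cost per request when only rule misses pay for the LLM."""
    return (1.0 - rule_hit_rate) * llm_cost
```

With the figures quoted above, `expected_classifier_cost(0.80, 0.00003)` comes to about $0.000006 per request on average, since only the roughly 20% of requests that the rules cannot settle pay for an LLM classification.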
30+ models including: gpt-5.2, gpt-4o, gpt-4o-mini, o3, o4-mini, claude-opus-4.5, claude-sonnet-4, claude-haiku-4.5, gemini-2.5-pro, gemini-2.5-flash, deepseek-chat, deepseek-reasoner, grok-3, grok-3-mini.
[ClawRouter] google/gemini-2.5-flash (SIMPLE, rules, confidence=0.92)
Cost: $0.0025 | Baseline: $0.308 | Saved: 99.2%
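The "Saved" figure in the log above is consistent with the usual relative-savings formula, (baseline - cost) / baseline. The sketch below checks that arithmetic against the sample line. The regex and function names are hypothetical and assume the log format shown here; they are not part of any ClawRouter API.

```python
import re

# Assumes the "Cost: $X | Baseline: $Y | Saved: Z%" layout from the sample log.
COST_LINE = re.compile(r"Cost: \$([\d.]+) \| Baseline: \$([\d.]+)")


def savings_percent(cost: float, baseline: float) -> float:
    """Relative savings versus the baseline model, as a percentage."""
    return (baseline - cost) / baseline * 100.0


def parse_savings(line: str) -> float:
    """Extract cost and baseline from a log line and recompute the savings."""
    match = COST_LINE.search(line)
    if match is None:
        raise ValueError("line does not match the expected cost format")
    cost, baseline = float(match.group(1)), float(match.group(2))
    return round(savings_percent(cost, baseline), 1)


print(parse_savings("Cost: $0.0025 | Baseline: $0.308 | Saved: 99.2%"))  # 99.2
```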
Generated Mar 1, 2026
A business deploys a chatbot to handle customer inquiries, using ClawRouter to route simple questions like order status to low-cost models and complex troubleshooting to high-quality models, reducing operational costs while maintaining service quality.
An online platform uses ClawRouter to classify and moderate user-generated content, routing basic spam detection to cheap models and nuanced hate speech analysis to more capable models, ensuring efficient and accurate moderation at scale.
An edtech company integrates ClawRouter into a tutoring app to route factual queries and translations to cost-effective models, while directing complex math problems and reasoning tasks to specialized models, optimizing learning support costs.
A financial firm employs ClawRouter to process reports, routing simple data extraction tasks to low-tier models and complex multi-step analysis to high-performance models, cutting down on AI inference expenses for routine operations.
A healthcare provider uses ClawRouter in a symptom assessment tool, routing basic informational queries to inexpensive models and intricate medical reasoning to advanced models, balancing cost-efficiency with patient safety.
Offer ClawRouter as a cloud-based service with tiered pricing based on usage volume, targeting businesses that need cost-effective AI routing without infrastructure management, generating recurring revenue from monthly or annual subscriptions.
Provide ClawRouter via an API where customers pay per request routed, appealing to startups and developers with variable AI workloads, with revenue scaling directly with usage and potential volume discounts.
Sell on-premise or custom deployments of ClawRouter to large enterprises in regulated industries, offering dedicated support and customization, with revenue from one-time licenses and ongoing maintenance contracts.
💬 Integration Tip
Start by enabling auto-routing with 'openclaw models set blockrun/auto' to quickly test cost savings, then fine-tune with specific model pins based on performance needs.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Research any topic from the last 30 days on Reddit + X + Web, synthesize findings, and write copy-paste-ready prompts. Use when the user wants recent social/web research on a topic, asks "what are people saying about X", or wants to learn current best practices. Requires OPENAI_API_KEY and/or XAI_API_KEY for full Reddit+X access, falls back to web search.
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.