smart-model-switchingAuto-route tasks to the cheapest Claude model that works correctly. Three-tier progression: Haiku → Sonnet → Opus. Classify before responding. HAIKU (default): factual Q&A, greetings, reminders, status checks, lookups, simple file ops, heartbeats, casual chat, 1-2 sentence tasks. ESCALATE TO SONNET: code >10 lines, analysis, comparisons, planning, reports, multi-step reasoning, tables, long writing >3 paragraphs, summarization, research synthesis, most user conversations. ESCALATE TO OPUS: architecture decisions, complex debugging, multi-file refactoring, strategic planning, nuanced judgment, deep research, critical production decisions. Rule: If a human needs >30 seconds of focused thinking, escalate. If Sonnet struggles with complexity, go to Opus. Save 50-90% on API costs by starting cheap and escalating only when needed.
Install via ClawdBot CLI:
clawdbot install millibus/smart-model-switchingGrade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 1, 2026
A chatbot handling customer inquiries uses Haiku for greetings and simple FAQ lookups, Sonnet for analyzing issues and generating detailed troubleshooting steps, and Opus for complex complaint resolution requiring nuanced judgment.
A coding assistant uses Haiku for quick syntax lookups and file reads, Sonnet for writing functions and debugging standard bugs, and Opus for designing system architecture or refactoring multi-file codebases.
A platform for generating articles uses Haiku for initial topic research and status checks, Sonnet for drafting reports and summarizing sources, and Opus for strategic content planning and deep research synthesis.
An AI tutor uses Haiku for factual Q&A and simple reminders, Sonnet for explaining concepts and creating study plans, and Opus for complex problem-solving and ethical discussions in advanced subjects.
A tool for financial insights uses Haiku for quick data lookups and status updates, Sonnet for generating reports and comparing investment options, and Opus for strategic portfolio decisions and deep market analysis.
Offer the skill as part of a subscription-based AI platform, charging monthly fees based on usage tiers. Revenue comes from cost savings passed to clients, with premium plans for advanced features.
Sell the skill as an API service that developers integrate into their applications, with pricing based on API call volume. Revenue is generated through pay-per-use or tiered pricing models.
Provide consulting services to customize and implement the skill for enterprise clients, optimizing model routing for specific workflows. Revenue comes from one-time project fees and ongoing support contracts.
💬 Integration Tip
Start by integrating Haiku for basic tasks to test cost savings, then gradually add Sonnet and Opus triggers based on task complexity metrics.
Scored Apr 15, 2026
Gemini CLI for one-shot Q&A, summaries, and generation.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
A comprehensive AI model routing system that automatically selects the optimal model for any task. Set up multiple AI providers (Anthropic, OpenAI, Gemini, Moonshot, Z.ai, GLM) with secure API key storage, then route tasks to the best model based on task type, complexity, and cost optimization. Includes interactive setup wizard, task classification, and cost-effective delegation patterns. Use when you need "use X model for this", "switch model", "optimal model", "which model should I use", or to balance quality vs cost across multiple AI providers.
Reduce OpenClaw AI costs by 97%. Haiku model routing, free Ollama heartbeats, prompt caching, and budget controls. Go from $1,500/month to $50/month in 5 min...