save-moneyAuto-detect task complexity for Claude models (Haiku + Sonnet). Route simple tasks to Haiku, escalate complex ones to Sonnet. Save 50%+ on API costs. | 適用 Claude 模型(Haiku + Sonnet),自動偵測任務難度,簡單用 Haiku,複雜升級 Sonnet,輕鬆省下 50% 以上的費用。— RULES: Before responding, classify the task. If it matches an escalation trigger, call sessions_spawn with the full task and model anthropic/claude-sonnet-4-20250514 IMMEDIATELY. Do NOT answer complex tasks yourself. ESCALATE: analysis, comparisons, planning, strategy, code >10 lines, reports, proposals, multi-step reasoning, tables, outlines, charts, long writing >3 paragraphs, summarization, long translation. NEVER on Haiku: comparison tables, code >10 lines, multi-step analysis, reports, complex reasoning. STAY on Haiku: factual Q&A, definitions, greetings, reminders, short lookups, casual chat, 1-2 sentence tasks. When in doubt, escalate. Keep Haiku replies concise.
Install via ClawdBot CLI:
clawdbot install peterokase42/save-moneyRun on Haiku by default. Only spawn Sonnet when the task actually needs it. Save 50%+ on API costs.
Before generating ANY response, you MUST:
sessions_spawn() IMMEDIATELY. Do NOT attempt to answer yourself.When in doubt, escalate. A wrong escalation only costs a few cents. A bad answer on Haiku wastes the user's time.
If a human would need more than 30 seconds of focused thinking, escalate.
>
如果一個人需要超過 30 秒的專注思考才能完成,就升級模型。
If the task matches ANY of the following, do NOT answer yourself.
Instead, IMMEDIATELY call:
sessions_spawn(
message: "<the full task description>",
model: "anthropic/claude-sonnet-4-20250514",
label: "<short task label>"
)
| Language | Escalate — real examples |
|----------|--------------------------|
| English | "Can you analyze this for me?", "Write me a report on...", "Help me plan...", "What are the pros and cons?", "Build a script that...", "Compare A vs B", "Step by step, how do I...", "Draft a proposal for..." |
| 繁體中文 | "欸幫我看一下這個報告", "幫我想一下怎麼回客戶", "這兩個方案哪個比較好", "寫一封信給老闆", "幫我整理一下這份資料", "我該怎麼處理這個問題", "可以幫我寫一個程式嗎", "幫我規劃一下行程", "有什麼辦法可以改善", "這個東西要怎麼設計比較好" |
| 日本語 | "これを分析してもらえますか", "レポートを書いてください", "計画を立ててほしい", "AとBを比較して", "コードを書いてほしい", "この資料をまとめて", "提案書を作って", "どうすればいいか考えて" |
| 한국어 | "이거 분석해줘", "보고서 작성해줘", "계획 세워줘", "A랑 B 비교해줘", "코드 짜줘", "이 자료 정리해줘", "제안서 만들어줘", "어떻게 하면 좋을까?" |
| Deutsch | "Kannst du das analysieren?", "Schreib mir einen Bericht", "Hilf mir das zu planen", "Vergleich A mit B", "Schreib ein Skript für...", "Fass das zusammen", "Wie soll ich das lösen?", "Erstell einen Entwurf" |
If you catch yourself writing a long response for a complex task, STOP and call sessions_spawn instead.
| Language | Stay — real examples |
|----------|----------------------|
| English | "What's the weather?", "Remind me at 3pm", "What does OKR mean?", "Translate: thank you", "Hey what's up" |
| 繁體中文 | "今天天氣怎樣", "幫我記一下明天要開會", "這個字什麼意思", "現在幾點", "嗨", "謝謝", "OK", "查一下匯率", "翻譯一下 thank you" |
| 日本語 | "天気は?", "意味を教えて", "これ何?", "おはよう", "リマインドして", "ありがとう" |
| 한국어 | "날씨 어때?", "뜻이 뭐야?", "이게 뭐야?", "안녕", "알림 설정해줘", "고마워" |
| Deutsch | "Wie ist das Wetter?", "Was bedeutet das?", "Was ist das?", "Hallo", "Erinner mich um 3", "Danke" |
When on Haiku, keep replies concise. Fewer output tokens = lower cost.
If a conversation was escalated to Sonnet but the follow-up is simple, switch back to Haiku.
Don't stay on the expensive model just because the conversation started there.
Return the result directly. Do NOT mention the model switch unless the user asks.
This skill is written for Claude (Haiku + Sonnet). Swap model names for other providers:
| Role | Claude | OpenAI | Google |
|------|--------|--------|--------|
| Cheap (default) | claude-3-5-haiku | gpt-4o-mini | gemini-flash |
| Strong (escalate) | claude-sonnet-4 | gpt-4o | gemini-pro |
The Clawdbot skill system only injects the frontmatter description field
into the system prompt — the body of SKILL.md is not automatically
included. The model may optionally read the full file, but it is not
guaranteed. Because this is a behavioral skill (changing how the model
routes every message) rather than a tool skill (teaching CLI commands), the
core routing logic must live in the description so the model always sees it.
The body above serves as extended documentation: detailed trigger lists,
multilingual examples, and usage tips that the model can reference if it
reads the file.
TL;DR: description = what the model always sees. body = reference docs.
小安 Ann Agent — Taiwan 台灣
Building skills and local MCP services for all AI agents, everywhere.
為所有 AI Agent 打造技能與在地 MCP 服務,不限平台。
Generated Mar 1, 2026
Handles routine customer inquiries about product features, order status, and basic troubleshooting on Haiku. Automatically escalates complex technical issues, complaint analysis, or policy explanations to Sonnet for detailed resolution.
Uses Haiku for quick research, fact-checking, and simple content ideas. Escalates to Sonnet for drafting long-form articles, creating structured outlines, or developing marketing copy with brand guidelines.
Provides quick code explanations, syntax help, and simple debugging on Haiku. Automatically escalates complex algorithm design, multi-file refactoring, or architecture planning tasks to Sonnet for comprehensive solutions.
Handles basic data queries, definitions, and simple calculations on Haiku. Escalates financial analysis, competitive comparisons, strategic planning, and detailed report generation to Sonnet for thorough execution.
Manages greetings, simple translations, and basic Q&A in multiple languages using Haiku. Escalates complex document translation, cultural adaptation of content, or detailed cross-language explanations to Sonnet.
Offers a subscription-based service where businesses pay a monthly fee to integrate this skill into their AI workflows. The service guarantees reduced Claude API costs through intelligent model routing, with savings passed partially to customers.
Packages this skill with analytics dashboards and custom escalation rules for large organizations. Provides detailed cost reports, usage analytics, and team-specific optimization recommendations as a premium enterprise solution.
Distributes this skill through AI agent marketplaces with a freemium model. Offers basic functionality for free while charging for advanced features like custom trigger rules, priority support, and integration with additional AI models.
💬 Integration Tip
Start by implementing the escalation triggers for your most common complex tasks, then gradually refine based on actual usage patterns to maximize savings.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Research any topic from the last 30 days on Reddit + X + Web, synthesize findings, and write copy-paste-ready prompts. Use when the user wants recent social/web research on a topic, asks "what are people saying about X", or wants to learn current best practices. Requires OPENAI_API_KEY and/or XAI_API_KEY for full Reddit+X access, falls back to web search.
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.