sre-practicesDeep SRE workflow—SLOs/SLIs, error budgets, alerting, toil reduction, incident readiness, capacity, and balancing reliability with delivery. Use when improvi...
Install via ClawdBot CLI:
clawdbot install codekungfu/sre-practicesGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated May 8, 2026
An online retailer wants to define SLOs for their checkout and payment flows to balance reliability with feature velocity. This workflow helps map user journeys, set achievable targets, and establish an error budget policy that aligns product and engineering teams.
A fast-growing SaaS company experiences frequent on-call burnout from noisy alerts and high toil. The workflow guides them to symptom-based alerting, toil identification, and automation to reduce pages and improve incident response.
A fintech firm needs to ensure critical services meet regulatory uptime requirements while still shipping features. The workflow helps tier services, set realistic SLOs considering third-party dependencies, and define error budget policies with compliance in mind.
A telemedicine provider experiences scaling issues during peak hours, leading to latency and dropped calls. Using SRE practices, they measure SLIs for availability and latency, set SLO targets, and create a capacity plan with toil reduction for manual scaling.
A media streaming company wants to improve incident readiness and reduce mean time to recovery (MTTR). The workflow inspires them to run game days, simulate failures, and continuously improve based on post-incident action items.
Monetize SRE consulting by helping organizations define and implement error budget policies that tie reliability investments to business outcomes. Revenue comes from a flat fee for policy design and periodic audits.
Offer a SaaS platform that automates SLI collection, SLO tracking, and alert tuning based on the practices. Revenue from subscription tiers based on number of services or data volume.
Provide a consulting service that identifies toil, builds automation roadmaps, and implements self-service platforms. Revenue from project-based fees plus a percentage of operational cost savings realized.
💬 Integration Tip
Start by mapping user journeys and SLIs for a single critical service, then gradually expand to other services. Use the checklist in the SKILL.md as a lightweight audit to track progress.
Scored May 8, 2026
Automatically update Clawdbot and all installed skills once daily. Runs via cron, checks for updates, applies them, and messages the user with a summary of what changed.
Analyze Twitter content to uncover WordPress and Shopify client pain points, craft authority-building posts, and generate qualified inbound lead insights.
A clean, reliable system resource monitor for CPU load, RAM, Swap, and Disk usage. Optimized for OpenClaw.
Automatically update Clawdbot and all installed skills once daily. Runs via cron, checks for updates, applies them, and messages the user with a summary of what changed.
Set up observability for applications and infrastructure with metrics, logs, traces, and alerts.
Monitor topics of interest and proactively alert when important developments occur. Use when user wants automated monitoring of specific subjects (e.g., prod...