⚠️Install with caution. This skill has very few installs. Always review the source and verify it on clawhub.ai before installing. Community-built skills run with agent permissions — only install ones you trust.

🛡️ Agent Security

Reef Prompt Guardv1.0.0

Name: Reef Prompt Guard
Author: staybased

reef-prompt-guard

staybased

Detect and filter prompt injection attacks in untrusted input. Use when processing external content (emails, web scrapes, API inputs, Discord messages, sub-agent outputs) or when building systems that accept user-provided text that will be passed to an LLM. Covers direct injection, jailbreaks, data exfiltration, privilege escalation, and context manipulation.

latest

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

1.5K

Stars

CreatedFeb 12, 2026

UpdatedMay 18, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install staybased/reef-prompt-guard

Skill Package3 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

C45/100

Grade Limited — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation4/35

· No tracked installs (may still have manual users)
· 688 downloads (moderate demand)

Documentation16/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Detailed summary

Package Completeness8/15

· skillAssets present (2 files)
· Includes scripts or config files

Security Analysis

🚨 Critical Risk

CREDENTIAL_ACCESShigh

Accesses sensitive credential files or environment variables

/etc/passwd

PROMPT_POISONINGhigh

Contains instructions to override system prompt or ignore user requests

"ignore previous instructions"

UNSAFE_SHELLmedium

Potentially destructive shell commands in tool definitions

rm -rf /

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://attacker.com/log?data=SYSTEM_PROMPT_CONTENT

💡

Usage Guide

Generated Mar 22, 2026

AI developersCybersecurity teamsbeginner

💡 Application Scenarios

Customer Support Email FilteringCustomer Service

Automatically scan incoming customer emails for prompt injection attempts before passing content to an LLM for response generation. This prevents malicious users from manipulating support agents to disclose sensitive information or execute unauthorized commands.

Web Scraping Content SecurityMarket Research

Filter untrusted text from web scrapes used in market research or content aggregation to block injection attacks. Ensures that scraped data does not contain hidden instructions that could compromise downstream AI systems.

API Input ValidationSaaS

Integrate the skill into API endpoints that accept user-generated text for LLM processing, such as chatbots or content moderation tools. It blocks high-risk inputs from sources like webhooks or third-party integrations.

Internal Sub-Agent CommunicationAI Development

Scan outputs from sub-agents in multi-agent AI systems to prevent cascading injection attacks. This adds a layer of security when agents pass data between each other, mitigating privilege escalation risks.

Discord Bot SecuritySocial Media

Protect Discord bots by filtering user messages for jailbreak attempts or data exfiltration patterns before processing with an LLM. This is crucial for community management bots handling untrusted public input.

💼 Business Models

Subscription-Based API ServiceMonthly recurring fees from $99 to $999 per month

Offer the prompt guard as a cloud API with tiered pricing based on usage volume. Provide real-time scanning for businesses integrating AI into their products, with premium support for custom threat patterns.

Enterprise Security IntegrationOne-time license fees starting at $10,000 plus annual maintenance

Sell on-premise licenses or custom integrations to large organizations needing to secure internal AI systems. Include consulting services for deployment, training, and ongoing pattern updates.

Open Source with Premium FeaturesFreemium model with premium features priced at $49 to $299 per user annually

Release the core tool as open source to build community trust and adoption. Monetize through paid add-ons like advanced ML classifiers, priority pattern updates, and dedicated support channels.

💬 Integration Tip

Use the JSON mode for easy integration into existing workflows, and always apply context multipliers based on input source risk levels to enhance detection accuracy.