phy-content-safety-guard
Dual-layer AI content guardrail with red-team test methodology
Install via ClawdBot CLI:
clawdbot install PHY041/phy-content-safety-guard
Grade: Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list:
https://generativelanguage.googleapis.com/v1beta/models — uses known external API (expected, informational)
Domain: googleapis.com
Audited Apr 18, 2026 · audit v1.0
Generated Mar 22, 2026
Prevents AI agents from leaking internal policies, making inappropriate brand statements, or providing harmful advice in customer interactions. Ensures all responses align with company values and safety guidelines.
Blocks content that could negatively evaluate student capabilities, provide medical/psychological diagnoses, or contain discriminatory remarks. Maintains encouraging, educational tone while filtering unsafe outputs.
Filters out diagnostic language, harmful self-evaluation statements, and dangerous content while allowing supportive guidance. Crucial for preventing AI from causing psychological harm through inappropriate responses.
Acts as a second layer of defense against violent, sexual, or hateful content that slips past primary AI moderation. Catches failures where primary models are manipulated through prompt-injection attacks.
Prevents leakage of internal information like API keys, system prompts, or proprietary data while blocking harmful content. Essential for maintaining corporate security and brand integrity.
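The use cases above all rely on the same dual-layer pattern: the primary model produces a reply, and a separate judge model (the Gemini endpoint listed in the audit) approves or rejects it before it reaches the user. A minimal sketch of that flow, assuming a hypothetical `GUARD_SYSTEM_PROMPT` wording and the standard Gemini `generateContent` REST shape — the skill's actual prompt, model choice, and fallback text may differ:

```python
import json
import urllib.request

# Hypothetical judge prompt; the skill's real GUARD_SYSTEM_PROMPT may differ.
GUARD_SYSTEM_PROMPT = (
    "You are a safety judge. Reply with exactly SAFE or UNSAFE.\n"
    "Flag content in these abstract categories: medical or psychological "
    "diagnosis, disparaging evaluation of a person, leaked internal data."
)

# Endpoint from the audit note above; model name is an assumption.
GEMINI_ENDPOINT = (
    "https://generativelanguage.googleapis.com/v1beta/models/"
    "gemini-1.5-flash:generateContent"
)

def build_judge_payload(candidate_reply: str) -> dict:
    """Wrap the primary model's reply in a judge request body."""
    return {
        "system_instruction": {"parts": [{"text": GUARD_SYSTEM_PROMPT}]},
        "contents": [{"role": "user", "parts": [{"text": candidate_reply}]}],
    }

def parse_verdict(judge_text: str) -> bool:
    """True when the judge marks the content safe."""
    return judge_text.strip().upper().startswith("SAFE")

def guard(candidate_reply: str, api_key: str, fallback: str) -> str:
    """Second layer: release the reply only if the judge approves it."""
    req = urllib.request.Request(
        f"{GEMINI_ENDPOINT}?key={api_key}",
        data=json.dumps(build_judge_payload(candidate_reply)).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    judge_text = body["candidates"][0]["content"]["parts"][0]["text"]
    return candidate_reply if parse_verdict(judge_text) else fallback
```

The fallback string is what makes this safe to fail closed: any reply the judge does not explicitly mark SAFE is replaced, which is what catches the prompt-injection case where the primary model has been steered off-policy.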
Offer tiered monthly subscriptions based on message volume and customization levels. Enterprise tiers include custom forbidden categories, multiple language fallbacks, and detailed analytics dashboards.
Charge per API call for content evaluation with volume discounts. Include premium features like custom judge models, lower latency guarantees, and industry-specific safety templates.
License the guardrail technology to chatbot platforms and AI agent marketplaces. Provide customization tools for brands to define their safety parameters while maintaining core infrastructure.
💬 Integration Tip
Customize the GUARD_SYSTEM_PROMPT with abstract category descriptions rather than specific forbidden terms to avoid triggering Gemini's own safety filters on benign content.
Scored Apr 19, 2026