evalEvaluate everything the PA agent manages — tasks, skills, PA network health, billing, calendar connections, and memory quality. Use when: owner asks for an e...
Install via ClawdBot CLI:
clawdbot install netanel-abergel/evalGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Accesses sensitive credential files or environment variables
${ANTHROPICSends data to undocumented external endpoint (potential exfiltration)
POST → https://api.monday.com/v2Calls external URL not in known-safe list
https://api.monday.com/v2Uses known external API (expected, informational)
api.anthropic.comGenerated May 8, 2026
An individual uses the Eval skill to self-assess their AI agent's performance, task completion, and memory health on a weekly basis. The report highlights stalled tasks, communication issues, and recommendations for improvement, helping the user optimize their workflow.
A small business owner deploys the Eval skill to monitor the AI agent managing their calendar, CRM, and billing. The evaluation reveals integration failures or billing problems, enabling proactive maintenance and ensuring business continuity.
A freelancer runs the Eval to check the health of their PA network, including tools like monday.com and WhatsApp. The skill's recommendations help prioritize fixes like updating API keys or resolving stalled tasks, enhancing client responsiveness.
A team lead uses the Eval skill to oversee multiple AI agents handling different projects. The aggregated self-performance scores and task audits provide a dashboard-style overview, enabling data-driven delegation and training.
An AI developer integrates the Eval skill into their development workflow to assess agent memory quality and integration health. The detailed memory and integration checks help identify issues like bloated memory files or broken API connections, streamlining debugging.
Offer a premium AI PA service that includes automated weekly Eval reports. Subscribers receive proactive recommendations and performance scores, justifying recurring revenue through improved agent reliability.
Provide a tool for organizations to monitor the health of multiple AI agents deployed across departments. The Eval skill generates combined reports and recommendations, monetized through SaaS licensing.
Offer basic Eval reports for free, with premium features like deep recommendations, benchmarking against peers, and integration health alerts. Upsell to paid tiers as users demand more detailed analytics.
💬 Integration Tip
To integrate, store the Eval script as a custom plugin and call it on-demand or via cron. Ensure the script has read access to necessary files like tasks.md and billing-status.json.
Scored Jun 19, 2026
AI Analysis
The skill performs legitimate system evaluation functions but accesses sensitive credential files (${ANTHROPIC}) and calls external APIs (Monday.com, Anthropic) which could expose credentials if improperly handled. While these API calls appear consistent with the skill's purpose, credential access patterns warrant caution.
Audited Apr 17, 2026 · audit v1.0
PollyReach gives every AI agent a phone number and the ability to get things done over the phone — finding contacts, making calls, and completing tasks. Just...
Helps users discover and install agent skills when they ask questions like "how do I do X", "find a skill for X", "is there a skill that can...", or express interest in extending capabilities. This skill should be used when the user is looking for functionality that might exist as an installable skill.
Ultimate AI agent memory system for Cursor, Claude, ChatGPT & Copilot. WAL protocol + vector search + git-notes + cloud backup. Never lose context again. Vibe-coding ready.
Give your AI agent eyes to see the entire internet. 7500+ GitHub stars. Search and read 14 platforms: Twitter/X, Reddit, YouTube, GitHub, Bilibili, XiaoHongS...
A self-evolution engine for AI agents. Analyzes runtime history to identify improvements and applies protocol-constrained evolution. Communicates with EvoMap...
Infinite organized memory that complements your agent's built-in memory with unlimited categorized storage.