⛔ This skill has been removed from clawhub.ai. It is no longer available for installation and may not function correctly. The information shown here is preserved for reference only.

📈 Analytics & BI

LLM Evaluator Pro v1.0.0

Name: LLM Evaluator Pro
Author: aiwithabidi

llm-evaluator-pro


LLM-as-a-Judge evaluator via Langfuse. Scores traces on relevance, accuracy, hallucination, and helpfulness using GPT-5-nano as judge. Supports single trace...
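The skill's own prompt and score format are not shown on this page, so the following is only a minimal sketch of the LLM-as-a-judge pattern the summary describes. The function names, the 0.0 to 1.0 scale, the JSON reply shape, and the convention that a higher score is better on every criterion are all assumptions made for illustration. The sketch builds a judge prompt and validates the reply; sending the prompt to a judge model and logging scores to Langfuse are left out.

```python
import json

# Criteria taken from the summary above; the scale and reply format are assumptions.
CRITERIA = ("relevance", "accuracy", "hallucination", "helpfulness")


def build_judge_prompt(question: str, answer: str) -> str:
    """Build a prompt asking a judge model for one score per criterion (hypothetical format)."""
    keys = ", ".join(f'"{name}"' for name in CRITERIA)
    return (
        "You are an impartial evaluator. Score the answer to the question "
        "on each criterion from 0.0 (worst) to 1.0 (best).\n"
        f"Reply with only a JSON object with the keys {keys}.\n\n"
        f"Question: {question}\n"
        f"Answer: {answer}\n"
    )


def parse_judge_reply(reply: str) -> dict:
    """Parse the judge's JSON reply and check every criterion is present and in range."""
    data = json.loads(reply)
    scores = {}
    for name in CRITERIA:
        value = float(data[name])  # raises KeyError if the judge omitted a criterion
        if not 0.0 <= value <= 1.0:
            raise ValueError(f"{name} score out of range: {value}")
        scores[name] = value
    return scores
```

Validating the reply before storing it matters because a judge model can return malformed JSON or out-of-range values, and an unchecked score would silently skew any aggregate built from it.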

evaluation, latest, quality


Installs (all time): none tracked

Installs (current): none tracked

Downloads: 538

Stars: 1

Created: Feb 15, 2026

Updated: May 10, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install aiwithabidi/llm-evaluator-pro

Skill Package: 2 files

📋 SKILL.md (markdown)


Quality Score

B (56/100)

Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation: 5/35

· No tracked installs (may still have manual users)
· 538 downloads (moderate demand)
· 1 star

Documentation: 18/25

· SKILL.md present
· Moderate documentation (≥1500 chars)
· Contains usage examples or trigger description
· Detailed summary

Package Completeness: 8/15

Security Analysis

💙 Low Risk

UNDOCUMENTED_EXTERNAL (low)

Calls external URL not in known-safe list

https://www.linkedin.com/in/mohammad-ali-abidi

Audited Apr 16, 2026 · audit v1.0


Usage Guide

Generated Mar 22, 2026

Audience: AI developers, quality assurance teams · Level: intermediate

💡 Application Scenarios

Content Moderation for Customer Support (Customer Service)

Evaluate AI-generated responses in customer service chatbots for relevance and accuracy, ensuring helpful and factual interactions. This helps reduce misinformation and improve user satisfaction by scoring hallucination and helpfulness metrics.

Quality Assurance in Legal Research (Legal Services)

Score AI-assisted legal document summaries for factual correctness and relevance to queries, detecting hallucinations to maintain high standards. This supports law firms in verifying AI outputs before use in case preparation.

Educational Content Evaluation (Education Technology)

Assess AI-generated educational materials for accuracy and helpfulness, ensuring they are relevant to curriculum needs. This aids e-learning platforms in maintaining quality and reducing errors in automated content creation.

Healthcare Information Verification (Healthcare)

Evaluate AI responses in medical chatbots for accuracy and hallucination detection, ensuring patient safety and reliable information. This is critical for healthcare providers using AI to assist with preliminary diagnoses or advice.

E-commerce Product Recommendation Scoring (E-commerce)

Score AI-generated product descriptions and recommendations for relevance and helpfulness, improving customer experience. This helps online retailers optimize their AI systems to drive sales and reduce returns.

💼 Business Models

SaaS Subscription: Monthly or annual subscription fees

Offer the evaluator as a cloud-based service with tiered pricing based on usage volume, such as number of traces scored per month. This provides recurring revenue and scalability for businesses integrating AI quality checks.

Consulting and Integration: One-time project fees and ongoing support contracts

Provide custom setup and integration services for enterprises adopting the evaluator, including training and support. This generates project-based revenue and long-term partnerships with clients needing specialized AI evaluation.

White-Label Solution: Licensing fees and royalties

License the evaluator technology to other AI platforms or agencies for rebranding and use in their own products. This creates revenue through licensing fees and expands market reach without direct customer management.

💬 Integration Tip

Ensure environment variables for OpenRouter and Langfuse are securely configured before running scripts, and test with sample cases to verify setup.
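One way to act on this tip is a small pre-flight check that reports which credentials are missing before any script runs. This is a sketch under assumptions: the exact variable names the skill expects are not listed on this page. `LANGFUSE_PUBLIC_KEY` and `LANGFUSE_SECRET_KEY` are the names the Langfuse SDK reads, while `OPENROUTER_API_KEY` is a common convention that may not match what the skill uses. The helper `missing_env_vars` is a hypothetical name.

```python
import os

# Assumed names: the LANGFUSE_* variables are the ones the Langfuse SDK reads;
# OPENROUTER_API_KEY is a common convention, not confirmed for this skill.
REQUIRED_VARS = ("OPENROUTER_API_KEY", "LANGFUSE_PUBLIC_KEY", "LANGFUSE_SECRET_KEY")


def missing_env_vars(env=None, required=REQUIRED_VARS):
    """Return the required variable names that are unset or blank."""
    env = os.environ if env is None else env
    return [name for name in required if not env.get(name, "").strip()]


# Example with an explicit mapping, so the check can be tried without real secrets.
example_env = {"OPENROUTER_API_KEY": "placeholder", "LANGFUSE_PUBLIC_KEY": "placeholder"}
print(missing_env_vars(example_env))  # reports LANGFUSE_SECRET_KEY as missing
```

Calling `missing_env_vars()` with no arguments checks the real process environment. Reading secrets from the environment rather than from files in the skill directory keeps them out of version control.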