benchmark-store当需要初始化基准数据库、对比 skill 评分与历史基线、查看 Pareto front 是否有维度回退、或查阅质量分级标准时使用。不用于给候选打分(用 improvement-discriminator)或自动改进(用 improvement-learner)。
Install via ClawdBot CLI:
clawdbot install lanyasheng/benchmark-storeGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Accesses sensitive credential files or environment variables
/etc/passwdPotentially destructive shell commands in tool definitions
exec(Calls external URL not in known-safe list
http://evil.comAI Analysis
The skill definition describes a benchmark storage and comparison system with no actual code execution, credential access, or data exfiltration. The 'RULE-BASED SIGNALS FOUND' section appears to be test/example data or false positives from pattern matching, as the skill definition itself contains only documentation and conceptual descriptions of Pareto front checking.
Generated Apr 25, 2026
当团队需要为新项目建立基准测试库,或查询已有的基准数据时,可通过CLI命令注册或列出所有基准条目,确保后续评估有据可依。适用于软件质量团队在项目启动阶段。
在迭代过程中,自动化将新版本的技能评分与历史的冻结基线进行对比,生成包含每个维度前后差值的JSON报告,帮助团队快速识别能力变化。
在自动改进循环中,通过ParetoFront模块逐维度检查新评分,若发现任一维度回退超过5%则拒绝变更,防止因局部优化牺牲其他关键能力。
基于POWERFUL/SOLID/GENERIC/WEAK四级分级标准,每次评估后自动计算加权综合分,并根据分级结果决定是否将技能推向市场、开源或需要继续迭代。
在进行大规模技能批量评估前,通过list操作获取全部基准条目及其元数据,确保评估覆盖全面,适用于质量保障团队。
将benchmark-store作为SaaS服务提供给外部团队,让他们管理自己的基准数据、执行回归检查并获取质量分级报告。
将benchmark-store嵌入CI/CD流水线,为每个技能变更自动执行Pareto回归检查,作为质量门禁的一部分。
为客户定制评估维度和权重,并部署私有化benchmark-store实例,配套培训和技术支持。
💬 Integration Tip
集成时需确保Pareto基线文件(state/pareto.json)和维护基准数据库(benchmarks.db)的同步更新,并遵循CLI规范操作以避免数据不一致。
Scored Apr 19, 2026
Audited Apr 17, 2026 · audit v1.0
Search and summarize papers from ArXiv. Use when the user asks for the latest research, specific topics on ArXiv, or a daily summary of AI papers.
Assistance with writing literature reviews by searching for academic sources via Semantic Scholar, OpenAlex, Crossref and PubMed APIs. Use when the user needs to find papers on a topic, get details for specific DOIs, or draft sections of a literature review with proper citations.
Creates formal academic research papers following IEEE/ACM formatting standards with proper structure, citations, and scholarly writing style. Use when the user asks to write a research paper, academic paper, or conference paper on any topic.
Search, download, and summarize academic papers from arXiv. Built for AI/ML researchers.
Use this skill when users need to search academic papers, download research documents, extract citations, or gather scholarly information. Triggers include: requests to "find papers on", "search research about", "download academic articles", "get citations for", or any request involving academic databases like arXiv, PubMed, Semantic Scholar, or Google Scholar. Also use for literature reviews, bibliography generation, and research discovery. Requires OpenClawCLI installation from clawhub.ai.
Manage Zotero reference libraries via the Web API. Search, list, add items by DOI/ISBN/PMID (with duplicate detection), delete/trash items, update metadata and tags, export in BibTeX/RIS/CSL-JSON, batch-add from files, check PDF attachments, cross-reference citations, find missing DOIs via CrossRef, and fetch open-access PDFs. Supports --json output for scripting. Use when the user asks about academic references, citation management, literature libraries, PDFs for papers, bibliography export, or Zotero specifically.