llm-deploy在 GPU 服务器上部署 LLM 模型服务(vLLM)。支持多服务器配置,自动检查 GPU 和端口占用,一键部署流行的开源大语言模型。
Install via ClawdBot CLI:
clawdbot install wang-junjian/llm-deployGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://github.com/vllm-project/vllmAudited Apr 18, 2026 · audit v1.0
Generated Mar 22, 2026
Research teams can quickly deploy multiple LLMs on GPU clusters for experimentation and benchmarking. This skill automates server checks and model launches, reducing setup time from hours to minutes.
Companies can deploy proprietary or open-source LLMs as internal APIs for chatbots, document analysis, or code generation. It supports multi-server configurations for scaling across GPU resources.
Providers can use this skill to manage LLM deployments for customers, offering on-demand model hosting with automated resource monitoring and port management.
Startups can rapidly prototype AI products by deploying models like Llama 3 or Mistral on available GPU servers, enabling quick iteration without deep DevOps expertise.
Institutions can set up hands-on workshops where students deploy and interact with LLMs, using the skill's simple commands to manage models and server states.
Offer a subscription-based service where clients pay for hosted LLM instances on GPU servers. Revenue comes from monthly fees based on model size and usage hours, with automated deployment reducing operational costs.
Provide consulting services to help enterprises integrate LLMs into their workflows, using this skill for setup and optimization. Revenue is generated through project-based contracts and ongoing support retainers.
License the skill as part of a white-label AI platform for other businesses to resell. Revenue comes from licensing fees and a percentage of customer sales, leveraging the skill's ease of use for quick market entry.
💬 Integration Tip
Integrate with existing CI/CD pipelines by automating server checks before deployment, and use the custom model configuration to align with internal model repositories.
Scored Jun 19, 2026
Full desktop computer use for headless Linux servers. Xvfb + XFCE virtual desktop with xdotool automation. 17 actions (click, type, scroll, screenshot, drag,...
Diagnoses common Linux service issues using logs, systemd/PM2, file permissions, Nginx reverse proxy checks, and DNS sanity checks. Use when a server app is failing, unreachable, or misconfigured.
Run a single command on a remote Tailscale node via SSH without opening an interactive session.
Debug DNS resolution and network connectivity. Use when troubleshooting DNS failures, testing port connectivity, diagnosing firewall rules, inspecting HTTP requests with curl verbose mode, configuring /etc/hosts, or debugging proxy and certificate issues.
主动监控系统状态。定期检查服务器健康,主动汇报,无需等待指令。
Manage Coolify deployments, applications, databases, and services via the Coolify API. Use when the user wants to deploy, start, stop, restart, or manage applications hosted on Coolify.