computer-vision-expertSOTA Computer Vision Expert (2026). Specialized in YOLO26, Segment Anything 3 (SAM 3), Vision Language Models, and real-time spatial analysis.
Install via ClawdBot CLI:
clawdbot install zorrong/computer-vision-expertRole: Advanced Vision Systems Architect & Spatial Intelligence Expert
To provide expert guidance on designing, implementing, and optimizing state-of-the-art computer vision pipelines. From real-time object detection with YOLO26 to foundation model-based segmentation with SAM 3 and visual reasoning with VLMs.
| Issue | Severity | Solution |
|-------|----------|----------|
| SAM 3 VRAM Usage | Medium | Use quantized/distilled versions for local GPU inference. |
| Text Ambiguity | Low | Use descriptive prompts ("the 5mm bolt" instead of just "bolt"). |
| Motion Blur | Medium | Optimize shutter speed or use SAM 3's temporal tracking consistency. |
| Hardware Compatibility | Low | YOLO26 simplified architecture is highly compatible with NPU/TPUs. |
ai-engineer, robotics-expert, research-engineer, embedded-systems
Generated Mar 1, 2026
Deploy YOLO26 for real-time defect detection on assembly lines, using SAM 3's text-to-mask to segment specific faulty components without retraining for each variation. This reduces manual inspection costs and improves throughput in manufacturing.
Integrate YOLO26 for obstacle detection and Depth Anything V2 for spatial awareness in drones or robots, enabling real-time mapping and avoidance in dynamic environments. Visual SLAM enhances localization accuracy for autonomous operations.
Use VLMs like Florence-2 for semantic scene understanding to track product placements and customer interactions, combined with SAM 3 for text-guided segmentation of specific items. This optimizes shelf stocking and reduces stockouts.
Apply SAM 3's 3D reconstruction to medical scans for precise organ or tumor segmentation, leveraging text prompts for targeted analysis. YOLO26 can assist in fast anomaly detection to support diagnostic workflows.
Implement YOLO26 for real-time object detection in video feeds to monitor traffic or public safety, with SAM 3 enabling text-based queries to isolate events like 'red car on left'. This enhances situational awareness and response times.
Offer a cloud-based service where clients upload images or video streams for automated analysis using YOLO26 and SAM 3, with APIs for custom segmentation and detection tasks. Revenue comes from subscription tiers based on usage volume and features.
License optimized models like YOLO26 and quantized SAM 3 versions for deployment on IoT devices or embedded systems in industries like manufacturing or retail. Revenue is generated through one-time licenses or annual maintenance fees per device.
Provide expert services to design and implement custom vision pipelines for specific client needs, such as integrating SAM 3D for 3D reconstruction in construction or using VLMs for visual Q&A in customer support. Revenue is project-based with hourly rates.
💬 Integration Tip
Start with YOLO26 for fast detection as a foundation, then layer SAM 3 for precise segmentation to avoid redundant models; use ONNX/TensorRT exports for efficient edge deployment.
Guide for creating effective skills. This skill should be used when users want to create a new skill (or update an existing skill) that extends Claude's capabilities with specialized knowledge, workflows, or tool integrations.
Provides a 7-step debugging protocol plus language-specific commands to systematically identify, verify, and fix software bugs across multiple environments.
A comprehensive skill for using the Cursor CLI agent for various software engineering tasks (updated for 2026 features, includes tmux automation guide).
Write, run, and manage unit, integration, and E2E tests across TypeScript, Python, and Swift using recommended frameworks.
Control and operate Opencode via slash commands. Use this skill to manage sessions, select models, switch agents (plan/build), and coordinate coding through Opencode.
Coding style memory that adapts to your preferences, conventions, and patterns for consistent coding.