Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Generate, edit, and transform images with AI — from DALL-E and Stable Diffusion to FLUX and Midjourney.
These skills connect your agent to leading image generation models for text-to-image creation, image editing, style transfer, upscaling, and batch processing. Used by designers, marketers, and content teams to produce visuals at scale.
Generate images from text prompts using DALL-E, Stable Diffusion, FLUX, and other models.
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Capture frames or clips from RTSP/ONVIF cameras.
Create, inspect, process, and optimize image files and visual assets with reliable format choice, resizing, compression, color-profile, metadata, and platfor...
使用内置 image_generate.py 脚本生成图片, 准备清晰具体的 `prompt`。
Extract text from images using Tesseract OCR
Quick install — most popular ai image generation skill:
clawdbot install steipete/sonoscli1,070 skills found
Page 1 of 45
Control Sonos speakers (discover/status/play/volume/group).
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Control Philips Hue lights/scenes via the OpenHue CLI.
BluOS CLI (blu) for discovery, playback, grouping, and volume.
Control Eight Sleep pods (status, temperature, alarms, schedules).
Capture frames or clips from RTSP/ONVIF cameras.
Control Home Assistant smart home devices, run automations, and receive webhook events. Use when controlling lights, switches, climate, scenes, scripts, or any HA entity. Supports bidirectional communication via REST API (outbound) and webhooks (inbound triggers from HA automations).
Create, inspect, process, and optimize image files and visual assets with reliable format choice, resizing, compression, color-profile, metadata, and platfor...
Capture, inspect, and compare screenshots of screens, windows, regions, web pages, simulators, and CI runs with the right tool, wait strategy, viewport, and...
Extract text from images using Tesseract OCR
AI image generation and photo editing powered by CellCog. Text-to-image, image-to-image, consistent characters, product photography, reference-based generati...
Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLink API.
Create AI images with GPT Image, Gemini Nano Banana, FLUX, Imagen, and top providers using prompt engineering, style control, and smart editing.
Edit images with AI inpainting, outpainting, background removal, upscaling, and restoration tools.
Generates article cover images with 5 dimensions (type, palette, rendering, text, mood) combining 11 color palettes and 7 rendering styles. Supports cinemati...
Generate publication-quality chart images from data. Supports line, bar, area, point, histogram, candlestick, pie/donut, heatmap, multi-series, and stacked c...
Automate TikTok slideshow marketing for any app or product. Researches competitors, generates AI images, adds text overlays, posts via Postiz, tracks analyti...
Smart ClawdBot documentation access with local search index, cached snippets, and on-demand fetch. Token-efficient and freshness-aware.
Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.
Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, Nano Banana Pro (Gemini), Ideogram, Recraft, and more via fal.ai. Intelligently...
Perform image manipulation tasks like background removal, resizing, format conversion, rounding corners, watermarking, and color adjustments using ImageMagic...
SOTA Computer Vision Expert (2026). Specialized in YOLO26, Segment Anything 3 (SAM 3), Vision Language Models, and real-time spatial analysis.
AI image generation with OpenAI, Google, DashScope and Replicate APIs. Supports text-to-image, reference images, aspect ratios. Sequential by default; parall...
Skills cover DALL-E 3, Stable Diffusion (SDXL, SD 1.5), FLUX, Midjourney API, Imagen, and ComfyUI workflows. Each skill specifies which backend and any API keys required.
Both. Inpainting skills edit specific regions of existing images, while img2img skills apply styles or transformations. Upscaling skills enhance resolution of any image.