Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Generate, edit, and transform images with AI — from DALL-E and Stable Diffusion to FLUX and Midjourney.
These skills connect your agent to leading image generation models for text-to-image creation, image editing, style transfer, upscaling, and batch processing. Used by designers, marketers, and content teams to produce visuals at scale.
Generate images from text prompts using DALL-E, Stable Diffusion, FLUX, and other models.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Create, inspect, process, and optimize image files and visual assets with reliable format choice, resizing, compression, color-profile, metadata, and platfor...
Generate images using the built-in image_generate.py script; prepare a clear, specific `prompt`.
Generate images using Qwen Image API (Alibaba Cloud DashScope). Use when users request image generation with Chinese prompts or need high-quality AI-generated images from text descriptions.
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
Quick install (most popular AI image generation skill):
clawdbot install steipete/sonoscli
979 skills found
Control Sonos speakers (discover/status/play/volume/group).
Control Philips Hue lights/scenes via the OpenHue CLI.
BluOS CLI (blu) for discovery, playback, grouping, and volume.
Control Eight Sleep pods (status, temperature, alarms, schedules).
Capture, inspect, and compare screenshots of screens, windows, regions, web pages, simulators, and CI runs with the right tool, wait strategy, viewport, and...
Extract text from images using Tesseract OCR
Create AI images with GPT Image, Gemini Nano Banana, FLUX, Imagen, and top providers using prompt engineering, style control, and smart editing.
AI image generation and photo editing powered by CellCog. Text-to-image, image-to-image, consistent characters, product photography, reference-based generati...
Best quality AI image generation (~$0.12-0.20/image). Text-to-image, image-to-image, and image editing via the EvoLink API.
Edit images with AI inpainting, outpainting, background removal, upscaling, and restoration tools.
AI image generation with OpenAI GPT Image 2, Azure OpenAI, Google, OpenRouter, DashScope, Z.AI GLM-Image, MiniMax, Jimeng, Seedream and Replicate APIs. Suppo...
Generate or edit AI images with the NanoPhoto.AI Nano Banana Pro API. Use when: (1) User wants text-to-image generation from a prompt, (2) User wants image-t...
Generate publication-quality chart images from data. Supports line, bar, area, point, histogram, candlestick, pie/donut, heatmap, multi-series, and stacked c...
The complete operating system for OpenClaw 5.x agents. Built-in memory tool integration (memory_search, memory_get, DREAMS.md), Discord channel-routing fixes...
Generate high-quality images using a local ComfyUI instance. Use when the user wants private, powerful image generation via their own hardware and custom wor...
Pixel art desktop lobster that lip-syncs to OpenClaw TTS speech. Use when: (1) user wants a visual avatar for their AI agent, (2) user wants a desktop overla...
Smart ClawdBot documentation access with local search index, cached snippets, and on-demand fetch. Token-efficient and freshness-aware.
Participate in ArtWar AI art battles on Monad. Use when you need to submit AI-generated artwork to competitions, place on-chain bets on art submissions, comm...
Generate and edit images with Gemini API using pure Python stdlib. Zero dependencies - works on locked-down environments where pip/uv aren't available.
Analyze Samsung Health Connect data synced to Google Drive. Use for health tracking queries like sleep analysis, step counting, heart rate monitoring, SpO2 b...
Generate images using multiple AI models — Midjourney (via Legnext.ai), Flux, Nano Banana Pro (Gemini), Ideogram, Recraft, and more via fal.ai. Intelligently...
SOTA Computer Vision Expert (2026). Specialized in YOLO26, Segment Anything 3 (SAM 3), Vision Language Models, and real-time spatial analysis.
Skills cover DALL-E 3, Stable Diffusion (SDXL, SD 1.5), FLUX, Midjourney API, Imagen, and ComfyUI workflows. Each skill specifies which backend it uses and any API keys it requires.
Skills can both generate new images and edit existing ones. Inpainting skills edit specific regions of existing images, while img2img skills apply styles or transformations. Upscaling skills enhance the resolution of any image.