gemini-image-remixGenerate or remix images using Gemini models with text prompts and multiple input images, supporting various styles, resolutions, and advanced model options.
Install via ClawdBot CLI:
clawdbot install rdeangel/gemini-image-remixA versatile tool for text-to-image generation and complex image-to-image remixing. By default, it uses Gemini 2.5 Flash Image for fast, high-quality results. It also supports flagship models like Gemini 3.0 Pro (Nano Banana Pro) for advanced artistic tasks.
Create stunning visuals from a text prompt.
uv run {baseDir}/scripts/remix.py --prompt "a cybernetic owl in a neon forest" --filename "owl.png"
Use one or more reference images to guide the generation. Perfect for style transfers, background changes, or character modifications.
uv run {baseDir}/scripts/remix.py --prompt "change the art style to a pencil sketch" --filename "sketch.png" -i "original.png"
Combine elements from up to 14 different images into a single cohesive scene.
uv run {baseDir}/scripts/remix.py --prompt "place the character from image 1 into the environment of image 2" --filename "result.png" -i "character.png" -i "env.png"
Switch to advanced models like Nano Banana Pro for high-fidelity work.
uv run {baseDir}/scripts/remix.py --model "gemini-3-pro-image-preview" --prompt "highly detailed oil painting of a dragon" --filename "dragon.png"
--prompt, -p: Image description or specific edit instructions.--filename, -f: The output path for the generated PNG.--input-image, -i: Path to an input image (repeatable up to 14 times).--resolution, -r: 1K (default), 2K, or 4K.--aspect-ratio, -a: Output aspect ratio (e.g., 1:1, 16:9, 9:16, 4:3, 3:4).--model, -m: Model to use (defaults to gemini-2.5-flash-image). Supported: gemini-2.5-flash-image, gemini-3-pro-image-preview.--api-key, -k: Gemini API key (defaults to GEMINI_API_KEY env var).Generated Mar 1, 2026
Marketing teams can generate custom visuals for social media campaigns, ads, and blog posts based on text prompts, enabling rapid prototyping of creative assets without graphic design expertise. It supports quick iterations for A/B testing different visual styles.
Online retailers can remix product images to showcase items in various settings or artistic styles, such as placing a piece of furniture in different room backgrounds or applying seasonal themes to apparel photos. This enhances product listings and customer engagement.
Game developers can use multi-image composition to combine character designs, environments, and props into cohesive scenes for concept art or promotional materials. It accelerates the ideation phase by generating high-quality visuals from reference images.
Educators and content creators can generate illustrations for textbooks, presentations, or online courses based on descriptive prompts, such as historical events or scientific concepts. This reduces reliance on stock images and allows for tailored visual aids.
Architects and real estate professionals can remix building designs into different environments or artistic renderings, like pencil sketches or oil paintings, to present concepts to clients. It facilitates visualization of modifications without extensive 3D modeling.
Offer the skill as a cloud-based service with tiered pricing based on usage limits, such as number of image generations per month or access to advanced models like Gemini 3.0 Pro. This provides recurring revenue from businesses needing regular visual content.
Provide a free tier with basic features and limited usage, then charge for higher-resolution outputs, faster processing, or access to premium models. This attracts individual users and small businesses while monetizing heavy usage from enterprises.
License the skill to other platforms, such as design tools or content management systems, allowing them to embed image generation capabilities under their own branding. This generates revenue through licensing fees or revenue-sharing agreements.
đŹ Integration Tip
Set the GEMINI_API_KEY environment variable securely and ensure uv is installed via brew for smooth execution; use the default model for quick starts and switch to advanced models only for complex tasks.
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Capture frames or clips from RTSP/ONVIF cameras.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
äœżçšć çœź image_generate.py èæŹçæćŸç, ć〿ž æ°ć ·äœç `prompt`ă
AI image generation powered by CellCog. Create images, edit photos, consistent characters, product photography, reference-based images, sets of images, style transfer. Professional image creation with AI.