google-gemini-media
Use the Gemini API (Nano Banana image generation, Veo video, Gemini TTS speech and audio understanding) to deliver end-to-end multimodal media workflows and code templates for "generation + understanding".
Install via ClawdBot CLI:
clawdbot install Xsir0/google-gemini-media
Grade: Good, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 1, 2026
Generate high-quality images of products for online stores, such as custom-designed items or food dishes, to enhance listings and marketing materials. This reduces the need for expensive photoshoots and allows rapid iteration on visual concepts.
Create narrated videos and images for e-learning platforms, explaining complex topics with AI-generated visuals and speech. This automates content production for courses, tutorials, and interactive learning modules.
Produce short promotional videos and images for social media campaigns, using text-to-video and image generation to quickly create engaging content. This streamlines ad creation and allows for personalized messaging at scale.
Analyze customer-uploaded images, videos, or audio to provide automated support, such as identifying product issues or transcribing support calls. This improves response times and reduces manual effort in helpdesk operations.
Edit existing images or extend videos for film, television, or digital media projects, using AI to enhance or modify visual content efficiently. This accelerates post-production workflows and reduces costs for studios.
Offer a cloud-based platform where users pay a monthly fee to access AI-powered media generation and understanding tools via API. This provides recurring revenue and scales with customer usage across various industries.
Charge customers based on the number of API calls or media processing tasks, such as per image generated or minute of video analyzed. This model appeals to businesses with variable needs and allows low entry costs.
License the skill package to other companies for integration into their own products, such as marketing tools or content management systems, with custom branding. This generates upfront licensing fees and ongoing support contracts.
💬 Integration Tip
Implement a file size threshold (e.g., 10MB) to automatically switch between inline and Files API modes for optimal performance and compliance with request limits.
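The threshold switch described in this tip could be sketched roughly as follows. This is a minimal illustration, not the skill's actual code: the function names `choose_upload_mode` and `choose_upload_mode_for_path`, the constant `INLINE_THRESHOLD_BYTES`, and the mode labels `"inline"` and `"files_api"` are all hypothetical, and the 10 MB value is only the example figure from the tip. The actual upload calls are left out on purpose.

```python
import os

# Assumed threshold, taken from the tip's "e.g., 10MB"; adjust it to the
# request limits that apply to your account and model.
INLINE_THRESHOLD_BYTES = 10 * 1024 * 1024


def choose_upload_mode(size_bytes: int,
                       threshold_bytes: int = INLINE_THRESHOLD_BYTES) -> str:
    """Return "inline" for payloads at or below the threshold, else "files_api".

    The returned labels are hypothetical; map them to whatever upload
    paths your client code actually implements.
    """
    if size_bytes < 0:
        raise ValueError("size_bytes must be non-negative")
    return "inline" if size_bytes <= threshold_bytes else "files_api"


def choose_upload_mode_for_path(path: str,
                                threshold_bytes: int = INLINE_THRESHOLD_BYTES) -> str:
    """Pick the mode for a file on disk based on its size in bytes."""
    return choose_upload_mode(os.path.getsize(path), threshold_bytes)


if __name__ == "__main__":
    # Show the decision for a small, a boundary, and a large payload.
    for size in (512 * 1024, 10 * 1024 * 1024, 25 * 1024 * 1024):
        print(size, choose_upload_mode(size))
```

Keeping the decision in one small pure function makes the threshold easy to test and to change without touching the upload code itself.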
Scored Apr 15, 2026
Gemini CLI for one-shot Q&A, summaries, and generation.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
A comprehensive AI model routing system that automatically selects the optimal model for any task. Set up multiple AI providers (Anthropic, OpenAI, Gemini, Moonshot, Z.ai, GLM) with secure API key storage, then route tasks to the best model based on task type, complexity, and cost optimization. Includes interactive setup wizard, task classification, and cost-effective delegation patterns. Use when you need "use X model for this", "switch model", "optimal model", "which model should I use", or to balance quality vs cost across multiple AI providers.
Reduce OpenClaw AI costs by 97%. Haiku model routing, free Ollama heartbeats, prompt caching, and budget controls. Go from $1,500/month to $50/month in 5 min...