fal-ai
Generate images, videos, and audio via the fal.ai API (FLUX, SDXL, Whisper, etc.)
Install via ClawdBot CLI:
clawdbot install agmmnn/fal-ai
Generate images, videos, and transcripts using fal.ai's API with support for FLUX, Stable Diffusion, Whisper, and more.
fal_client required)
export FAL_KEY="your-api-key"
Or via clawdbot config:
clawdbot config set skill.fal_api.key YOUR_API_KEY
You: Generate a cyberpunk cityscape with FLUX
Klawf: Creates the image and returns the URL
from fal_api import FalAPI
api = FalAPI()
# Generate and wait
urls = api.generate_and_wait(
    prompt="A serene Japanese garden",
    model="flux-dev"
)
print(urls)
| Model | Endpoint | Type |
| ------------- | ------------------------------------- | ------------ |
| flux-schnell | fal-ai/flux/schnell | Image (fast) |
| flux-dev | fal-ai/flux/dev | Image |
| flux-pro | fal-ai/flux-pro/v1.1-ultra | Image (2K) |
| fast-sdxl | fal-ai/fast-sdxl | Image |
| recraft-v3 | fal-ai/recraft-v3 | Image |
| sd35-large | fal-ai/stable-diffusion-v35-large | Image |
| minimax-video | fal-ai/minimax-video/image-to-video | Video |
| wan-video | fal-ai/wan/v2.1/1.3b/text-to-video | Video |
| whisper | fal-ai/whisper | Audio |
For the full list, run:
python3 fal_api.py --list-models
| Parameter | Type | Default | Description |
| ---------- | ---- | ---------------- | -------------------------------------------------- |
| prompt | str | required | Image/video description |
| model | str | "flux-dev" | Model name from table above |
| image_size | str | "landscape_16_9" | Preset: square, portrait_4_3, landscape_16_9, etc. |
| num_images | int | 1 | Number of images to generate |
| seed | int | None | Random seed for reproducibility |
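The model and parameter tables can be read together as a request builder. The following is a minimal sketch under stated assumptions, not the skill's actual code: `MODEL_ENDPOINTS` and `build_request` are hypothetical names, and the payload keys simply mirror the parameter names in the table. Only the endpoint IDs and default values are taken from the tables, and the image parameters shown here may not apply to the video and audio models.

```python
from typing import Any, Dict, Optional, Tuple

# Endpoint IDs copied from the model table above; the dict name is hypothetical.
MODEL_ENDPOINTS: Dict[str, str] = {
    "flux-schnell": "fal-ai/flux/schnell",
    "flux-dev": "fal-ai/flux/dev",
    "flux-pro": "fal-ai/flux-pro/v1.1-ultra",
    "fast-sdxl": "fal-ai/fast-sdxl",
    "recraft-v3": "fal-ai/recraft-v3",
    "sd35-large": "fal-ai/stable-diffusion-v35-large",
    "minimax-video": "fal-ai/minimax-video/image-to-video",
    "wan-video": "fal-ai/wan/v2.1/1.3b/text-to-video",
    "whisper": "fal-ai/whisper",
}


def build_request(
    prompt: str,
    model: str = "flux-dev",
    image_size: str = "landscape_16_9",
    num_images: int = 1,
    seed: Optional[int] = None,
) -> Tuple[str, Dict[str, Any]]:
    """Hypothetical helper: resolve the endpoint and assemble a payload dict."""
    if not prompt:
        raise ValueError("prompt is required")
    if model not in MODEL_ENDPOINTS:
        raise ValueError(f"unknown model: {model!r}")
    if num_images < 1:
        raise ValueError("num_images must be at least 1")

    payload: Dict[str, Any] = {
        "prompt": prompt,
        "image_size": image_size,
        "num_images": num_images,
    }
    # Only send a seed when the caller asked for reproducibility.
    if seed is not None:
        payload["seed"] = seed
    return MODEL_ENDPOINTS[model], payload
```

Calling `build_request("A serene Japanese garden", seed=42)` resolves the default `flux-dev` model to `fal-ai/flux/dev` and includes the seed in the payload.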
Built following the krea-api skill pattern. Uses fal.ai's queue-based API for reliable async generation.
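A queue-based flow generally means submitting a job, polling its status, and fetching the result once it completes. The sketch below shows only the polling step and is not taken from the skill's source: `wait_for_result` is a hypothetical helper, the status strings `IN_QUEUE`, `IN_PROGRESS`, and `COMPLETED` are an assumption about what the queue reports, and the HTTP calls are left to the two callables so the loop can be exercised without network access.

```python
import time
from typing import Any, Callable, Dict

# Assumed status values; adjust to whatever the queue actually returns.
PENDING_STATES = ("IN_QUEUE", "IN_PROGRESS")


def wait_for_result(
    get_status: Callable[[], Dict[str, Any]],
    get_result: Callable[[], Dict[str, Any]],
    poll_interval: float = 1.0,
    timeout: float = 300.0,
    sleep: Callable[[float], None] = time.sleep,
    clock: Callable[[], float] = time.monotonic,
) -> Dict[str, Any]:
    """Poll get_status() until the job completes, then return get_result()."""
    deadline = clock() + timeout
    while True:
        state = get_status().get("status")
        if state == "COMPLETED":
            return get_result()
        if state not in PENDING_STATES:
            # Fail fast on anything we do not recognise instead of looping forever.
            raise RuntimeError(f"unexpected queue status: {state!r}")
        if clock() >= deadline:
            raise TimeoutError("job did not complete before the timeout")
        sleep(poll_interval)
```

Injecting `sleep` and `clock` keeps the loop testable; in real use the defaults apply and the callables would wrap the status and result requests for a submitted job.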
Generated Mar 1, 2026
Marketing teams can use this skill to generate high-quality images and videos for social media campaigns, advertisements, and promotional materials. It enables rapid prototyping of visual content based on text prompts, reducing reliance on graphic designers and speeding up campaign launches.
Online retailers can create custom product images or lifestyle shots for listings without physical photography. This helps showcase products in various settings or styles, enhancing customer engagement and potentially increasing sales conversions.
Educators and e-learning platforms can generate illustrative images and videos to accompany course materials, making complex topics more accessible. The speech-to-text feature can also transcribe audio lectures for accessibility and note-taking purposes.
Game developers and content creators can produce concept art, character designs, or background visuals for games, videos, and animations. This accelerates the creative process by allowing quick iteration on visual ideas based on descriptive prompts.
Businesses can generate training videos or visual aids for internal processes, and use audio transcription to convert meetings or presentations into text records. This improves knowledge retention and compliance with documentation requirements.
Developers can integrate this skill into existing software-as-a-service platforms to offer AI-generated media as a feature. This could be monetized through subscription tiers or pay-per-use credits, adding value to creative tools or content management systems.
Freelancers and agencies can leverage this skill to offer rapid content creation services to clients, such as generating custom images for blogs or videos for marketing. This reduces production time and costs, allowing for scalable service offerings.
Businesses can build a wrapper around this skill to resell AI media generation capabilities to other developers or companies. This involves managing API keys, providing customer support, and potentially offering custom models or bulk discounts.
💬 Integration Tip
Ensure the FAL_KEY environment variable is securely configured, and consider implementing error handling for API rate limits or failures to maintain reliability in production.
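One way to act on this tip is to read the key from the environment at startup and wrap API calls in a retry with exponential backoff. This is a minimal sketch, not part of the skill: `load_api_key`, `call_with_backoff`, and `RateLimitError` are hypothetical names, and how a rate limit surfaces (for example as an HTTP 429) depends on the client code you wrap.

```python
import os
import time
from typing import Callable, Mapping, Tuple, Type, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical exception your client code raises on a rate-limit response."""


def load_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Read FAL_KEY from the environment and fail early if it is missing."""
    key = env.get("FAL_KEY", "").strip()
    if not key:
        raise RuntimeError("FAL_KEY is not set")
    return key


def call_with_backoff(
    func: Callable[[], T],
    max_attempts: int = 5,
    base_delay: float = 1.0,
    retry_on: Tuple[Type[BaseException], ...] = (RateLimitError,),
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call func(), retrying on retry_on errors with delays of 1x, 2x, 4x, ..."""
    for attempt in range(max_attempts):
        try:
            return func()
        except retry_on:
            if attempt == max_attempts - 1:
                raise  # out of attempts: surface the original error
            sleep(base_delay * 2 ** attempt)
    raise ValueError("max_attempts must be at least 1")
```

Failing early on a missing key avoids confusing authentication errors later, and capping the number of attempts keeps a persistent outage from blocking the caller indefinitely.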
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Capture frames or clips from RTSP/ONVIF cameras.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
Generate images with the built-in image_generate.py script; prepare a clear, specific `prompt`.
AI image generation powered by CellCog. Create images, edit photos, consistent characters, product photography, reference-based images, sets of images, style transfer. Professional image creation with AI.