openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-bananaGenerate images & videos with AIsa. Gemini 3 Pro Image (image) + Qwen Wan 2.6 (video) via one API key.
Install via ClawdBot CLI:
clawdbot install 0xjordansg-yolo/openclaw-aisa-image-video-models-wan2-6-gemini-3-pro-image-nano-banana用 AIsa API 一把钥匙生成图片与视频:
gemini-3-pro-image-preview(Gemini GenerateContent)wan2.6-t2v(通义万相 / Qwen Wan 2.6,异步任务)API 文档索引见 AIsa API Reference(可从 https://aisa.mintlify.app/llms.txt 找到所有页面)。
"生成一张赛博朋克风格的城市夜景,霓虹灯,雨夜,电影感"
"用一张参考图生成 5 秒镜头:镜头缓慢推进,风吹动头发,电影感,浅景深"
export AISA_API_KEY="your-key"
https://api.aisa.one/v1POST /models/{model}:generateContent文档:google-gemini-chat(GenerateContent)见 https://aisa.mintlify.app/api-reference/chat/chat-api/google-gemini-chat.md。
curl -X POST "https://api.aisa.one/v1/models/gemini-3-pro-image-preview:generateContent" \
-H "Authorization: Bearer $AISA_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"contents":[
{"role":"user","parts":[{"text":"A cute red panda, ultra-detailed, cinematic lighting"}]}
]
}'
说明:该接口的响应中可能出现 candidates[].parts[].inline_data(通常包含 base64 数据与 mime 类型);客户端脚本会自动解析并保存文件。
https://api.aisa.one/apis/v1POST /services/aigc/video-generation/video-synthesisX-DashScope-Async: enable(必填,异步)文档:video-generation 见 https://aisa.mintlify.app/api-reference/aliyun/video/video-generation.md。
curl -X POST "https://api.aisa.one/apis/v1/services/aigc/video-generation/video-synthesis" \
-H "Authorization: Bearer $AISA_API_KEY" \
-H "Content-Type: application/json" \
-H "X-DashScope-Async: enable" \
-d '{
"model":"wan2.6-t2v",
"input":{
"prompt":"cinematic close-up, slow push-in, shallow depth of field",
"img_url":"https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg"
},
"parameters":{
"resolution":"720P",
"duration":5,
"shot_type":"single",
"watermark":false
}
}'
GET /services/aigc/tasks?task_id=...文档:task 见 https://aisa.mintlify.app/api-reference/aliyun/video/task.md。
curl "https://api.aisa.one/apis/v1/services/aigc/tasks?task_id=YOUR_TASK_ID" \
-H "Authorization: Bearer $AISA_API_KEY"
# 生成图片(保存到本地文件)
python3 {baseDir}/scripts/media_gen_client.py image \
--prompt "A cute red panda, cinematic lighting" \
--out "out.png"
# 创建视频任务(需要 img_url)
python3 {baseDir}/scripts/media_gen_client.py video-create \
--prompt "cinematic close-up, slow push-in" \
--img-url "https://upload.wikimedia.org/wikipedia/commons/thumb/3/3a/Cat03.jpg/320px-Cat03.jpg" \
--duration 5
# 轮询任务状态
python3 {baseDir}/scripts/media_gen_client.py video-status --task-id YOUR_TASK_ID
# 等待直到成功(可选:成功后打印 video_url)
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --poll 10 --timeout 600
# 等待直到成功并自动下载 mp4
python3 {baseDir}/scripts/media_gen_client.py video-wait --task-id YOUR_TASK_ID --download --out out.mp4
Generated Mar 1, 2026
Marketers and influencers can quickly generate eye-catching images and short videos for posts, stories, or ads. This skill enables rapid prototyping of visual content tailored to trends, reducing production time and costs.
Online retailers can create custom product images or promotional videos from text prompts or reference images. This helps showcase items in various styles or contexts, enhancing listings and boosting customer engagement.
Educators and trainers can generate illustrative images and explanatory videos for courses, presentations, or tutorials. This supports visual learning by creating custom content that aligns with specific topics or concepts.
Game developers and content creators can produce concept art, character designs, or short cinematic clips for projects. This accelerates the creative process by generating media based on descriptive prompts or reference images.
Designers and architects can visualize ideas by generating images of scenes or videos of spatial concepts. This aids in client presentations and iterative design without needing extensive manual rendering.
Offer this skill as a paid API service for developers and businesses to integrate media generation into their applications. Charge based on usage tiers or per request, providing scalable access to image and video creation.
Build a web or mobile platform where users can generate and customize media, with premium features like higher resolutions or faster processing. Monetize through freemium models, in-app purchases, or advertising.
License the skill to other companies for embedding in their products, such as marketing tools or design software. Provide customization and support, generating revenue through licensing fees and ongoing maintenance contracts.
💬 Integration Tip
Ensure the AISA_API_KEY is securely stored and manage asynchronous video tasks with polling to handle delays and status updates efficiently.
Generate/edit images with Nano Banana Pro (Gemini 3 Pro Image). Use for image create/modify requests incl. edits. Supports text-to-image + image-to-image; 1K/2K/4K; use --input-image.
Capture frames or clips from RTSP/ONVIF cameras.
Batch-generate images via OpenAI Images API. Random prompt sampler + `index.html` gallery.
Generate images using the internal Google Antigravity API (Gemini 3 Pro Image). High quality, native generation without browser automation.
使用内置 image_generate.py 脚本生成图片, 准备清晰具体的 `prompt`。
AI image generation powered by CellCog. Create images, edit photos, consistent characters, product photography, reference-based images, sets of images, style transfer. Professional image creation with AI.