minimax-speechManage MiniMax Speech 2.8 TTS requests, voice catalog lookups, and precise voice/audio configuration using MiniMax API via CLI or script.
Install via ClawdBot CLI:
clawdbot install wingchiu/minimax-speechpip install requests in the environment that will execute the script. The CLI talks to MiniMax's REST API, so you only need the requests library on top of Python 3.11+.MINIMAX_API_KEY with the API key the user promised to supply. The script will refuse to run without it.scripts/minimax_tts.py exposes two subcommands:tts: calls POST https://api.minimax.io/v1/t2a_v2 (Speech 2.8 T2A HTTP) with the desired voice_id, voice settings, audio configuration, and optional voice effects. Example:
python scripts/minimax_tts.py tts \
--text "Tonight in Shenzhen the skies are clear." \
--voice-id "Sweet_Girl_2" \
--model speech-2.8-hd \
--audio-format mp3 \
--output minimax-weather.mp3
The script decodes the hex/base64 payload, saves the file, and prints metadata. Override the endpoint with --endpoint if you must hit https://api-uw.minimax.io/v1/t2a_v2 or another region.
voices: calls POST https://api.minimax.io/v1/get_voice to enumerate system, voice_cloning, voice_generation, or all categories. Example:
python scripts/minimax_tts.py voices --voice-type all --print-response
--speed, --vol, --pitch, and --language-boost to shape the voice. Control audio fidelity with --sample-rate, --bitrate, --audio-format, and --channel. Add pronunciation overrides (--pronunciation "emoji=ee-moh-jee") or mix timbres (--timbre-weight "Sweet_Girl_2=0.8"). --voice-modify-* flags let you nudge pitch/intensity/timbre or append a sound effect such as "spacious_echo". --output-format tells the script how the API returns audio (hex, base64, or a download url).extra_info field so you can read bitrate/sample rate/length. Use --print-response on either subcommand to dump the entire API payload for debugging. Save catalog JSON to disk with --output when you need to reference voices later.Keep this skill loaded any time you are asked for MiniMax-specific voices or when precise speech settings are required. The CLI gives you deterministic control over voice_id, model, and audio quality so you always get the Sweet_Girl_2 (or any other) tone you expect. If you need to script these requests from another tool, copy the relevant requests.post logic from scripts/minimax_tts.py.
Generated Feb 25, 2026
Companies can deploy consistent brand voices across global customer service hotlines using cloned or generated voices with language boost settings. This ensures uniform tone and pronunciation for technical support, billing inquiries, and automated responses in multiple languages.
Publishers can generate audiobook narration with specific voice characteristics using timbre weighting and pronunciation overrides for character voices. The audio format controls allow optimization for streaming platforms while maintaining consistent voice quality across long recordings.
E-learning platforms can create localized voiceovers for course materials by adjusting pitch, speed, and language settings for different regional audiences. The voice catalog lookup helps select appropriate system voices for various age groups and educational contexts.
Financial institutions can implement branded IVR systems with specific voice IDs and audio configurations for different service tiers. Voice modification flags allow creating distinct tones for urgent alerts versus routine notifications while maintaining audio quality standards.
Developers can build specialized voice assistants for visually impaired users by customizing speech parameters like speed and pitch for optimal comprehension. The deterministic voice control ensures consistent auditory interfaces across different applications and devices.
Offer API access to MiniMax TTS capabilities with premium voice catalog access and advanced audio customization features. Charge per API call or through tiered subscription plans based on usage volume and voice quality options.
Package the skill with custom voice presets and branded interfaces for businesses needing consistent voice branding. Provide integration support and maintenance contracts while clients pay licensing fees for the customized voice system.
Use the skill's precise voice control to offer professional audio production services for audiobooks, advertisements, and training materials. Charge project-based fees for high-quality voice generation with specific tone and pronunciation requirements.
💬 Integration Tip
Always validate MINIMAX_API_KEY before execution and use the --print-response flag during initial integration to understand API response structure and error handling.
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Research any topic from the last 30 days on Reddit + X + Web, synthesize findings, and write copy-paste-ready prompts. Use when the user wants recent social/web research on a topic, asks "what are people saying about X", or wants to learn current best practices. Requires OPENAI_API_KEY and/or XAI_API_KEY for full Reddit+X access, falls back to web search.
Check Antigravity account quotas for Claude and Gemini models. Shows remaining quota and reset times with ban detection.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates opencla...
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.