voice-ai-voicesHigh-quality voice synthesis with 9 personas, 11 languages, and streaming using Voice.ai API.
Install via ClawdBot CLI:
clawdbot install gizmogremlin/voice-ai-voicesRequires:
Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
POST → https://dev.voice.ai/api/v1/tts/speechCalls external URL not in known-safe list
https://voice.ai/dashboardAudited Apr 17, 2026 · audit v1.0
Generated Mar 1, 2026
Vloggers and influencers can use the skill to generate voiceovers for videos in multiple languages, leveraging the 9 personas for different tones. The streaming mode allows real-time audio generation for long-form content, enhancing production efficiency.
Educational platforms can integrate the skill to produce multilingual audio for courses, using voices like Oliver for clear tutorials. Customizable parameters like temperature help tailor the narration to match educational content styles.
Game developers can utilize voices like Shadow or Commander for character dialogues and in-game announcements, with support for 11 languages to reach global audiences. Streaming mode enables dynamic audio generation during gameplay.
Publishers can generate high-quality audiobooks using voices like Smooth for deep narration, with multilingual capabilities for international distribution. The skill's audio formats like MP3 and WAV ensure compatibility with various platforms.
Businesses can deploy the skill for automated voice responses in customer support systems, using cheerful voices like Flora for upbeat interactions. Integration with OpenClaw allows easy command-based triggering for real-time use.
Offer the skill as a service with tiered subscriptions based on usage limits, targeting developers and businesses needing high-quality TTS. Revenue streams include monthly fees and pay-per-use options for scalable access.
License the skill to e-learning or gaming platforms as an embedded TTS solution, providing custom branding and voice options. Revenue is generated through licensing fees and revenue-sharing agreements with partners.
Provide basic voice synthesis for free to attract individual creators, while charging for advanced features like streaming mode, additional languages, or premium voices. Revenue comes from upgrades and in-app purchases.
💬 Integration Tip
Ensure the VOICE_AI_API_KEY is set as an environment variable and test with simple commands like /tts before scaling to complex streaming scenarios.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Any-to-any AI sub-agent — research, images, video, audio, music, podcasts, avatars, voice cloning, documents, spreadsheets, dashboards, 3D models, diagrams,...
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.