text-to-speech-heygen
Generate speech audio from text using HeyGen's Starfish TTS model. Use when: (1) Generating standalone speech audio files from text, (2) Converting text to s...
Install via ClawdBot CLI:
clawdbot install michaelwang11394/text-to-speech-heygen
Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
POST → https://api.heygen.com/v1/audio/text_to_speech
Calls external URL not in known-safe list
https://api.heygen.com/v1/audio/voices
AI Analysis
The skill's external API calls (api.heygen.com) are explicitly documented and directly serve its stated purpose of text-to-speech generation. The authentication requirement is clearly specified, and no hidden instructions, credential harvesting, or obfuscation are present.
Audited Apr 17, 2026 · audit v1.0
Generated Mar 20, 2026
Automate narration for podcast episodes by converting script text to speech with customizable voices and pacing. This enables solo creators to produce high-quality audio content without hiring voice actors, saving time and costs.
Generate voiceovers for online courses and training materials in multiple languages and accents. Educators can enhance accessibility and engagement by offering audio versions of text-based content for diverse learners.
Convert written books or articles into audiobooks using natural-sounding TTS voices with adjustable speed and pitch. This allows authors and publishers to quickly expand their offerings without studio recording sessions.
Create pre-recorded audio responses for IVR systems or chatbots to handle common inquiries. Businesses can improve customer experience with clear, consistent voice prompts in various languages.
Develop assistive applications that read out text from websites, documents, or apps for visually impaired users. This promotes inclusivity by providing real-time audio conversion with voice selection options.
Offer a cloud-based platform where users pay monthly fees for API access to generate and download speech audio. Revenue scales with usage tiers, targeting businesses needing regular audio content production.
Provide free limited access to TTS features with paid upgrades for advanced voices, higher quality, or increased quotas. This attracts individual developers and small teams who can later convert to enterprise plans.
License the TTS technology to other software companies for embedding into their products, such as e-learning platforms or content management systems. Revenue comes from one-time setup fees and ongoing royalties.
💬 Integration Tip
Start by listing available voices to match user preferences, then implement error handling for API key authentication to ensure smooth audio generation workflows.
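The tip above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the two URLs are the ones listed in the audit notes on this page, but the `X-Api-Key` header name, the `text` and `voice_id` body fields, and all function names are assumptions that should be checked against HeyGen's API reference before use.

```python
import json
import urllib.error
import urllib.request

# Base path taken from the two endpoints listed on this page.
BASE_URL = "https://api.heygen.com/v1/audio"


def build_voices_request(api_key: str) -> urllib.request.Request:
    """Build (without sending) the request that lists available voices."""
    return urllib.request.Request(
        f"{BASE_URL}/voices",
        # ASSUMPTION: the API key is passed in an "X-Api-Key" header.
        headers={"X-Api-Key": api_key, "Accept": "application/json"},
        method="GET",
    )


def build_tts_request(api_key: str, text: str, voice_id: str) -> urllib.request.Request:
    """Build (without sending) the text-to-speech request."""
    # ASSUMPTION: the body field names "text" and "voice_id" are illustrative.
    body = json.dumps({"text": text, "voice_id": voice_id}).encode("utf-8")
    return urllib.request.Request(
        f"{BASE_URL}/text_to_speech",
        data=body,
        headers={"X-Api-Key": api_key, "Content-Type": "application/json"},
        method="POST",
    )


def explain_http_error(status: int) -> str:
    """Map an HTTP status code to a short, user-facing message."""
    if status in (401, 403):
        return "authentication failed: check the API key"
    if status == 429:
        return "rate limited: retry later"
    return f"request failed with HTTP {status}"


def send(request: urllib.request.Request) -> dict:
    """Send a request and decode the JSON reply, surfacing auth errors clearly."""
    try:
        with urllib.request.urlopen(request, timeout=30) as response:
            return json.loads(response.read().decode("utf-8"))
    except urllib.error.HTTPError as err:
        raise RuntimeError(explain_http_error(err.code)) from err
```

A caller would first run `send(build_voices_request(key))` to pick a voice, then pass the chosen identifier to `build_tts_request`. Keeping request construction separate from sending makes the authentication and payload logic easy to check without touching the network.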
Scored Apr 24, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.