edge-tts-englishGenerate high-quality English (and multilingual) audio using Microsoft Edge TTS. Use when the user asks to "speak this", "pronounce", "read aloud", "say this...
Install via ClawdBot CLI:
clawdbot install davydenkovm/edge-tts-englishGrade Limited — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 21, 2026
Students use the skill to hear correct pronunciation of English words or phrases, aiding in accent reduction and fluency development. It provides immediate audio feedback for self-study or classroom exercises.
The skill converts text content into speech, allowing visually impaired individuals to access written information audibly. It can be integrated into apps or devices to read documents, emails, or web pages.
Creators generate voiceovers for videos, podcasts, or e-learning modules using high-quality English voices. It streamlines production by offering diverse accents and genders without hiring voice actors.
Businesses implement the skill to provide automated spoken responses in English for IVR systems or chatbots, enhancing user experience with natural-sounding audio. It supports multilingual queries as noted in the skill description.
Users input text in various languages, and the skill outputs spoken English audio, useful for travelers or professionals needing quick translations with pronunciation guidance. It leverages the multilingual support mentioned in the skill.
Offer basic text-to-speech conversion for free with limited voices, then charge subscriptions for premium features like advanced voices, higher audio quality, or API access. Revenue comes from monthly or annual plans.
License the skill to companies for embedding into their products, such as educational software or customer service tools, with fees based on usage volume or per-seat pricing. This targets enterprises needing reliable TTS solutions.
Provide an API where developers pay per audio generation request, ideal for startups or projects with variable usage. It scales with demand and offers flexibility without upfront costs.
💬 Integration Tip
Ensure edge-tts is installed via pipx and scripts are correctly pathed; use the message tool with media parameter to send MP3 files directly to users.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.