edge-ttsText-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Install via ClawdBot CLI:
clawdbot install i3130002/edge-ttsGrade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
http://localhost:7890Audited Apr 16, 2026 · audit v1.0
Generated Mar 1, 2026
Educational platforms can integrate this skill to convert textbooks and course materials into audio, allowing visually impaired students to listen to content. The adjustable speed and pitch features help customize the listening experience for different learning paces and preferences.
Smart home devices can use this TTS skill to read out recipes, news summaries, or reminders while users cook or clean. The support for multiple languages and voices enables personalized interactions in diverse household settings.
Media companies can generate high-quality voiceovers for podcasts, audiobooks, or video narrations using customizable voices and output formats. The subtitle generation feature adds value by creating synchronized text for accessibility or translations.
Call centers can deploy this skill to convert automated responses or FAQs into natural-sounding speech for IVR systems. The ability to adjust rate and pitch helps convey urgency or calmness based on customer needs.
Language learning apps can integrate TTS to provide audio examples of vocabulary and phrases in different accents, such as British or American English. The pitch control aids in emphasizing intonation patterns for better pronunciation practice.
Offer this TTS skill as a cloud-based API service where developers pay a monthly fee based on usage volume, such as number of audio minutes generated. It targets app builders needing reliable, scalable text-to-speech without managing infrastructure.
Integrate the skill into a mobile app that offers basic TTS features for free, with premium upgrades for advanced voices, higher audio quality, or ad-free usage. Monetize through in-app purchases and advertisements.
License the skill to large enterprises for internal use in training modules, accessibility tools, or communication systems, with custom support and integration services. Charge based on the number of users or annual contracts.
💬 Integration Tip
Start by using the built-in tts tool for simple implementations, then leverage the scripts for advanced customization like voice selection and rate adjustments.
Scored Apr 16, 2026
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.