edge-tts-uvxText-to-speech conversion using `uvx edge-tts` for generating audio from text. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Install via ClawdBot CLI:
clawdbot install al-one/edge-tts-uvxGrade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 6, 2026
This skill converts text content into speech, enabling visually impaired users to access written information through audio. It can be integrated into websites or applications to read articles, emails, or documents aloud, enhancing inclusivity and compliance with accessibility standards.
Educators and content creators can use this skill to generate voiceovers for online courses, tutorials, or presentations. By adjusting voices and parameters like speed, it allows for engaging audio content that caters to diverse learning styles and languages, improving student engagement.
This skill can power voice responses in smart home assistants, IoT devices, or chatbots, providing natural-sounding speech output. It supports multiple languages and customizable voices, making it suitable for global applications where users interact with devices through spoken feedback.
Producers can leverage this skill to convert scripts into high-quality audio for podcasts or audiobooks, with options to tweak pitch and speed for dramatic effect. It reduces production time and costs by automating voice generation, especially for multilingual or large-scale projects.
In automotive or navigation apps, this skill can read out directions, alerts, or messages to drivers, promoting safety by minimizing distractions. The adjustable parameters allow for clear, audible output that can be tailored to different driving environments and user preferences.
Offer this skill as a cloud-based API service where businesses pay a monthly fee to integrate text-to-speech capabilities into their applications. Revenue is generated through tiered pricing based on usage volume, voice options, and advanced features like subtitle generation.
Develop a mobile app that uses this skill for free basic TTS conversions, with premium features such as additional voices, higher audio quality, or ad-free experience available via in-app purchases. This model attracts a broad user base while monetizing power users.
License the skill to large corporations for internal use in tools like customer service systems, training platforms, or accessibility software. Revenue comes from upfront licensing fees and ongoing support contracts, tailored to specific business needs and compliance requirements.
💬 Integration Tip
Ensure the 'uvx' binary is installed and accessible in the system PATH before using this skill, and test voice outputs with sample text to verify quality and compatibility.
Scored Apr 22, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.