telegram-voiceVoice communication via Telegram. Automatically transcribes incoming voice messages using faster-whisper and replies with TTS voice. Use for all voice-relate...
Install via ClawdBot CLI:
clawdbot install Polityang/telegram-voiceGrade Limited — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 22, 2026
Businesses can use this skill to automatically transcribe customer voice messages in Chinese or other languages, enabling AI agents to understand queries and respond with TTS voice replies. This reduces manual handling and improves response times for voice-based support channels.
Language learners can send voice messages in their target language, such as Chinese, to receive transcriptions and TTS feedback for pronunciation practice. The skill facilitates interactive voice-based exercises and corrections on Telegram.
Content creators can use this skill to transcribe voice notes into text for blog posts or social media content, and generate TTS voiceovers for videos or podcasts. It streamlines the workflow from voice input to published media.
Visually impaired users can interact with Telegram by sending voice messages that are transcribed into text for processing, and receive audio replies via TTS. This enhances accessibility in messaging platforms.
Offer this skill as a subscription service for companies needing automated voice support on Telegram, with tiered pricing based on usage volume or features. Revenue comes from monthly or annual fees paid by business clients.
Provide the transcription and TTS capabilities as an API, charging users per voice message processed or per minute of TTS generated. This model targets developers and small businesses with variable needs.
Offer a free basic version with limited features, such as a cap on voice messages per month, and charge for premium features like faster processing or multiple language support. Revenue is generated from upgrades to paid plans.
💬 Integration Tip
Ensure faster-whisper is installed and TTS is configured in OpenClaw before deployment to avoid runtime errors.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.