feishu-voice-loopAccept text or voice input, transcribe if needed, generate natural OpenAI TTS speech, and send audio output to Feishu chat or web player.
Install via ClawdBot CLI:
clawdbot install pengzhuowen/feishu-voice-loopGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 18, 2026
Integrate into Feishu to handle voice inquiries from customers, transcribe them to text, generate automated voice responses using OpenAI TTS, and deliver replies directly in chat. Ideal for reducing response times in support ticketing systems.
Use for language education platforms where students submit voice recordings for pronunciation feedback. The skill transcribes audio, assesses text accuracy, and provides corrected audio responses via Feishu to enhance learning engagement.
Deploy within enterprise Feishu workflows to convert text-based internal memos into voice messages for delivery to employees. Useful for accessibility and ensuring important updates are heard directly, especially in remote teams.
Apply in retail or logistics to process voice orders from field staff via Feishu. Transcribe orders to text, validate with backend systems, and confirm with synthesized voice replies, streamlining operations without manual input.
Create immersive audio experiences for entertainment apps by generating dynamic voice narratives based on user text or voice inputs. Deliver stories through Feishu chats or web playback for engaging user interactions.
Offer the skill as a subscription service on a platform, charging businesses monthly for access to voice loop capabilities integrated with Feishu. Revenue scales with usage tiers and additional features like custom voice presets.
License the skill package to other companies for embedding into their own products, such as CRM or education tools. Provide customization and support services, generating revenue through upfront licensing and ongoing maintenance fees.
Expose the voice loop functionality via an API, charging per transaction for transcription and TTS generation. Target developers building voice-enabled apps, with revenue tied to API call volume and data processing.
💬 Integration Tip
Ensure ffmpeg and Feishu credentials are pre-configured to avoid common setup failures during deployment.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Start voice calls via the OpenClaw voice-call plugin.
Local text-to-speech via sherpa-onnx (offline, no cloud)