discord-voice
Real-time voice conversations in Discord voice channels with Claude AI
Install via ClawdBot CLI:
clawdbot install avatarneil/discord-voice
Grade: Good, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 1, 2026
Integrates with Discord voice channels to provide real-time transcription and AI-driven summaries during team meetings. It can answer questions based on the conversation and generate action items, enhancing productivity for remote teams.
Uses speech-to-text and text-to-speech to simulate conversations in Discord voice channels for language practice. Learners can speak in a target language, receive corrections, and engage in interactive dialogues with Claude AI.
Deploys in Discord communities to handle voice-based customer inquiries in real-time. It transcribes user questions, processes them with Claude AI for responses, and speaks back solutions, reducing wait times for support.
Joins Discord voice channels during gaming sessions to provide live assistance, such as strategy tips, lore explanations, or social moderation. Barge-in support lets players cut off a response mid-sentence, so the assistant does not talk over gameplay.
Enables visually impaired users to participate in Discord voice chats by transcribing spoken content into text and reading back responses aloud. It offers customizable TTS voices and low-latency streaming for seamless interaction.
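The use cases above share one loop: transcribe incoming speech, pass the text to the model, and speak the reply. The following is a minimal sketch of that loop, not the skill's actual implementation. The `VoiceTurn` class and the `fake_stt`, `fake_llm`, and `fake_tts` stand-ins are hypothetical names introduced for illustration; a real deployment would substitute its chosen STT, Claude, and TTS clients.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoiceTurn:
    """One speech-in, speech-out turn built from three pluggable stages (hypothetical)."""
    transcribe: Callable[[bytes], str]   # speech-to-text stage
    respond: Callable[[str], str]        # language-model stage
    synthesize: Callable[[str], bytes]   # text-to-speech stage

    def run(self, audio: bytes) -> bytes:
        text = self.transcribe(audio)
        if not text.strip():
            # Nothing intelligible was said, so stay silent.
            return b""
        reply = self.respond(text)
        return self.synthesize(reply)


# Stand-ins so the sketch runs without any external service.
def fake_stt(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_llm(text: str) -> str:
    return f"You said: {text}"


def fake_tts(text: str) -> bytes:
    return text.encode("utf-8")


turn = VoiceTurn(fake_stt, fake_llm, fake_tts)
print(turn.run(b"hello"))
```

Keeping the three stages as plain callables makes it straightforward to swap one provider for another without touching the loop itself.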
Offers basic voice conversation features for free with limited API calls, while charging for advanced features like custom TTS voices, higher transcription accuracy, or priority support. Targets small Discord communities and indie developers.
Provides customized deployments for large organizations, including on-premise hosting, enhanced security, and integration with existing CRM or project management tools. Includes dedicated support and SLAs for reliability.
Charges based on usage metrics such as minutes of transcription, TTS characters processed, or number of voice channel interactions. Appeals to developers building scalable applications with variable demand.
💬 Integration Tip
Ensure system dependencies like ffmpeg are installed and test with different STT/TTS providers to balance cost and latency based on your use case.
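One way to confirm that system dependencies are present before starting the bot is to look them up on the PATH. The sketch below uses Python's standard `shutil.which`; the helper name `missing_tools` is an assumption made for this example and is not part of the skill.

```python
import shutil


def missing_tools(tools=("ffmpeg",)):
    """Return the names in `tools` that cannot be found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing system dependencies:", ", ".join(missing))
    else:
        print("All system dependencies found.")
```

Running this check at startup surfaces a missing binary immediately, rather than partway through a voice session.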
Scored Apr 15, 2026
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Local text-to-speech via sherpa-onnx (offline, no cloud).
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.