elevenlabs-sttTranscribe audio files using ElevenLabs Speech-to-Text (Scribe v2).
Install via ClawdBot CLI:
clawdbot install clawdbotborges/elevenlabs-sttRequires:
Grade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 1, 2026
Transcribe recorded business meetings with speaker diarization to identify who said what, enabling efficient minute-taking and action item extraction. Useful for remote teams and compliance documentation.
Convert podcast audio into text transcripts with timestamps for SEO optimization, accessibility, and repurposing into blog posts or social media snippets. Supports multiple languages for global content.
Transcribe customer service calls to analyze sentiment, identify common issues, and train AI models for automated responses. Speaker diarization helps track agent and customer interactions.
Transcribe university lectures or educational videos into text for student notes, accessibility for hearing-impaired learners, and content indexing for searchable archives.
Accurately transcribe legal depositions with speaker identification and timestamps for evidence tracking and case preparation. JSON output facilitates integration with legal databases.
Offer a cloud-based transcription service with tiered pricing based on audio length or features like diarization and event tagging. Target small businesses and freelancers needing regular transcription.
License the transcription API to developers and enterprises for integration into their own applications, such as call centers or content management systems, with pay-per-use or bulk pricing.
Provide a customizable transcription platform for resellers like media companies or educational institutions, allowing them to brand it as their own service with added features.
💬 Integration Tip
Set the ELEVENLABS_API_KEY environment variable and use the provided shell scripts for easy command-line integration into automated workflows.
Scored Apr 15, 2026
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.