faster-whisper-local
Local speech-to-text using faster-whisper. High-performance transcription with GPU acceleration support. Includes word-level timestamps and distilled models...
Install via ClawdBot CLI:
clawdbot install Damirikys/faster-whisper-local
Grade: Fair (based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals).
Generated Mar 20, 2026
Automatically transcribe podcast episodes for creating show notes, subtitles, and searchable archives. Word-level timestamps enable precise editing and content indexing, improving workflow efficiency.
Transcribe interviews, lectures, and focus group recordings for qualitative analysis. Local processing keeps recordings on your own hardware, which helps protect participant privacy, while GPU acceleration speeds up handling of large audio datasets.
Convert court proceedings, depositions, and client meetings into text records. Selecting a specific model such as large-v3-turbo lets you trade speed against accuracy; transcripts intended for critical legal documentation should still be reviewed by a person.
Generate transcriptions for videos and audio in multiple languages to create subtitles or dubbed content. Multilingual model support facilitates global content distribution and accessibility.
Transcribe patient consultations and medical notes for electronic health records. Local deployment with GPU acceleration provides fast processing and keeps audio on-premise, which can support compliance with healthcare privacy regulations.
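Several of the use cases above rely on word-level timestamps, for example to build subtitles. The following is a minimal sketch of that workflow, assuming the skill wraps the faster-whisper Python package; the skill's own interface is not shown in this listing. The helpers `srt_timestamp`, `words_to_srt`, and `transcribe_words` are hypothetical names chosen for illustration, and whether the `large-v3-turbo` model name is accepted depends on the installed faster-whisper version.

```python
from typing import Iterable, List, Tuple

# (start_seconds, end_seconds, text) for one transcribed word
Word = Tuple[float, float, str]


def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def words_to_srt(words: Iterable[Word], max_words: int = 7) -> str:
    """Group words into cues of at most max_words and render them as SRT."""
    words = list(words)
    cues: List[str] = []
    for index, start in enumerate(range(0, len(words), max_words), 1):
        chunk = words[start:start + max_words]
        text = " ".join(w[2].strip() for w in chunk)
        span = f"{srt_timestamp(chunk[0][0])} --> {srt_timestamp(chunk[-1][1])}"
        cues.append(f"{index}\n{span}\n{text}\n")
    return "\n".join(cues)


def transcribe_words(path: str, model_size: str = "large-v3-turbo") -> List[Word]:
    """Hypothetical helper: collect word timings via faster-whisper."""
    # Imported lazily so the formatting helpers work without the package.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="auto", compute_type="default")
    segments, _info = model.transcribe(path, word_timestamps=True)
    return [(w.start, w.end, w.word) for seg in segments for w in seg.words]
```

With the package installed, `words_to_srt(transcribe_words("episode.mp3"))` would produce subtitle text; `episode.mp3` is a placeholder file name.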
Offer a cloud-based or on-premise transcription platform using this skill for businesses needing high-volume audio processing. Charge per minute of audio transcribed, with tiered pricing based on model accuracy and features like word timestamps.
Provide custom integration services to embed this skill into existing workflows for media companies, legal firms, or research institutions. Offer setup, optimization, and ongoing support for GPU acceleration and model selection.
Sell pre-configured hardware systems (e.g., servers with NVIDIA GPUs) bundled with this skill for organizations requiring on-premise, high-performance transcription. Include setup scripts and maintenance services.
💬 Integration Tip
Ensure ffmpeg and Python 3.10+ are installed system-wide, and use the setup script to automate GPU detection for optimal performance in production environments.
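The prerequisites in this tip can be checked before running anything heavier. The sketch below is not the skill's setup script, which is not shown in this listing; it is an illustrative pre-flight check using only the Python standard library. The names `check_environment` and `pick_device` are hypothetical, and looking for `nvidia-smi` on the PATH is only a rough heuristic for the presence of an NVIDIA GPU driver.

```python
import shutil
import sys


def check_environment(min_python=(3, 10)):
    """Report whether the listed prerequisites appear to be present."""
    return {
        "python_ok": sys.version_info >= min_python,
        "ffmpeg_found": shutil.which("ffmpeg") is not None,
        # Heuristic only: nvidia-smi on PATH suggests an NVIDIA driver exists.
        "nvidia_smi_found": shutil.which("nvidia-smi") is not None,
    }


def pick_device(env):
    """Choose "cuda" when a GPU driver seems available, otherwise "cpu"."""
    return "cuda" if env["nvidia_smi_found"] else "cpu"


if __name__ == "__main__":
    env = check_environment()
    for name, ok in env.items():
        print(f"{name}: {'yes' if ok else 'no'}")
    print("suggested device:", pick_device(env))
```

A check like this only confirms that the tools are discoverable; it does not verify that CUDA libraries are correctly installed.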
Scored Apr 15, 2026
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.