whisper-local-api
Secure, offline, OpenAI-compatible local Whisper ASR endpoint for OpenClaw. Features faster-whisper (large-v3-turbo), built-in privacy with no cloud telemetr...
Install via ClawdBot CLI:
clawdbot install hantok/whisper-local-api
Grade: Fair, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://github.com/Hantok/local-whisper-backend.git
Audited Apr 16, 2026 · audit v1.0
Generated Mar 21, 2026
Doctors and medical staff can securely dictate patient notes and clinical observations using local voice-to-text. Keeping sensitive health data completely offline supports HIPAA compliance while streamlining documentation workflows.
Law firms can transcribe client interviews, depositions, and meeting recordings locally without risking confidential case information being exposed to cloud services. The high accuracy handles legal terminology effectively.
Financial professionals can use voice commands to query portfolio data, generate reports, or execute trades through their AI agent while ensuring sensitive financial information never leaves their secure office network.
Factory inspectors can verbally document quality issues, maintenance needs, or production metrics while walking the floor. The offline operation works reliably in industrial environments with limited internet connectivity.
Government agencies can transcribe classified briefings and sensitive meetings locally, eliminating cloud telemetry risks. The low-RAM footprint allows deployment on secure, air-gapped systems.
Offer managed deployment and support packages for businesses requiring guaranteed privacy compliance. Include regular security updates, custom integration support, and priority troubleshooting for mission-critical voice applications.
Pre-install the solution on dedicated edge computing devices optimized for whisper processing. Target organizations needing turnkey privacy solutions without technical setup overhead, such as law firms or medical clinics.
License the technology to software developers building privacy-focused applications. Provide SDKs, documentation, and commercial use rights for embedding local whisper capabilities into their own products.
💬 Integration Tip
Ensure audio files are in a supported format (MP3, WAV, etc.) before sending them to the /v1/audio/transcriptions endpoint, and verify that the service is running on localhost:9000 before attempting integration.
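The tip above can be sketched in Python using only the standard library. This is a minimal illustration, not the skill's own client: the port and path come from the tip, while the "file" and "model" form field names, the "whisper-1" model value, and a JSON response body are assumptions borrowed from the OpenAI transcription API convention. The helper names are hypothetical.

```python
import json
import mimetypes
import socket
import urllib.request
import uuid
from pathlib import Path

# Host, port, and path are taken from the integration tip.
BASE_URL = "http://localhost:9000"
ENDPOINT = "/v1/audio/transcriptions"


def service_is_listening(host="localhost", port=9000, timeout=2.0):
    """Return True if something accepts TCP connections on host:port."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def build_transcription_request(audio_path, model="whisper-1", base_url=BASE_URL):
    """Build a multipart/form-data POST request carrying one audio file.

    Assumption: the service accepts OpenAI-style "model" and "file" fields.
    """
    path = Path(audio_path)
    mime = mimetypes.guess_type(path.name)[0] or "application/octet-stream"
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        'Content-Disposition: form-data; name="model"\r\n\r\n'
        f"{model}\r\n"
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{path.name}"\r\n'
        f"Content-Type: {mime}\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        base_url + ENDPOINT,
        data=head + path.read_bytes() + tail,
        headers={"Content-Type": f"multipart/form-data; boundary={boundary}"},
        method="POST",
    )


def transcribe(audio_path, timeout=120):
    """Send the file and return the parsed response (assumed to be JSON)."""
    if not service_is_listening():
        raise RuntimeError("No service is listening on localhost:9000")
    request = build_transcription_request(audio_path)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

With the service running, calling transcribe("meeting.wav") on a local recording would send the file and return the decoded response; the exact fields in that response depend on the backend and are not confirmed here.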
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud).
Start voice calls via the OpenClaw voice-call plugin.