macos-local-voiceLocal STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-speech via say + ffmpeg. Fully offline, no API keys required. Includes voice quality detection and smart voice selection.
Install via ClawdBot CLI:
clawdbot install strrl/macos-local-voiceGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://openclaw.aiAudited Apr 16, 2026 · audit v1.0
Generated Mar 1, 2026
Enables local, offline voice interactions for customer support bots on macOS, handling inquiries in languages like English, Chinese, or Japanese without cloud dependencies. Ideal for privacy-sensitive industries where data must stay on-device, reducing latency and API costs.
Provides text-to-speech and speech-to-text capabilities for educational apps on macOS, assisting students with disabilities by converting study materials to audio or transcribing lectures locally. Supports multiple languages to cater to diverse learners in offline environments.
Facilitates voiceover generation and transcription for media producers on macOS, allowing creators to dub videos or transcribe interviews in languages such as Spanish or French without internet access. Uses high-quality premium voices for professional audio output.
Assists healthcare professionals on macOS by transcribing patient consultations locally to maintain confidentiality, with support for medical terminology in languages like German or Russian. Integrates with voice notes for secure, offline record-keeping without cloud risks.
Powers offline voice assistants for travel apps on macOS, enabling tourists to get spoken translations or transcribe local conversations in languages such as Thai or Vietnamese. Leverages smart voice selection for natural interactions in diverse regions.
Offer the skill as a free component in macOS productivity tools, with premium features like advanced voice quality detection or custom voice packs available for purchase. Targets developers and businesses seeking to enhance apps with offline voice capabilities.
License the skill to enterprises for integration into internal systems like call centers or training platforms, providing local, secure voice processing without cloud dependencies. Includes support for multiple languages and compliance with data privacy regulations.
Sell the skill as part of a developer toolkit for macOS app creators, including documentation and support for implementing STT and TTS in applications. Appeals to indie developers and startups building voice-enabled tools for education or media.
💬 Integration Tip
Ensure all required binaries like yap and ffmpeg are installed via Homebrew, and use the voices.mjs script to check voice availability before TTS calls to avoid silent fallbacks.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.