Transcription & STT AI Agent Skills — Voice & Audio AI | ClawHub

🎤Speech & Audio

Openai Whisper

openai-whisper

steipete

v1.0.0

View Details

Local speech-to-text with the Whisper CLI (no API key).

76.1k

302

2mo ago

🎤Speech & Audio

Sag

sag

steipete

v1.0.0

View Details

ElevenLabs text-to-speech with mac-style say UX.

1.2k

24.5k

2mo ago

🎤Speech & Audio

Openai Whisper Api

openai-whisper-api

steipete

v1.0.0

View Details

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

968

16.9k

8d ago

🎤Speech & Audio

Edge TTS

edge-tts

i3130002

v2.0.0

View Details

Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

261

18.6k

2mo ago

🎤Speech & Audio

Local Whisper

local-whisper

araa47

v1.0.0

View Details

Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.

11.3k

2mo ago

🎤Speech & Audio

whisper

whisper

fiddlybit

v1.0.0

View Details

End-to-end encrypted agent-to-agent private messaging via Moltbook dead drops. Use when agents need to communicate privately, exchange secrets, or coordinate without human visibility.

4.5k

2mo ago

🎤Speech & Audio

OpenAI TTS

openai-tts

pors

v1.0.0

View Details

Text-to-speech via OpenAI Audio Speech API.

6.7k

2mo ago

🎤Speech & Audio

Openai Whisper 1.0.0

openai-whisper-1-0-0

czubi1928

v1.0.0

View Details

Local speech-to-text with the Whisper CLI (no API key).

990

2mo ago

🎤Speech & Audio

Mac TTS

mac-tts

kalijason

v1.0.0

View Details

Text-to-speech using macOS built-in `say` command. Use for voice notifications, audio alerts, reading text aloud, or announcing messages through Mac speakers. Supports multiple languages including Chinese (Mandarin), English, Japanese, etc.

6.2k

2mo ago

🎤Speech & Audio

Tts

tts

AMSTKO

v1.0.0

View Details

Convert text to speech using Hume AI (or OpenAI) API. Use when the user asks for an audio message, a voice reply, or to hear something "of vive voix".

3.7k

2mo ago

🎤Speech & Audio

Voice Transcribe

voice-transcribe

darinkishore

v1.0.1

View Details

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

5.8k

2mo ago

🎤Speech & Audio

Qwen3-tts

qwen-tts

paki81

v1.0.0

View Details

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.

3.4k

2mo ago

🎤Speech & Audio

Audio Cog

audio-cog

nitishgargiitd

v1.0.12

View Details

AI audio generation and text-to-speech powered by CellCog. Voiceover, narration, voice cloning, avatar voices, sound effects, music, podcasts, dialogue. Thre...

5.2k

14d ago

🎤Speech & Audio

Voice Wake Say

voice-wake-say

xadenryan

v1.0.1

View Details

Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").

5.9k

8d ago

🎤Speech & Audio

Kokoro TTS

kokoro-tts

edkief

v0.1.0

View Details

Generate spoken audio from text using the local Kokoro TTS engine. Use when the user asks to "say" something, requests a voice message, or wants text converted to speech.

6.3k

2mo ago

🎤Speech & Audio

Voice Reply

voice-reply

stolot0mt0m

v1.0.0

View Details

Local text-to-speech using Piper voices via sherpa-onnx. 100% offline, no API keys required. Use when user asks for a voice reply, audio response, spoken answer, or wants to hear something read aloud. Supports multiple languages including German (thorsten) and English (ryan) voices. Outputs Telegram-compatible voice notes with [[audio_as_voice]] tag.

4.2k

2mo ago

🎤Speech & Audio

Discord Voice

discord-voice

avatarneil

v0.1.6

View Details

Real-time voice conversations in Discord voice channels with Claude AI

5.7k

2mo ago

🎤Speech & Audio

Tarot from Univoice

tarot

yangsenessa

v1.0.0

View Details

A reflective tarot draw for emotional support (presence-first, non-clinical, non-predictive).

903

2mo ago

🎤Speech & Audio

Transcribe

transcribe

javicasper

v1.0.2

View Details

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

2.6k

8d ago

🎤Speech & Audio

speech-recognition

speech-recognition

demo112

v1.0.1

View Details

通用语音识别 Skill。支持多种音频格式（ogg/mp3/wav/m4a），使用硅基流动 SenseVoice API 进行语音转文字。当用户发送语音消息、音频文件，或需要转录音频时触发。

4.7k

2mo ago

🎤Speech & Audio

Vocal Chat

vocal-chat

rubenfb23

v1.0.0

View Details

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

3.6k

2mo ago

🎤Speech & Audio

Voice Agent

voice-agent

ricardotrevisan

v1.1.0

View Details

Local Voice Input/Output for Agents using the AI Voice Agent API.

8d ago

🎤Speech & Audio

Mlx Whisper

mlx-whisper

Kevin37Li

v1.0.0

View Details

Local speech-to-text with MLX Whisper (Apple Silicon optimized, no API key).

3.8k

2mo ago

🎤Speech & Audio

Faster Whisper

faster-whisper

theplasmak

v1.5.1

View Details

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

5.1k

8d ago

📝 Transcription & STT AI Skills

Openai Whisper

Sag

Openai Whisper Api

Edge TTS

Local Whisper

whisper

OpenAI TTS

Openai Whisper 1.0.0

Mac TTS

Tts

Voice Transcribe

Qwen3-tts

Audio Cog

Voice Wake Say

Kokoro TTS

Voice Reply

Discord Voice

Tarot from Univoice

Transcribe

speech-recognition

Vocal Chat

Voice Agent

Mlx Whisper

Faster Whisper

Other 🎙️ Voice & Audio AI Phases