ClawHub Skills Lib

🎤 Speech & Audio AI Skills

181 AI agent skills for Speech & Audio. Part of the 🤖 AI & Agents category.

Speech & Audio Skills — Page 5

🎤Speech & Audio

Dialogue Audio

dialogue-audio
v0.1.5

Multi-speaker dialogue audio creation with Dia TTS. Covers speaker tags, emotion control, pacing, conversation flow, and post-production. Use for: podcasts,...

355
today
🎤Speech & Audio

Ai Podcast Pipeline

ai-podcast-pipeline
jeong-wooseok
v0.1.5

Create Korean AI podcast packages from QuickView trend notes. Use for dual-host script writing (Callie × Nick), Gemini multi-speaker TTS audio generation, subtitle timing/render fixes, thumbnail + MP4 packaging, and YouTube title/description output. Supports both full (15 to 20 min) and compressed (5 to 7 min) editions.

404
4d ago
🎤Speech & Audio

Local Whisper

whisper-mlx-local
v1.5.0

Free local speech-to-text for Telegram and WhatsApp using MLX Whisper on Apple Silicon. Private, no API costs.

1.4k
7
3d ago
🎤Speech & Audio

Piper TTS

beware-piper-tts
v1.0.1

Local text-to-speech using Piper for voice message delivery. Use when the user asks for voice responses, audio messages, TTS, text-to-speech, voice notes, or...

195
today
🎤Speech & Audio

Podcastifier

agents-skill-podcastifier
v0.1.0

Turn incoming text (email/newsletter) into a short TTS podcast with chunking + ffmpeg concat.

96
3d ago
🎤Speech & Audio

Pywayne Tts

tts-2
v0.1.0

Text-to-speech conversion tool. Use when converting text to speech audio files (opus or mp3 format). Supports macOS native 'say' command and Google TTS (gTTS...

109
4d ago
🎤Speech & Audio

Whisnap

whisnap
v1.0.0

macOS CLI for transcribing audio and video files using local Whisper models or Whisnap Cloud.

95
4d ago
🎤Speech & Audio

Pullthatupjamie

pullthatupjamie
v1.5.2

PullThatUpJamie — Podcast Intelligence. A semantically indexed podcast corpus (109+ feeds, ~7K episodes, ~1.9M paragraphs) that works as a vector DB for podc...

+2
144
1
3d ago
🎤Speech & Audio

Speech to Text Skill (Yandex SpeechKit) for OpenClaw

sergei-mikhailov-stt
v1.1.2

Speech recognition from voice messages using Yandex SpeechKit (with an extensible architecture for other providers). Use when you need to convert a voice mes...

173
yesterday
🎤Speech & Audio

Audio Editor

audio-editor
v1.0.0

Perform audio editing tasks including trimming, volume adjustment, format conversion, and extracting audio from video files using natural language commands.

yesterday
🎤Speech & Audio

Audio Content Generator

audio-gen
v1.0.0

Generate audiobooks, podcasts, or educational audio content on demand. User provides an idea or topic, Claude AI writes a script, and ElevenLabs converts it to high-quality audio. Supports multiple formats (audiobook, podcast, educational), custom lengths, and voice effects. Use when asked to create audio content, make a podcast, generate an audiobook, or produce educational audio. Returns MP3 audio file via MEDIA token.

1.9k
1
today
🎤Speech & Audio

MLX TTS

mlx-tts
v0.0.3

Local text-to-speech on Apple Silicon with MLX and open-source models (default: Qwen3-TTS).

+6
615
today
🎤Speech & Audio

ANY WHISPER API

any-whisper-api
v1.3.0

Transcribe audio through the Whisper API using any compatible local server.

75
2
today
🎤Speech & Audio

Speech to Text Transcription

speech-to-text-transcription
v1.0.0

Transcribe audio and video files to text with speaker detection, timestamps, and format conversion.

72
3d ago
🎤Speech & Audio

Qwen3 Tts Mlx

qwen3-tts-mlx
v2.1.0

Local Qwen3-TTS speech synthesis on Apple Silicon via MLX. Use for offline narration, audiobooks, video voiceovers, and multilingual TTS.

10
yesterday
🎤Speech & Audio

Audio Transcribe

audio-transcribe
AKTheKnight
v1.0.0

Auto-transcribe voice messages locally using faster-whisper with selectable Whisper models, no API key required.

191
3d ago
🎤Speech & Audio

Sapi Tts

sapi-tts
v1.1.0

Windows SAPI5 text-to-speech with Neural voices. A lightweight alternative to GPU-heavy TTS: zero GPU usage and instant generation. Auto-detects the best available voice for your language. Works on Windows 10/11.

713
3d ago
🎤Speech & Audio

Whisper Stt

openclaw-skill-whisper-stt
v0.1.0

Speech to text: uses OpenAI Whisper to transcribe audio files into text.

3d ago
🎤Speech & Audio

Voice Assistant

openclaw-voice-assistant
v1.0.4

Windows voice companion for OpenClaw. Custom wake word via Porcupine, local STT via faster-whisper, streamed responses over the gateway WebSocket, and ElevenLabs TTS with natural chime/thinking sounds. Supports multi-turn conversation with automatic follow-up listening, mic suppression to prevent feedback, and a system tray with pause/resume. Recommended voices: Matilda (XrExE9yKIg1WjnnlVkGX, free tier) or Ivy (MClEFoImJXBTgLwdLI5n, paid tier). Fully customizable wake word, voice, hotkey, and silence thresholds.

211
3d ago
🎤Speech & Audio

Deepdub TTS

deepdub-tts
v0.1.5

Generate speech audio using Deepdub and attach it as a MEDIA file (Telegram-compatible).

671
4
4d ago
🎤Speech & Audio

ElevenLabs CLI

elevenlabs-cli
v0.1.6

Command-line interface for ElevenLabs API enabling text-to-speech, speech-to-text, voice cloning, audio effects, dubbing, and resource management with full S...

+4
9
11d ago
🎤Speech & Audio

Telnyx Stt

telnyx-stt
v1.0.1

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

509
today
🎤Speech & Audio

Elevenlabs AI

elevenlabs-ai
codedao12
v1.0.0

Access ElevenLabs APIs for text-to-speech, speech-to-speech, realtime speech-to-text, voice/model management, and dialogue workflows with direct HTTP calls.

633
yesterday
🎤Speech & Audio

Gettr Transcribe

gettr-transcribe
v1.0.1

Download audio from a GETTR post or streaming page and transcribe it locally with MLX Whisper on Apple Silicon (with timestamps via VTT). Use when given a GE...

+1
111
today

Data sourced from clawhub.ai · Built with Next.js, Supabase, Prisma