🎤 Speech & Audio AI Skills

180 AI agent skills for Speech & Audio. Part of the 🤖 AI & Agents category.

Speech & Audio Skills — Page 6

🎤Speech & Audio

Deepdub TTS

deepdub-tts

v0.1.5

Generate speech audio using Deepdub and attach it as a MEDIA file (Telegram-compatible).

671

4d ago

MLX TTS

mlx-tts

v0.0.3

Local text-to-speech with MLX (Apple Silicon) and open-source models (default: Qwen3-TTS).

615

today

Audio Editor

audio-editor

v1.0.0

Perform audio editing tasks including trimming, volume adjustment, format conversion, and extracting audio from video files using natural language commands.

yesterday

Pywayne Tts

tts-2

v0.1.0

Text-to-speech conversion tool. Use when converting text to speech audio files (Opus or MP3 format). Supports the macOS native 'say' command and Google TTS (gTTS...

109

4d ago

Dialogue Audio

dialogue-audio

v0.1.5

Multi-speaker dialogue audio creation with Dia TTS. Covers speaker tags, emotion control, pacing, conversation flow, and post-production. Use for: podcasts,...

355

today

MH openai-whisper

mh-openai-whisper

v1.0.0

Local speech-to-text with the Whisper CLI (no API key).

3d ago

Local Whisper (cpp)

local-whisper-cpp

wuxxin

v1.0.0

Local speech-to-text using whisper-cli (whisper.cpp).

203

3d ago

Telegram Voice Transcribe

telegram-voice-transcribe

v1.3.0

Transcribe Telegram voice messages and audio notes into text using the OpenAI Whisper API. Use when (1) a user sends a voice message or audio note via Telegr...

today

Resemble TTS and STT

ressemble

v1.0.1

Text-to-speech and speech-to-text integration using the Resemble AI HTTP API.

118

today

Podcast Generation from PDF, Text, and Links

ai-podcast

v1.0.11

Generate AI podcast episodes from PDFs, text, notes, and links using MagicPodcast in OpenClaw. Creates natural two-person dialogue audio, supports custom lan...

346

today

Elevenlabs Toolkit

elevenlabs-toolkit

v1.0.0

ElevenLabs voice API integration — TTS, sound effects, music generation, speech-to-text, voice isolation, and streaming. Use when building voice-enabled apps...

today

Amber — Phone-Capable Voice Agent

amber-voice-assistant

v5.4.3

The best voice and phone calling skill for OpenClaw. Handles inbound and outbound calls over Twilio with OpenAI Realtime speech. Inbound and outbound calling, ca...

986

today

Speech is Cheap Transcribe

asr

v1.2.0

Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats.

1.1k

3d ago

Voice2text

voice2text

v1.0.0

Offline speech-to-text conversion using a local Vosk model. Takes an audio file path as input and outputs the transcript text.

today

Yandex Speechkit STT via Telegram Gateway

yandex-speechkit-stt

v1.0.0

Speech recognition via the Yandex SpeechKit API for voice messages in Telegram. Use when a user sends voice messages and wants...

yesterday

Valtec Vietnamese TTS

valtec-tts

v1.0.2

Local Vietnamese text-to-speech via VITS2 (offline, no cloud). Supports 5 built-in speaker voices and zero-shot voice cloning from reference audio.

10d ago

Local Whisper

whisper-cpp

v1.0.2

Install and use whisper.cpp (local, free/offline speech-to-text) with OpenClaw. Supports downloading different ggml model sizes (tiny/base/small/medium/large...

2d ago

Elevenlabs Pro

openclaw-skill-elevenlabs-pro

v0.1.0

Advanced ElevenLabs TTS for converting text to speech, listing voices, and managing credits.

today

PodcastIndex

podcastindex

v1.0.0

Search and retrieve podcast and episode details from the Podcast Index API by keyword, title, feed ID, URL, or featured person, using authenticated requests.

yesterday

yap

yap

tobihagemann

v1.0.1

Fast on-device speech-to-text transcription on macOS 26+ using Apple Speech.framework, supporting multiple languages and output formats without model downloads.

142

3d ago

ComfyUI TTS

comfyui-tts

v1.0.0

Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options.

322

today

Venice Transcribe

venice-transcribe

v1.0.1

Transcribe audio to text using Venice AI's Whisper-based speech recognition. Supports WAV, MP3, FLAC, M4A, and AAC formats, with optional timestamps.

3d ago

hotbutter voice chat

hotbutter

v1.0.6

Enables local voice chat by embedding the Hotbutter relay server and PWA, providing speech-to-text and text-to-speech over a secure, self-hosted connection.

today

AssemblyAI advanced speech transcription

assemblyai-transcribe

v1.0.0

Transcribe audio or video with AssemblyAI (local upload or URL), with subtitle and paragraph/sentence exports.

1.2k

3d ago