Logo
ClawHub Skills Lib
HomeTrending
Home/🤖 AI & Agents/🎤 Speech & Audio

🎤 Speech & Audio AI Skills

178 AI agent skills for Speech & Audio. Part of the 🤖 AI & Agents category.

Speech & Audio Skills — Page 3

178 skills
🎤Speech & Audio

Transcribe Audio with Parakeet MLX

parakeet-mlx
kylehowells
v1.0.0
View Details

Local speech-to-text with Parakeet MLX (ASR) for Apple Silicon (no API key).

2
1.6k
3d ago
🎤Speech & Audio

Alicloud Ai Audio Tts

alicloud-ai-audio-tts
cinience
v1.0.3
View Details

Generate human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text to speech,...

2
702
today
🎤Speech & Audio

Zhipu AI ASR

zhipu-asr
franklu0819-lang
v1.0.1
View Details

Automatic Speech Recognition (ASR) using Zhipu AI (BigModel) GLM-ASR model. Use when you need to transcribe audio files to text. Supports Chinese audio trans...

2
175
today
🎤Speech & Audio

WebSocket

websocket
ivangdavila
v1.0.0
View Details

Implement reliable WebSocket connections with proper reconnection, heartbeats, and scaling.

2
651
2
3d ago
🎤Speech & Audio

Alicloud Ai Audio Tts Realtime

alicloud-ai-audio-tts-realtime
cinience
v1.0.0
View Details

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instruction-controlled realtime synthesis.

2
325
today
🎤Speech & Audio

audio-broadcast

audio-broadcast
oxiaom
v1.0.1
View Details

控制小播鼠广播系统进行音频播放和广播通知。使用当用户需要向广播设备播放音频、设置音量、管理定时广播任务、或查看设备状态时。支持播放音频文件、URL播放、音量调节、设备管理、定时任务管理、文字转语音(TTS)广播等功能。Control xiaoboshu broadcast system for audio pla...

2
238
3d ago
🎤Speech & Audio

Ai Podcast Creation

ai-podcast-creation
okaris
v0.1.5
View Details

Create AI-powered podcasts with text-to-speech, music, and audio editing. Tools: Kokoro TTS, DIA TTS, Chatterbox, AI music generation, media merger. Capabili...

2
809
today
🎤Speech & Audio

Farmos Observations

farmos-observations
brianppetty
v1.0.0
View Details

Query and create field observations and AI-processed captures. Photos, voice notes, and text notes from the field.

2
136
today
🎤Speech & Audio

RingBot

ringbot
gbessoni
v1.1.0
View Details

Make outbound AI phone calls. Use when asked to call a business, make a phone call, order food by phone, schedule appointments, or any task requiring voice calls. Triggers on "call", "phone", "dial", "ring", "order pizza", "make reservation", "schedule appointment".

2
1.9k
3
2d ago
🎤Speech & Audio

Anova Oven

anova-skill
dodeja
v0.1.0
View Details

Control Anova Precision Ovens and Precision Cookers (sous vide) via WiFi WebSocket API. Start cooking modes (sous vide, roasting, steam), set temperatures, monitor status, and stop cooking remotely.

2
1.3k
3d ago
🎤Speech & Audio

pod-cog

pod-cog
nitishgargiitd
v1.0.2
View Details

A great podcast needs three things: compelling content, natural-sounding voices, and polished production. CellCog delivers all three — #1 on DeepResearch Bench (Feb 2026) for script depth, frontier multi-voice dialogue, and automatic music + editing. Podcast production, episode scripts, show notes, interview prep, audiograms — single prompt to finished MP3.

2
1.3k
2
3d ago
🎤Speech & Audio

say

say
tobihagemann
v1.0.2
View Details

Text-to-Speech via macOS say command with Siri Natural Voices. Use for generating speech audio, TTS clips, or speaking text aloud on macOS.

2
273
3d ago
🎤Speech & Audio

Eachlabs Voice Audio

eachlabs-voice-audio
eftalyurtseven
v0.1.0
View Details

Text-to-speech, speech-to-text, voice conversion, and audio processing using EachLabs AI models. Supports ElevenLabs TTS, Whisper transcription with diarization, and RVC voice conversion. Use when the user needs TTS, transcription, or voice conversion.

2
627
today
🎤Speech & Audio

Zhipu AI TTS

zhipu-tts
franklu0819-lang
v1.0.0
View Details

Text-to-speech conversion using Zhipu AI (BigModel) GLM-TTS model. Use when you need to convert text to audio files with various voice options. Supports Chin...

2
202
3d ago
🎤Speech & Audio

Podcast to Substack

podcast-to-substack
danielfoch
v1.0.0
View Details

Publish podcast episodes from RSS and Notion to Substack with Apple Podcasts embeds and images, then generate LinkedIn-ready companion posts.

+1
2
303
today
🎤Speech & Audio

Youtube Podcast summarizer via Elevenlabs

youtube-voice-summarizer-elevenlabs
Franciscoandsam
v1.0.0
View Details

Transform YouTube videos into podcast-style voice summaries using ElevenLabs TTS

2
1.4k
yesterday
🎤Speech & Audio

macOS Local Voice

macos-local-voice
STRRL
v1.0.0
View Details

Local STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-speech via say + ffmpeg. Fully offline, no API keys required. Includes voice quality detection and smart voice selection.

2
748
today
🎤Speech & Audio

ElevenLabs Phone Reminder (Lite)

elevenlabs-phone-reminder-lite
dAAAb
v1.1.0
View Details

Build AI phone call reminders with ElevenLabs Conversational AI + Twilio. Free starter guide.

2
1.6k
2
yesterday
🎤Speech & Audio

The Botcast

botcast
cpascoli
v1.0.1
View Details

The Botcast — a podcast platform for AI agents. Be a guest or host on long-form interview episodes. Use when an agent is invited to The Botcast, wants to participate in a podcast episode, or needs to interact with The Botcast API.

1
344
3d ago
🎤Speech & Audio

通义晓蜜 - 智能外呼

xiaomi-outbound-call
Raven-XIA
v1.0.1
View Details

触发阿里云晓蜜外呼机器人任务,自动批量拨打电话。适用于批量外呼、客户回访、满意度调查、简历筛查约面试等场景。可从前置工具或节点获取外呼名单。

1
1.5k
today
🎤Speech & Audio

Aliyun TTS

aliyun-tts
guang384
v1.0.0
View Details

Alibaba Cloud Text-to-Speech synthesis service.

1
1.8k
2d ago
🎤Speech & Audio

Audio Visualization

audio-visualization
eftalyurtseven
v1.0.0
View Details

Generate audio visualization videos using each::sense AI. Create waveforms, spectrum analyzers, particle effects, 3D visualizations, and beat-synced animatio...

1
93
3d ago
🎤Speech & Audio

Faster Whisper Local Service

faster-whisper-local-service
neldar
v0.1.7
View Details

OpenClaw local speech-to-text backend using faster-whisper over HTTP on 127.0.0.1:18790. Use when you want voice transcription without external APIs, without...

+9
1
565
today
🎤Speech & Audio

Openai Tts.Bak 2026 01 28T18:01:23+10:30

openai-tts-bak-2026-01-28t18-01-23-10-30
nicoataiza
v1.0.0
View Details

Text-to-speech via OpenAI Audio Speech API.

1
982
2d ago
←1234…8→

Data sourced from clawhub.ai · Built with Next.js, Supabase, Prisma