Logo
ClawHub Skills Lib
HomeCategoriesUse CasesTrendingBlog
HomeCategoriesUse CasesTrendingBlog
ClawHub Skills Lib
ClawHub Skills Lib

Browse 27,000+ community-built AI agent skills for OpenClaw. Updated daily from clawhub.ai.

Explore

  • Home
  • Trending
  • Use Cases
  • Blog

Categories

  • Development
  • AI & Agents
  • Productivity
  • Communication
  • Data & Research
  • Business
  • Platforms
  • Lifestyle
  • Education
  • Design

Use Cases

  • Security Auditing
  • Workflow Automation
  • Finance & Fintech
  • MCP Integration
  • Crypto Trading
  • Web3 & DeFi
  • Data Analysis
  • Social Media
  • 中文平台技能
  • All Use Cases →
© 2026 ClawHub Skills Lib. All rights reserved.Built with Next.js · Neon · Prisma
Home/Use Cases/🎙️ Voice & Audio AI/🔊 Text-to-Speech

🔊 Text-to-Speech AI Skills

Convert text to natural-sounding speech with AI voice models and custom voice styles.

457 skillsPart of 🎙️ Voice & Audio AI

457 skills found

Page 1 of 20

🎤Speech & Audio

Openai Whisper Api

openai-whisper-api
steipete
v1.0.0
View Details

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

+1
835
14.8k
30
7d ago
🎤Speech & Audio

Openai Whisper

openai-whisper
steipete
v1.0.0
View Details

Local speech-to-text with the Whisper CLI (no API key).

356
13.4k
70
25d ago
🎤Speech & Audio

Sag

sag
steipete
v1.0.0
View Details

ElevenLabs text-to-speech with mac-style say UX.

272
6.8k
8
25d ago
🎤Speech & Audio

Voice Wake Say

voice-wake-say
xadenryan
v1.0.1
View Details

Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").

36
5.9k
4
7d ago
🎤Speech & Audio

Jarvis Voice

jarvis-voice
globalcaos
v2.2.1
View Details

Turn your AI into JARVIS. Voice, wit, and personality — the complete package. Humor cranked to maximum.

27
4.4k
3
8d ago
🎤Speech & Audio

Transcribe

transcribe
javicasper
v1.0.2
View Details

Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.

27
2.6k
2
7d ago
🎤Speech & Audio

Edge TTS

edge-tts
i3130002
v2.0.0
View Details

Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

25
3.8k
6
25d ago
🎤Speech & Audio

whisper

whisper
fiddlybit
v1.0.0
View Details

End-to-end encrypted agent-to-agent private messaging via Moltbook dead drops. Use when agents need to communicate privately, exchange secrets, or coordinate without human visibility.

24
2.4k
24d ago
🎤Speech & Audio

Voice Agent

voice-agent
ricardotrevisan
v1.1.0
View Details

Local Voice Input/Output for Agents using the AI Voice Agent API.

23
3k
7d ago
🎤Speech & Audio

OpenAI TTS

openai-tts
pors
v1.0.0
View Details

Text-to-speech via OpenAI Audio Speech API.

22
3.5k
4
24d ago
🎤Speech & Audio

Faster Whisper

faster-whisper
ThePlasmak
v1.5.1
View Details

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

22
5.1k
4
7d ago
🎤Speech & Audio

Elevenlabs Tts

elevenlabs-tts
Shaharsha
v2.2.0
View Details

ElevenLabs TTS (Text-to-Speech) with emotional audio tags for expressive voice synthesis. WhatsApp-compatible voice messages with Opus conversion. Supports 7...

21
5k
6
7d ago
🎤Speech & Audio

Speech To Text

speech-to-text
okaris
v0.1.5
View Details

Transcribe audio to text with Whisper models via inference.sh CLI. Models: Fast Whisper Large V3, Whisper V3 Large. Capabilities: transcription, translation,...

20
1.8k
7d ago
🎤Speech & Audio

ElevenLabs Voices

elevenlabs-voices
robbyczgw-cla
v2.1.6
View Details

High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.

20
5.6k
16
7d ago
🎤Speech & Audio

Alexa CLI

alexa-cli
buddyh
v1.3.0
View Details

Control Amazon Alexa devices and smart home via the `alexacli` CLI. Use when a user asks to speak/announce on Echo devices, control lights/thermostats/locks, send voice commands, or query Alexa.

18
2.9k
13
25d ago
🎤Speech & Audio

audio-cog

audio-cog
nitishgargiitd
v1.0.3
View Details

AI audio generation powered by CellCog. Text-to-speech, voice synthesis, voiceovers, podcast audio, narration, music generation, background music, sound design. Professional audio creation with AI.

15
2.9k
2
25d ago
🎤Speech & Audio

Discord Voice

discord-voice
avatarneil
v0.1.6
View Details

Real-time voice conversations in Discord voice channels with Claude AI

+1
14
3.3k
3
25d ago
🎤Speech & Audio

Voice Transcribe

voice-transcribe
darinkishore
v1.0.1
View Details

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

13
3.7k
10
24d ago
🎤Speech & Audio

ElevenLabs Music

elevenlabs-music
clawdbotborges
v1.0.1
View Details

Generate music from text prompts using ElevenLabs Eleven Music API. Use when creating songs, soundtracks, jingles, lullabies, or any audio music from descriptions. Supports vocals with AI-generated lyrics, instrumental tracks, and multiple genres/styles. Requires paid ElevenLabs plan.

13
2.5k
1
25d ago
🎤Speech & Audio

Qwen3-tts

qwen-tts
paki81
v1.0.0
View Details

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.

12
2k
6
23d ago
🎤Speech & Audio

Kokoro TTS

kokoro-tts
edkief
v0.1.0
View Details

Generate spoken audio from text using the local Kokoro TTS engine. Use when the user asks to "say" something, requests a voice message, or wants text converted to speech.

12
3.1k
24d ago
🎤Speech & Audio

Vocal Chat

vocal-chat
rubenfb23
v1.0.0
View Details

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

+1
12
2.6k
8
18d ago
🎤Speech & Audio

speech-recognition

speech-recognition
demo112
v1.0.1
View Details

通用语音识别 Skill。支持多种音频格式(ogg/mp3/wav/m4a),使用硅基流动 SenseVoice API 进行语音转文字。当用户发送语音消息、音频文件,或需要转录音频时触发。

12
1.3k
2
25d ago
🎤Speech & Audio

Phone Voice Integration

phone-voice
cortexuvula
v2.0.0
View Details

Connect ElevenLabs Agents to your OpenClaw via phone with Twilio. Includes caller ID auth, voice PIN security, call screening, memory injection, and cost tracking.

+1
11
2.3k
4
7d ago
…

Other 🎙️ Voice & Audio AI Phases

📝
Transcription & STT
Transcribe audio and video files to text with speaker labels and timestamps.
🌍
Audio Translation
Translate spoken content across languages — transcribe, translate, and re-synthesize.
🎚️
Audio Processing
Clean audio, remove noise, separate vocals, and process audio files at scale.