ClawHub Skills Lib

Browse 50,000+ community-built AI agent skills for OpenClaw. Updated daily from clawhub.ai.

Explore

  • Home
  • Categories
  • Use Cases
  • Trending
  • Blog

Categories

  • Development
  • AI & Agents
  • Productivity
  • Communication
  • Data & Research
  • Business
  • Platforms
  • Lifestyle
  • Education
  • Design

Use Cases

  • AI Code Generation
  • Code Review & Testing
  • DevOps & Cloud
  • Security & Compliance
  • Build an AI Agent
  • Agent Memory & RAG
  • Multi-Agent Orchestration
  • Browser & Web Automation
  • Financial & Market Data
  • Crypto & Web3
  • Real-Time Web Search
  • News & Media Monitoring
  • Academic Research
  • Data & Analytics
  • AI Image Generation
  • Voice & Audio AI
  • AI Video Creation
  • Content Writing
  • Task & Project Management
  • Knowledge Management
  • Email & Messaging
  • SEO & Content Marketing
  • Sales & CRM
  • Workflow Automation
  • Social Media
  • Chinese Platforms
  • E-Commerce
  • Education & Tutoring
  • HR & Recruiting
  • Legal & Compliance
© 2026 ClawHub Skills Lib. All rights reserved. Built with Next.js · Neon · Prisma

🔊 Text-to-Speech AI Skills

Convert text to natural-sounding speech with AI voice models and custom voice styles.

653 skills · Part of 🎙️ Voice & Audio AI

Page 1 of 28

🎤Speech & Audio

Openai Whisper

openai-whisper
steipete
v1.0.0

Local speech-to-text with the Whisper CLI (no API key).

2.1k
84k
321
3mo ago
🎤Speech & Audio

Sag

sag
steipete
v1.0.0

ElevenLabs text-to-speech with a macOS-style `say` UX.

1.3k
26.7k
26
3mo ago
🎤Speech & Audio

Edge TTS

edge-tts
i3130002
v2.0.0

Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

267
20.6k
31
3mo ago
🎤Speech & Audio

Sherpa ONNX TTS

sherpa-onnx-tts
danielsinewe
v0.1.0

Local text-to-speech via sherpa-onnx (offline, no cloud)

137
3.9k
1
1mo ago
🎤Speech & Audio

Voice Call

voice-call
danielsinewe
v0.1.0

Start voice calls via the OpenClaw voice-call plugin.

134
3.8k
2
1mo ago
🎤Speech & Audio

macOS Local Voice

macos-local-voice
strrl
v1.0.0

Local STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-speech via say + ffmpeg. Fully offline, no API keys required. Includes voice quality detection and smart voice selection.

105
2.8k
1
1mo ago
🎤Speech & Audio

Cellcog

cellcog
nitishgargiitd
v2.0.15

Any-to-any AI sub-agent — research, images, video, audio, music, podcasts, avatars, voice cloning, documents, spreadsheets, dashboards, 3D models, diagrams,...

97
14.1k
8
1mo ago
🎤Speech & Audio

Local Whisper

local-whisper
araa47
v1.0.0

Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.

79
12.2k
12
3mo ago
🎤Speech & Audio

AssemblyAI Transcriber

assemblyai-transcriber
xenofex7
v1.1.0

Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.

71
1.9k
1mo ago
🎤Speech & Audio

Voice Message

voice-message
xmanrui
v1.0.4

Send voice messages across chat channels (Telegram, Discord, Feishu/Lark, Signal, WhatsApp, and others) using edge-tts for text-to-speech and ffmpeg for audi...

69
1.8k
2
1mo ago
🎤Speech & Audio

Faster Whisper Transcription

faster-whisper-transcribe
kalmuraee
v1.0.0

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.

66
1.8k
1mo ago
🎤Speech & Audio

Voice Wake Say

voice-wake-say
xadenryan
v1.0.1

Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").

65
9.7k
5
1mo ago
🎤Speech & Audio

Free Groq Voice Recognition

free-groq-voice
huixionghexiyi
v1.0.0

FREE voice recognition using Groq's complimentary Whisper API. Transcribe audio messages to text in 50+ languages at no cost. Perfect for voice-to-text autom...

64
1.7k
1mo ago
🎤Speech & Audio

Invoices

invoices
ivangdavila
v1.0.1

Capture, extract, and organize received invoices with automatic OCR, provider detection, and searchable archive.

64
1.7k
2
1mo ago
🎤Speech & Audio

VEED UGC

veed-ugc
pauldelavallaz
v1.0.1

Generate UGC-style promotional videos with AI lip-sync. Takes an image (person with product from Morpheus/Ad-Ready) and a script (pure dialogue), creates a video of the person speaking. Uses ElevenLabs for voice synthesis.

63
1.7k
5
1mo ago
🎤Speech & Audio

Local Vosk STT

local-vosk
sfkiwi
v1.0.1

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

62
1.6k
1mo ago
🎤Speech & Audio

LH Edge TTS

lh-edge-tts
liuhedev
v1.0.0

Text-to-speech conversion using Python edge-tts for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and sub...

62
1.7k
1mo ago
🎤Speech & Audio

OpenAI TTS

openai-tts
pors
v1.0.0

Text-to-speech via OpenAI Audio Speech API.

57
7.2k
6
3mo ago
🎤Speech & Audio

ComfyUI TTS

comfyui-tts
yhsi5358
v1.0.0

Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options.

57
1.5k
1mo ago
🎤Speech & Audio

Telnyx Stt

telnyx-stt
teamtelnyx
v1.0.1

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

55
1.5k
1mo ago
🎤Speech & Audio

Local Llama TTS

local-llama-tts
wuxxin
v1.0.0

Local text-to-speech using llama-tts (llama.cpp) and OuteTTS-1.0-0.6B model.

54
1.4k
1mo ago
🎤Speech & Audio

Piper TTS

beware-piper-tts
bewareofddog
v1.0.1

Local text-to-speech using Piper for voice message delivery. Use when the user asks for voice responses, audio messages, TTS, text-to-speech, voice notes, or...

53
1.4k
1mo ago
🎤Speech & Audio

Telnyx Tts

telnyx-tts
teamtelnyx
v1.0.0

Generate speech audio from text using Telnyx Text-to-Speech API. Use when you need to convert text to spoken audio, create voice messages, or generate audio content.

53
1.4k
1mo ago
🎤Speech & Audio

Openai Whisper 1.0.0

openai-whisper-1-0-0
czubi1928
v1.0.0

Local speech-to-text with the Whisper CLI (no API key).

52
1.3k
3mo ago
…

Other 🎙️ Voice & Audio AI Phases

📝 Transcription & STT
Transcribe audio and video files to text with speaker labels and timestamps.

🌍 Audio Translation
Translate spoken content across languages: transcribe, translate, and re-synthesize.

🎚️ Audio Processing
Clean audio, remove noise, separate vocals, and process audio files at scale.