ClawHub Skills Lib

Browse 50,000+ community-built AI agent skills for OpenClaw. Updated daily from clawhub.ai.

© 2026 ClawHub Skills Lib. All rights reserved. Built with Next.js · Neon · Prisma
📝 Transcription & STT AI Skills

Transcribe audio and video files to text with speaker labels and timestamps.

375 skills · Part of 🎙️ Voice & Audio AI
Page 1 of 16

🎤Speech & Audio

OpenAI Whisper

openai-whisper
steipete
v1.0.0

Local speech-to-text with the Whisper CLI (no API key).

2.1k
84k
321
3mo ago
🎤Speech & Audio

Sag

sag
steipete
v1.0.0

ElevenLabs text-to-speech with a macOS-style `say` UX.

1.3k
26.7k
26
3mo ago
🎤Speech & Audio

OpenAI Whisper API

openai-whisper-api
steipete
v1.0.0

Transcribe audio via OpenAI Audio Transcriptions API (Whisper).

1.2k
25.7k
52
1mo ago
🎤Speech & Audio

Edge TTS

edge-tts
i3130002
v2.0.0

Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.

267
20.6k
31
3mo ago
🎤Speech & Audio

macOS Local Voice

macos-local-voice
strrl
v1.0.0

Local STT and TTS on macOS using native Apple capabilities. Speech-to-text via yap (Apple Speech.framework), text-to-speech via say + ffmpeg. Fully offline, no API keys required. Includes voice quality detection and smart voice selection.

105
2.8k
1
1mo ago
🎤Speech & Audio

Audio

audio
ivangdavila
v1.0.1

Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.

95
2.5k
2
1mo ago
🎤Speech & Audio

Local Whisper

local-whisper
araa47
v1.0.0

Local speech-to-text using OpenAI Whisper. Runs fully offline after model download. High quality transcription with multiple model sizes.

79
12.2k
12
3mo ago
🎤Speech & Audio

AssemblyAI Transcriber

assemblyai-transcriber
xenofex7
v1.1.0

Transcribe audio files with speaker diarization (who speaks when). Supports 100+ languages, automatic language detection, and timestamps. Use for meetings, interviews, podcasts, or voice messages. Requires AssemblyAI API key.

71
1.9k
1mo ago
🎤Speech & Audio

Faster Whisper Transcription

faster-whisper-transcribe
kalmuraee
v1.0.0

Transcribes local voice messages to text using Faster Whisper models for fast, privacy-focused speech recognition on audio files.

66
1.8k
1mo ago
🎤Speech & Audio

Free Groq Voice Recognition

free-groq-voice
huixionghexiyi
v1.0.0

FREE voice recognition using Groq's complimentary Whisper API. Transcribe audio messages to text in 50+ languages at no cost. Perfect for voice-to-text autom...

64
1.7k
1mo ago
🎤Speech & Audio

Local Vosk STT

local-vosk
sfkiwi
v1.0.1

Local speech-to-text using Vosk. Lightweight, fast, fully offline. Perfect for transcribing Telegram voice messages, audio files, or any speech-to-text task without cloud APIs.

62
1.6k
1mo ago
🎤Speech & Audio

OpenAI TTS

openai-tts
pors
v1.0.0

Text-to-speech via OpenAI Audio Speech API.

57
7.2k
6
3mo ago
🎤Speech & Audio

Telnyx STT

telnyx-stt
teamtelnyx
v1.0.1

Transcribe audio files to text using Telnyx Speech-to-Text API. Use when you need to convert audio recordings, voice messages, or spoken content to text.

55
1.5k
1mo ago
🎤Speech & Audio

OpenAI Whisper 1.0.0

openai-whisper-1-0-0
czubi1928
v1.0.0

Local speech-to-text with the Whisper CLI (no API key).

52
1.3k
3mo ago
🎤Speech & Audio

Mac TTS

mac-tts
kalijason
v1.0.0

Text-to-speech using macOS built-in `say` command. Use for voice notifications, audio alerts, reading text aloud, or announcing messages through Mac speakers. Supports multiple languages including Chinese (Mandarin), English, Japanese, etc.

49
6.8k
2
3mo ago
🎤Speech & Audio

usewhisper

usewhisper
alinxus
v1.0.0

Official Whisper Context skill for OpenClaw. Cuts context tokens via delta compression + caching, and adds long-term memory across sessions.

49
1.3k
1mo ago
🎤Speech & Audio

YouTube Transcript API

youtube-transcript-api
volodstaimi
v0.1.0

Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch proces...

47
1.2k
1mo ago
🎤Speech & Audio

TTS

tts
AMSTKO
v1.0.0

Convert text to speech using the Hume AI (or OpenAI) API. Use when the user asks for an audio message, a voice reply, or to hear something "de vive voix" (spoken aloud).

46
4.4k
1
3mo ago
🎤Speech & Audio

openclaw-voice

openclaw-voice
frank-bot07
v1.0.0

Transcribe audio to text and generate spoken AI responses using Whisper and ElevenLabs via CLI with transcript storage and search.

43
1.2k
2
1mo ago
🎤Speech & Audio

Faster Whisper

faster-whisper
theplasmak
v1.5.1

Local speech-to-text using faster-whisper. 4-6x faster than OpenAI Whisper with identical accuracy; GPU acceleration enables ~20x realtime transcription. SRT...

42
7.3k
5
1mo ago
🎤Speech & Audio

Voice Transcribe

voice-transcribe
darinkishore
v1.0.1

Transcribe audio files using OpenAI's gpt-4o-mini-transcribe model with vocabulary hints and text replacements. Requires uv (https://docs.astral.sh/uv/).

40
6.3k
13
1mo ago
🎤Speech & Audio

Qwen3-TTS

qwen-tts
paki81
v1.0.0

Local text-to-speech using Qwen3-TTS-12Hz-1.7B-CustomVoice. Use when generating audio from text, creating voice messages, or when TTS is requested. Supports 10 languages including Italian, 9 premium speaker voices, and instruction-based voice control (emotion, tone, style). Alternative to cloud-based TTS services like ElevenLabs. Runs entirely offline after initial model download.

38
3.8k
9
1mo ago
🎤Speech & Audio

Audio Cog

audio-cog
nitishgargiitd
v1.0.12

AI audio generation and text-to-speech powered by CellCog. Voiceover, narration, voice cloning, avatar voices, sound effects, music, podcasts, dialogue. Thre...

38
5.8k
4
1mo ago
🎤Speech & Audio

Discord Local STT/TTS Installer (macOS)

discord-local-stt-tts-installer
vilmire
v0.1.1

(macOS) Discord voice assistant installer. Install/update discord-local-stt-tts (Discord voice, Discord local, local STT + local TTS) from GitHub Releases.

38
1k
1mo ago
…

Other 🎙️ Voice & Audio AI Phases

🔊
Text-to-Speech
Convert text to natural-sounding speech with AI voice models and custom voice styles.
🌍
Audio Translation
Translate spoken content across languages — transcribe, translate, and re-synthesize.
🎚️
Audio Processing
Clean audio, remove noise, separate vocals, and process audio files at scale.