youtube-audio-download
Download YouTube video audio and convert to MP3. Supports age-restricted videos with cookies.
Install via ClawdBot CLI:
clawdbot install banner90/youtube-audio-download
Grade: Good (based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals).
Generated Mar 20, 2026
Content creators and localization teams use this skill to download audio from YouTube videos for translation, dubbing, or subtitling workflows. It integrates with tools like Doubao for automated subtitle capture, streamlining the production of multilingual content.
Educators and students download audio from educational YouTube videos to create offline study materials or podcasts. This supports learning in areas with limited internet access or for auditory-focused revision sessions.
Market researchers analyze audio from competitor videos or industry talks to extract insights on trends, sentiment, or branding. The skill enables bulk downloading for processing with speech-to-text or AI analysis tools.
Organizations use this skill to download audio for creating accessible content, such as converting video lectures into audio-only formats for visually impaired users or generating transcripts for compliance with accessibility standards.
Podcast producers extract audio from YouTube interviews or clips to repurpose content into podcast episodes. The skill handles age-restricted videos via cookies, ensuring reliable access to diverse source material.
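The download-and-convert step behind these use cases can be sketched as follows. The listing does not say which tool the skill uses internally, so this is only an illustration that assumes the yt-dlp command-line tool and ffmpeg are installed; the function name build_audio_command, the output template, and the placeholder URL are hypothetical.

```python
from typing import List, Optional


def build_audio_command(
    url: str,
    output_dir: str = ".",
    cookies_file: Optional[str] = None,
) -> List[str]:
    """Build a yt-dlp argument list that extracts audio as MP3.

    Assumption: yt-dlp is the downloader. The skill's real internals
    are not documented in this listing.
    """
    cmd = [
        "yt-dlp",
        "--extract-audio",          # keep only the audio track
        "--audio-format", "mp3",    # convert to MP3 (yt-dlp calls ffmpeg for this)
        "--output", f"{output_dir}/%(title)s.%(ext)s",
    ]
    if cookies_file:
        # A Netscape-format cookies file exported from a logged-in browser
        # lets yt-dlp access age-restricted videos.
        cmd += ["--cookies", cookies_file]
    cmd.append(url)
    return cmd


# To actually run the download (requires yt-dlp and ffmpeg on PATH):
#   import subprocess
#   subprocess.run(build_audio_command(url, "/mnt/h/audio", "cookies.txt"), check=True)
```

Building the argument list separately from running it makes the command easy to inspect or log before any network access happens.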
Offer a cloud platform where users can upload YouTube URLs to download and convert audio automatically, with features like batch processing and API access. Revenue comes from monthly subscriptions based on usage tiers, targeting content creators and businesses.
Develop a user-friendly desktop app with a free tier for basic downloads and a paid version for advanced features like high-speed conversions, cookie management, and integration with translation tools. Revenue is generated through one-time purchases or in-app upgrades.
Provide customized integrations for companies in media, education, or research, offering white-labeled tools, dedicated support, and compliance features. Revenue is based on annual contracts, with pricing scaled to the number of users or volume of downloads.
💬 Integration Tip
Keep the Windows desktop visible so that GUI automation can interact with it, and map paths correctly when running from WSL: by default, Windows drives are mounted under /mnt/<drive letter>/, so the H: drive is reached at /mnt/h/.
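The path mapping in this tip can be sketched in a few lines. This is a minimal example assuming WSL's default /mnt/<drive letter> mount layout; the helper names windows_to_wsl and wsl_to_windows are hypothetical and not part of the skill. (Inside WSL, the built-in wslpath utility performs the same conversion.)

```python
from pathlib import PurePosixPath, PureWindowsPath


def windows_to_wsl(path: str) -> str:
    """Convert an absolute Windows path such as H:\\audio\\a.mp3 to /mnt/h/audio/a.mp3."""
    p = PureWindowsPath(path)
    drive = p.drive  # e.g. "H:"
    if len(drive) != 2 or drive[1] != ":" or not p.root:
        raise ValueError(f"not an absolute drive-letter path: {path!r}")
    # p.parts[0] is the anchor ("H:\\"); the rest are directory and file names.
    return str(PurePosixPath("/mnt", drive[0].lower(), *p.parts[1:]))


def wsl_to_windows(path: str) -> str:
    """Convert a WSL path under /mnt/<letter>/ back to a Windows path."""
    parts = PurePosixPath(path).parts  # e.g. ('/', 'mnt', 'h', 'audio', 'a.mp3')
    if len(parts) < 3 or parts[0] != "/" or parts[1] != "mnt" or len(parts[2]) != 1:
        raise ValueError(f"not a /mnt/<drive letter>/ path: {path!r}")
    return str(PureWindowsPath(f"{parts[2].upper()}:\\", *parts[3:]))
```

For example, windows_to_wsl(r"H:\audio\talk.mp3") gives /mnt/h/audio/talk.mp3, and wsl_to_windows reverses it. Using the pure path classes means the helpers behave the same whether they run on Windows or inside WSL.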
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Start voice calls via the OpenClaw voice-call plugin.
Local text-to-speech via sherpa-onnx (offline, no cloud).