youtube-voice-summarizer-elevenlabsTransform YouTube videos into podcast-style voice summaries using ElevenLabs TTS
Install via ClawdBot CLI:
clawdbot install Franciscoandsam/youtube-voice-summarizer-elevenlabsGrade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
POST → http://127.0.0.1:3050/api/summarizeCalls external URL not in known-safe list
https://github.com/Franciscomoney/elevenlabs-moltbotAI Analysis
The skill requires a user-deployed backend server and sends data only to that local endpoint (127.0.0.1:3050) and documented third-party APIs (ElevenLabs, Supadata, OpenRouter) for its stated purpose. There is no evidence of credential harvesting, hidden instructions, or obfuscation, but it relies on the user's own API keys and server security.
Audited Apr 16, 2026 · audit v1.0
Generated Mar 1, 2026
Students can quickly grasp key points from lengthy educational YouTube lectures or tutorials by generating concise audio summaries. This helps in efficient study sessions and revision, especially for auditory learners who benefit from listening to content.
Business professionals can summarize competitor product demos, industry webinars, or market trend videos into brief audio reports. This saves time during research phases and allows teams to stay updated without watching full videos.
This skill converts video content into audio summaries, making YouTube videos more accessible for individuals with visual impairments. It provides an alternative way to consume information through natural-sounding speech output.
Content creators and social media managers can generate quick audio teasers or summaries from YouTube videos to repurpose content for podcasts, promotional clips, or audience engagement. It streamlines workflow by automating summarization and voiceover tasks.
Companies can use this skill to summarize training videos or onboarding materials into short audio clips for employees. This facilitates quick learning and retention, especially for remote teams or those with busy schedules.
Offer a limited number of free summaries per month with basic voices, then charge a monthly subscription for unlimited access, premium voices, and longer summary lengths. This model attracts users with free trials and monetizes heavy usage.
Provide API access to developers and businesses who integrate the summarization service into their own applications, charging per summary based on length and voice options. This scales with usage and targets tech-savvy customers.
Sell customized enterprise packages to companies for internal use, such as in education or corporate training, with bulk pricing, dedicated support, and integration assistance. This model focuses on high-value, long-term contracts.
💬 Integration Tip
Ensure the backend server is properly configured with all required API keys and publicly accessible to handle user requests seamlessly. Test with various YouTube URLs to confirm transcript availability and audio output quality.
Scored Apr 22, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Start voice calls via the OpenClaw voice-call plugin.
Local text-to-speech via sherpa-onnx (offline, no cloud)