youtube-transcribeTranscribe YouTube videos with smart fallback: extracts captions first (fast, free), falls back to local Whisper transcription when no captions available. Au...
Install via ClawdBot CLI:
clawdbot install iml885203/youtube-transcribeGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Accesses system directories or attempts privilege escalation
/proc/Calls external URL not in known-safe list
https://www.youtube.com/watch?v=VIDEO_IDAI Analysis
The skill's external API usage (YouTube) is consistent with its stated purpose of transcribing YouTube videos, and there's no evidence of unauthorized data exfiltration or credential harvesting. The /proc/ access mentioned in signals appears to be a false positive, as the skill definition shows only legitimate operations with yt-dlp and Whisper backends for transcription.
Audited Apr 16, 2026 · audit v1.0
Generated Mar 20, 2026
Researchers can quickly extract transcripts from academic lectures or conference videos on YouTube to summarize key points without watching entire videos. This aids in literature reviews or gathering insights from video-based sources efficiently, saving time and enabling text-based analysis.
Media producers use this skill to generate accurate subtitles for YouTube videos lacking captions, improving accessibility for deaf or hard-of-hearing audiences. It ensures compliance with accessibility standards and enhances viewer engagement by providing text alternatives.
Business analysts transcribe competitor product demos or industry webinars from YouTube to analyze trends and extract actionable insights. This supports competitive analysis and strategic planning by converting video content into searchable, analyzable text data.
Language learners extract transcripts from YouTube videos in target languages to study vocabulary, grammar, and pronunciation. They can use the output for practice exercises or create study materials, enhancing immersion and comprehension skills.
Legal professionals transcribe YouTube videos containing depositions, public statements, or evidence for case preparation and documentation. This ensures accurate records, aids in review processes, and supports compliance with legal requirements for video evidence.
Offer a cloud-based service where content creators and marketers pay a monthly fee to transcribe and analyze YouTube videos at scale. Features include batch processing, API access, and integration with content management systems for automated workflows.
Provide a free basic version for individual educators to transcribe videos, with premium upgrades offering advanced features like multilingual support, higher accuracy models, and export to educational platforms. Monetize through tiered plans and institutional licenses.
License the transcription technology to media companies or video platforms for embedding into their services, such as automatic subtitle generation or content indexing. Charge based on API calls or through white-label partnerships.
💬 Integration Tip
Ensure ffmpeg and yt-dlp are installed for full functionality, and consider setting environment variables like YT_WHISPER_BACKEND to optimize performance based on hardware.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Start voice calls via the OpenClaw voice-call plugin.
Local text-to-speech via sherpa-onnx (offline, no cloud)