youtube-transcript-api
Extract, transcribe, and translate YouTube video transcripts using the YouTubeTranscript.dev V2 API. Supports captions, ASR audio transcription, batch proces...
Install via ClawdBot CLI:
clawdbot install volodstaimi/youtube-transcript-api
Grade: Limited (based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals).
Sends data to undocumented external endpoint (potential exfiltration)
POST → https://youtubetranscript.dev/api/v2/transcribe
Calls external URL not in known-safe list
https://youtubetranscript.dev/api/v2
Audited Apr 17, 2026 · audit v1.0
Generated Mar 22, 2026
Bloggers and content creators can extract transcripts from YouTube videos to repurpose video content into articles, social media posts, or e-books. This enables efficient content recycling and SEO optimization by converting spoken content into searchable text.
Educational platforms and language learners can use this skill to translate YouTube video transcripts into multiple languages, aiding in language acquisition and accessibility. It supports over 100 languages, making it ideal for global content localization.
Developers building AI models or LLM applications can batch-process up to 100 videos to gather training data or run real-time content analysis. This skill provides structured text output for feeding into machine learning pipelines or chatbots.
Organizations can generate captions and transcripts for YouTube videos to meet accessibility standards like ADA or WCAG. This includes using ASR for videos without existing captions, ensuring inclusivity for hearing-impaired audiences.
Business analysts can extract and analyze transcripts from competitor videos or industry webinars to gather insights on trends, keywords, and messaging. Batch processing allows efficient comparison across multiple video sources.
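The batch-processing use cases above mention a limit of 100 videos per batch. A minimal Python sketch of splitting a longer list of video IDs into batches of that size follows. The helper name `chunk_video_ids` and the constant `MAX_BATCH_SIZE` are hypothetical names chosen for illustration and are not part of the skill or the API.

```python
from typing import Iterator, List, Sequence

# Batch limit taken from the listing text ("up to 100 videos").
MAX_BATCH_SIZE = 100


def chunk_video_ids(
    video_ids: Sequence[str], size: int = MAX_BATCH_SIZE
) -> Iterator[List[str]]:
    """Yield consecutive slices of video_ids, each at most `size` long.

    Hypothetical helper: how each batch is actually submitted depends on
    the API documentation, which this sketch does not assume.
    """
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(video_ids), size):
        yield list(video_ids[start:start + size])
```

With 250 IDs this yields three batches (100, 100, and 50 items), which could then be submitted one after another.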
Offer a web-based platform where users pay a monthly fee to access transcript extraction, translation, and batch processing features. Monetize through tiered pricing based on credit usage, targeting content creators and businesses needing regular video analysis.
Resell the YouTube Transcript API as part of a larger toolkit or integrate it into custom software for clients. Charge per API call or offer enterprise licenses, catering to developers and agencies building video-centric applications.
Provide a free basic version with limited credits to attract users, then upsell premium features like advanced translation, higher batch limits, and priority support. Focus on individual creators and small teams to build a user base.
💬 Integration Tip
Start by obtaining a free API key from youtubetranscript.dev and test with a single video using the provided code examples to ensure connectivity before scaling to batch processing.
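As a rough illustration of the single-video connectivity test suggested in the tip, the Python sketch below builds (but does not send) a POST request to the transcribe endpoint listed in the audit section. Only the URL comes from this page. The `Authorization: Bearer` header, the `video` field name, and the function name `build_transcribe_request` are assumptions for illustration; check the provider's documentation for the real request format.

```python
import json
import urllib.request

# Endpoint as listed in the audit section of this page.
TRANSCRIBE_URL = "https://youtubetranscript.dev/api/v2/transcribe"


def build_transcribe_request(api_key: str, video: str) -> urllib.request.Request:
    """Build a POST request for one video without sending it.

    ASSUMPTIONS (not confirmed by this page): the API accepts a JSON body
    with a "video" field and a Bearer token in the Authorization header.
    """
    payload = json.dumps({"video": video}).encode("utf-8")
    headers = {
        "Content-Type": "application/json",
        "Authorization": f"Bearer {api_key}",  # assumed auth scheme
    }
    return urllib.request.Request(
        TRANSCRIBE_URL, data=payload, headers=headers, method="POST"
    )


# To actually send the request (network access and a valid key required):
#   with urllib.request.urlopen(request, timeout=30) as response:
#       result = json.load(response)
```

Building the request separately from sending it makes it possible to inspect exactly what data would leave the machine, which is relevant given the external-endpoint warnings in the audit.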
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.