gemini-yt-video-transcriptCreate a verbatim transcript for a YouTube URL using Google Gemini (speaker labels, paragraph breaks; no time codes). Use when the user asks to transcribe a YouTube video or wants a clean transcript (no timestamps).
Install via ClawdBot CLI:
clawdbot install odrobnik/gemini-yt-video-transcriptGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://github.com/odrobnik/gemini-yt-video-transcript-skillUses known external API (expected, informational)
googleapis.comAudited Apr 17, 2026 · audit v1.0
Generated Feb 24, 2026
Researchers and students can use this skill to transcribe educational YouTube videos, such as lectures or conference talks, for detailed analysis, note-taking, or citation purposes. It supports speaker labeling, making it ideal for multi-speaker content like panel discussions.
Content creators, editors, and marketers can generate clean transcripts for YouTube videos to create subtitles, blog posts, or social media snippets. The verbatim output without timestamps streamlines repurposing video content into written formats.
Legal professionals can transcribe YouTube videos containing testimonies, public statements, or training sessions for evidence gathering or compliance records. Speaker labels help identify individuals in multi-party recordings.
Organizations focused on accessibility can use this skill to provide transcripts for deaf or hard-of-hearing users, enhancing video accessibility on platforms like YouTube. It supports creating readable text versions without technical clutter.
Companies can transcribe internal training videos or external webinars hosted on YouTube for employee reference, archiving, or translation into other languages. The clean format facilitates integration into learning management systems.
Offer a free tier for basic transcript generation with limited videos per month, and premium plans for higher volume, faster processing, or API access. Revenue can come from subscriptions and enterprise licenses.
Provide the transcription functionality as an API that developers can integrate into their applications, such as content management systems or e-learning platforms. Charge based on usage tiers or per-transaction fees.
License the skill to marketing agencies, legal firms, or educational institutions as a white-label tool they can rebrand and offer to their clients. Revenue is generated through one-time licensing fees or ongoing support contracts.
💬 Integration Tip
Ensure the GEMINI_API_KEY is securely stored as an environment variable and that the workspace has Python3 installed for script execution.
Scored Apr 19, 2026
Extract frames or short clips from videos using ffmpeg.
Generate FFmpeg commands from natural language video editing requests - cut, trim, convert, compress, change aspect ratio, extract audio, and more.
AI视频脚本生成器。根据用户输入的主题/关键词,生成完整的视频脚本,包含分镜描述、画面提示词、配音文案。适用于短视频创作者、AI视频制作者、内容营销人员。触发词:视频脚本、分镜、AI视频、短视频文案、视频策划。
Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported site. Use when asked to “download this video”, “save this clip”, “rip audio”, “get subtitles”, “get transcript”, or to troubleshoot yt-dlp/ffmpeg and formats/playlists.
Create product demo videos by automating browser interactions and capturing frames. Use when the user wants to record a demo, walkthrough, product showcase, or interactive video of a web application. Supports Playwright CDP screencast for high-quality capture and FFmpeg for video encoding.
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.