video-captions
Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.
Install via ClawdBot CLI:
clawdbot install ivangdavila/video-captions
Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
POST → https://api.deepgram.com/v1/listen?model=nova-2
Calls external URL not in known-safe list
https://clawic.com/skills/video-captions
AI Analysis
The skill's primary function uses local engines (Whisper) by default, keeping data offline. The external API calls (Deepgram, AssemblyAI) are optional, declared in metadata, and require user-provided keys, indicating they are opt-in cloud alternatives rather than mandatory data exfiltration. The homepage URL is informational, not a data sink.
Audited Apr 16, 2026 · audit v1.0
Generated Feb 26, 2026
Creators need accurate, platform-compliant captions for videos to improve accessibility and SEO. This skill generates VTT or SRT files with professional timing standards, ready for upload to YouTube Studio, ensuring sync and character limits are met.
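As a rough illustration of the SRT output mentioned above, the sketch below turns timed cues into SRT text. The `Cue` class and both function names are hypothetical and are not taken from the skill itself; only the SRT layout (index line, `HH:MM:SS,mmm --> HH:MM:SS,mmm`, text, blank line) is standard.

```python
from dataclasses import dataclass


@dataclass
class Cue:
    """One caption cue (hypothetical structure, times in seconds)."""
    start: float
    end: float
    text: str


def srt_timestamp(seconds: float) -> str:
    """Format seconds as an SRT timestamp, HH:MM:SS,mmm."""
    ms = round(seconds * 1000)
    hours, ms = divmod(ms, 3_600_000)
    minutes, ms = divmod(ms, 60_000)
    secs, ms = divmod(ms, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(cues):
    """Join cues into SRT text, numbering them from 1."""
    blocks = []
    for index, cue in enumerate(cues, start=1):
        timing = f"{srt_timestamp(cue.start)} --> {srt_timestamp(cue.end)}"
        blocks.append(f"{index}\n{timing}\n{cue.text}\n")
    # A blank line separates consecutive cue blocks.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([Cue(0.0, 1.5, "Hello"), Cue(1.6, 3.0, "and welcome")]))
```

A VTT writer would look much the same, with a `WEBVTT` header and a period instead of a comma before the milliseconds.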
Marketers require burned-in, styled captions for TikTok and Instagram Reels to enhance engagement and accessibility. The skill provides word-level timestamps for animated effects and applies bold, centered styling via FFmpeg, optimized for mobile viewing.
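The burn-in step described above could be driven roughly as follows. This is a minimal sketch that only builds an FFmpeg argument list using the `subtitles` filter and its `force_style` option; the function name, file names, and the particular style values (bold, bottom-centered, font size 24) are assumptions for illustration, not the skill's actual presets.

```python
def burn_in_command(video, subtitles, output, font_size=24):
    """Build an FFmpeg command that hardcodes subtitles into the video.

    Style values are illustrative assumptions: Alignment=2 is bottom-center
    and Bold=1 turns on bold text in ASS style overrides.
    Note: subtitle paths with special characters (':' or quotes) would need
    filter-level escaping, which this sketch does not handle.
    """
    style = f"Alignment=2,Bold=1,FontSize={font_size}"
    video_filter = f"subtitles={subtitles}:force_style='{style}'"
    # Audio is copied unchanged; the video stream must be re-encoded.
    return ["ffmpeg", "-i", video, "-vf", video_filter, "-c:a", "copy", output]


if __name__ == "__main__":
    cmd = burn_in_command("input.mp4", "captions.srt", "output.mp4")
    print(" ".join(cmd))
    # To actually run it: subprocess.run(cmd, check=True)
```

Passing the arguments as a list avoids shell quoting problems when the command is handed to `subprocess.run`.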
Production studios need Netflix-compliant subtitles in TTML format for streaming platforms, adhering to strict timing and formatting rules. This skill uses high-accuracy engines like Whisper large-v3 and verifies line limits and gaps for quality assurance.
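The line-limit and gap verification mentioned above might be sketched as below. The thresholds (42 characters per line, two lines per cue, a minimum gap of about two frames at 24 fps) are assumptions chosen for illustration and should be checked against the target platform's current style guide; the function name and cue layout are hypothetical as well.

```python
def check_cues(cues, max_chars=42, max_lines=2, min_gap=0.083):
    """Return a list of human-readable issues for (start, end, text) cues.

    All limits are assumed defaults, not values confirmed by the skill.
    """
    issues = []
    for idx, (start, end, text) in enumerate(cues):
        number = idx + 1  # cues are reported 1-based, as in SRT files
        lines = text.split("\n")
        if len(lines) > max_lines:
            issues.append(f"cue {number}: {len(lines)} lines (max {max_lines})")
        for line in lines:
            if len(line) > max_chars:
                issues.append(
                    f"cue {number}: line has {len(line)} chars (max {max_chars})"
                )
        # Compare this cue's end with the next cue's start, if there is one.
        if idx + 1 < len(cues):
            gap = cues[idx + 1][0] - end
            if gap < min_gap:
                issues.append(
                    f"cue {number}: gap to next cue is {gap:.3f}s (min {min_gap}s)"
                )
    return issues


if __name__ == "__main__":
    sample = [(0.0, 2.0, "Hello there"), (2.01, 4.0, "x" * 50)]
    for issue in check_cues(sample):
        print(issue)
```

Returning plain strings keeps the sketch short; a real quality-assurance pass would more likely return structured records so that issues can be filtered or fixed automatically.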
Podcasters and journalists require multi-speaker transcription with diarization to label speakers and format dialogue. The skill enables local processing for privacy, outputting SDH-compliant captions with speaker IDs and non-speech descriptions.
Offer basic local transcription for free to attract users, with premium features like cloud engine integration, advanced styling, and batch processing via subscription plans. Revenue comes from monthly fees for high-volume or enterprise users.
License the skill to video editing software companies or production studios as an embedded tool, providing custom integrations and support. Revenue is generated through one-time licensing fees or annual contracts based on usage tiers.
Deploy the skill as a cloud API for developers, charging per minute of video processed with options for different engines and formats. This model scales with usage and appeals to apps needing automated caption generation without local setup.
💬 Integration Tip
Integrate with existing video workflows by using command-line tools like FFmpeg and Whisper, ensuring compatibility across Linux and macOS; provide clear documentation for env vars and platform-specific setups.
Scored Apr 18, 2026
Extract frames or short clips from videos using ffmpeg.
Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported site. Use when asked to “download this video”, “save this clip”, “rip audio”, “get subtitles”, “get transcript”, or to troubleshoot yt-dlp/ffmpeg and formats/playlists.
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
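AI video script generator. Generates a complete video script from a user-supplied topic or keywords, including storyboard descriptions, visual prompts, and voiceover copy. Suited to short-video creators, AI video producers, and content marketers. Trigger words: 视频脚本 (video script), 分镜 (storyboard), AI视频 (AI video), 短视频文案 (short-video copy), 视频策划 (video planning).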
Generate document, outline, and image-text AI notes by providing a video URL, using Baidu's video analysis and note extraction APIs.
Process video and audio using FFmpeg CLI for transcoding, cutting, merging, audio extraction, thumbnails, GIFs, speed, filters, subtitles, and watermarks.