video-captions

Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.
Install via ClawdBot CLI:
```shell
clawdbot install ivangdavila/video-captions
```

Requires:
User needs captions or subtitles for video content. Agent handles transcription, timing, formatting, styling, translation, and burn-in across all major formats and platforms.
| Topic | File |
|-------|------|
| Transcription engines | engines.md |
| Output formats | formats.md |
| Styling presets | styling.md |
| Platform requirements | platforms.md |
| Scenario | Engine | Why |
|----------|--------|-----|
| Default (recommended) | Whisper local | 100% offline, no data leaves machine |
| Apple Silicon | MLX Whisper | Native acceleration, still local |
| Word timestamps | whisper-timestamped | DTW alignment, still local |
Default: Whisper local (turbo model). See engines.md for optional cloud alternatives.
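The engine choice in the table above can be sketched as a small shell helper. The engine names match the CLI entry points mentioned in this document; the selection logic itself is an illustrative assumption, not part of the skill:

```shell
# Pick a local engine from the machine type (illustrative sketch).
# Apple Silicon reports "Darwin arm64" from `uname -sm`.
select_engine() {
  case "$1" in
    "Darwin arm64") echo "mlx_whisper" ;;  # native acceleration, still local
    *)              echo "whisper" ;;      # default: local Whisper (turbo)
  esac
}

select_engine "$(uname -sm)"
```

Word-level timing still goes through whisper-timestamped regardless of platform, so this helper only covers the base transcription path.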
| Platform | Format | Notes |
|----------|--------|-------|
| YouTube | VTT or SRT | VTT preferred |
| Netflix/Pro | TTML | Strict timing rules |
| Social (TikTok, IG) | Burn-in (ASS) | Embedded in video |
| General | SRT | Universal compatibility |
| Karaoke/effects | ASS | Advanced styling |
Ask user's target platform if not specified.
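The platform-to-format mapping above can be expressed as a lookup; the function name and platform keys below are illustrative assumptions:

```shell
# Map a target platform to its preferred caption format (sketch of the
# table above; falls back to SRT for universal compatibility).
format_for_platform() {
  case "$1" in
    youtube)           echo "vtt" ;;   # VTT preferred, SRT also accepted
    netflix)           echo "ttml" ;;  # strict timing rules
    tiktok|instagram)  echo "ass" ;;   # burn-in styling
    karaoke)           echo "ass" ;;   # advanced effects
    *)                 echo "srt" ;;
  esac
}
```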
Netflix-compliant (default):
Social media:
Break lines:
Never separate:
Use word timestamps for:
Enable with --word-timestamps flag.
For multi-speaker content:
[Speaker 1] or [Name] if known

JOHN: What do you think?

Before delivering:
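One pre-delivery check, the line-length limit, can be sketched as an awk lint. The 42-character limit follows the Netflix guideline mentioned elsewhere in this document; the function name and exact filtering are illustrative assumptions:

```shell
# Flag SRT text lines longer than a character limit (default 42).
# Skips cue numbers and timestamp lines; exits non-zero if any line fails.
check_line_length() {
  awk -v max="${2:-42}" '
    $0 ~ /-->/ || $0 ~ /^[0-9]+$/ { next }            # skip cues and timings
    length($0) > max { printf "line %d: %d chars\n", NR, length($0); bad = 1 }
    END { exit bad }' "$1"
}
```

Silent output (and exit status 0) means every text line is within the limit.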
```shell
# Auto-detect language, output SRT
whisper video.mp4 --model turbo --output_format srt

# Specify language
whisper video.mp4 --model turbo --language es --output_format srt

# Multiple formats
whisper video.mp4 --model turbo --output_format all
```
```shell
# Using whisper-timestamped
whisper_timestamped video.mp4 --model large-v3 --output_format srt

# With VAD pre-processing (reduces hallucinations)
whisper_timestamped video.mp4 --vad silero --accurate
```
```shell
# Generate SRT first, then convert with style
ffmpeg -i video.mp4 -vf "subtitles=video.srt:force_style='FontName=Arial,FontSize=24,PrimaryColour=&HFFFFFF,OutlineColour=&H000000,Outline=2,Shadow=1,Alignment=2'" output.mp4

# TikTok/Instagram style (centered, bold)
ffmpeg -i video.mp4 -vf "subtitles=video.srt:force_style='FontName=Montserrat-Bold,FontSize=32,PrimaryColour=&HFFFFFF,OutlineColour=&H000000,Outline=3,Shadow=0,Alignment=10,MarginV=50'" output.mp4

# Netflix style (bottom, clean)
ffmpeg -i video.mp4 -vf "subtitles=video.srt:force_style='FontName=Netflix Sans,FontSize=48,PrimaryColour=&HFFFFFF,OutlineColour=&H000000,Outline=2,Shadow=1,Alignment=2'" output.mp4
```
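The `&HFFFFFF`-style values in those `force_style` strings are ASS colours, which store bytes in blue-green-red order rather than web-style RRGGBB. A small conversion helper (the function name is an illustrative assumption):

```shell
# Convert web-style RRGGBB hex to the ASS &HBBGGRR colour format.
# ASS/SSA colour fields reverse the byte order relative to CSS hex.
rgb_to_ass() {
  local hex=$1
  echo "&H${hex:4:2}${hex:2:2}${hex:0:2}"
}
```

For example, pure red `FF0000` becomes `&H0000FF`, which is why naively pasting CSS colours into `force_style` swaps red and blue.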
```shell
# Transcribe + translate to English
whisper video.mp4 --model turbo --task translate --output_format srt
```
```shell
# SRT to VTT
ffmpeg -i video.srt video.vtt

# SRT to ASS (for styling)
ffmpeg -i video.srt video.ass
```
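If ffmpeg is unavailable, the SRT-to-VTT direction is mechanical enough to sketch in plain shell: add the `WEBVTT` header and change the comma decimal separator in timestamps to a dot. This is a simplification that ignores cue settings and styling:

```shell
# Minimal SRT -> VTT conversion without ffmpeg (illustrative sketch).
# Only rewrites the decimal separator inside HH:MM:SS,mmm timestamps,
# so commas in subtitle text are left untouched.
srt_to_vtt() {
  printf 'WEBVTT\n\n'
  sed -E 's/([0-9]{2}:[0-9]{2}:[0-9]{2}),([0-9]{3})/\1.\2/g' "$1"
}
```

Usage: `srt_to_vtt video.srt > video.vtt`.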
- --language explicitly for mixed content
- --max_line_width 42 for Netflix compliance
- -b:v 8M)
- whisper video.mp4 --output_format vtt
- ffmpeg -i video.mp4 -vf "subtitles=video.ass" -c:a copy output.mp4
- [SPEAKER]: text
- [music], [laughter] descriptions
- --task translate for English

Default: 100% LOCAL processing. No network calls.
| Endpoint | Data Sent | When Used |
|----------|-----------|-----------|
| Whisper (local) | None (local) | Default — always |
| api.assemblyai.com | Audio file | Only if user sets ASSEMBLYAI_API_KEY |
| api.deepgram.com | Audio file | Only if user sets DEEPGRAM_API_KEY |
Cloud APIs are documented as alternatives but never used unless user explicitly provides API keys and requests cloud processing. By default, all processing stays on your machine.
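The opt-in rule can be sketched as an environment-variable check; the function name is an illustrative assumption, while the key names match the table above:

```shell
# Route to a cloud engine only when the user has explicitly set a key;
# otherwise everything stays local (the default).
pick_backend() {
  if [ -n "${ASSEMBLYAI_API_KEY:-}" ]; then
    echo "cloud: assemblyai"
  elif [ -n "${DEEPGRAM_API_KEY:-}" ]; then
    echo "cloud: deepgram"
  else
    echo "local: whisper"
  fi
}
```

With no keys exported, the local path is always chosen, which matches the "no network calls by default" guarantee.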
Default workflow is 100% offline:
Cloud APIs are OPTIONAL and OPT-IN:
ASSEMBLYAI_API_KEY or DEEPGRAM_API_KEY

This skill does NOT:
Install with clawhub install if user confirms:
- ffmpeg — video/audio processing
- video — general video tasks
- video-edit — video editing
- audio — audio processing

clawhub star video-captions
clawhub sync

Generated Feb 26, 2026
Creators need accurate, platform-compliant captions for videos to improve accessibility and SEO. This skill generates VTT or SRT files with professional timing standards, ready for upload to YouTube Studio, ensuring sync and character limits are met.
Marketers require burned-in, styled captions for TikTok and Instagram Reels to enhance engagement and accessibility. The skill provides word-level timestamps for animated effects and applies bold, centered styling via FFmpeg, optimized for mobile viewing.
Production studios need Netflix-compliant subtitles in TTML format for streaming platforms, adhering to strict timing and formatting rules. This skill uses high-accuracy engines like Whisper large-v3 and verifies line limits and gaps for quality assurance.
Podcasters and journalists require multi-speaker transcription with diarization to label speakers and format dialogue. The skill enables local processing for privacy, outputting SDH-compliant captions with speaker IDs and non-speech descriptions.
Offer basic local transcription for free to attract users, with premium features like cloud engine integration, advanced styling, and batch processing via subscription plans. Revenue comes from monthly fees for high-volume or enterprise users.
License the skill to video editing software companies or production studios as an embedded tool, providing custom integrations and support. Revenue is generated through one-time licensing fees or annual contracts based on usage tiers.
Deploy the skill as a cloud API for developers, charging per minute of video processed with options for different engines and formats. This model scales with usage and appeals to apps needing automated caption generation without local setup.
💬 Integration Tip
Integrate with existing video workflows by using command-line tools like FFmpeg and Whisper, ensuring compatibility across Linux and macOS; provide clear documentation for env vars and platform-specific setups.
Extract frames or short clips from videos using ffmpeg.
Download videos, audio, subtitles, and clean paragraph-style transcripts from YouTube and any other yt-dlp supported site. Use when asked to “download this video”, “save this clip”, “rip audio”, “get subtitles”, “get transcript”, or to troubleshoot yt-dlp/ffmpeg and formats/playlists.
Generate SRT subtitles from video/audio with translation support. Transcribes Hebrew (ivrit.ai) and English (whisper), translates between languages, burns subtitles into video. Use for creating captions, transcripts, or hardcoded subtitles for WhatsApp/social media.
Create AI videos with optimized prompts, motion control, and platform-ready output.
Automatically logs in to a Douyin account, uploads and publishes videos to the Douyin creator platform, and supports video tag management and login-status checks.
AI video generation workflow on Volcengine. Use when users need text-to-video, image-to-video, generation parameter tuning, or async task troubleshooting for video jobs.