ffmpeg-masterUse when performing video/audio processing tasks including transcoding, filtering, streaming, metadata manipulation, or complex filtergraph operations with FFmpeg.
Install via ClawdBot CLI:
clawdbot install liudu2326526/ffmpeg-masterComprehensive guide for professional video and audio manipulation using FFmpeg and FFprobe.
FFmpeg is the leading multimedia framework, able to decode, encode, transcode, mux, demux, stream, filter and play almost anything that humans and machines have created. It is a command-line tool that processes streams through a complex pipeline of demuxers, decoders, filters, encoders, and muxers.
# Basic Transcoding (MP4 to MKV)
ffmpeg -i input.mp4 output.mkv
# Change Video Codec (to H.265/HEVC)
ffmpeg -i input.mp4 -c:v libx265 -crf 28 -c:a copy output.mp4
# Extract Audio (No Video)
ffmpeg -i input.mp4 -vn -c:a libmp3lame -q:a 2 output.mp3
# Resize/Scale Video
ffmpeg -i input.mp4 -vf "scale=1280:720" output.mp4
# Cut Video (Start at 10s, Duration 30s)
ffmpeg -i input.mp4 -ss 00:00:10 -t 00:00:30 -c copy output.mp4
# Fast Precise Cut (Re-encoding only the cut points is complex, so standard re-encoding is safer for precision)
ffmpeg -ss 00:00:10 -i input.mp4 -to 00:00:40 -c:v libx264 -crf 23 -c:a aac output.mp4
# Concatenate Files (using demuxer)
# Create filelist.txt: file 'part1.mp4' \n file 'part2.mp4'
ffmpeg -f concat -safe 0 -i filelist.txt -c copy output.mp4
# Speed Up/Slow Down Video (2x speed)
ffmpeg -i input.mp4 -filter_complex "[0:v]setpts=0.5*PTS[v];[0:a]atempo=2.0[a]" -map "[v]" -map "[a]" output.mp4
| Option | Use When |
|-----------|----------|
| -c:v libx264 | Standard H.264 encoding (best compatibility) |
| -c:v libx265 | H.265/HEVC encoding (best compression/quality) |
| -crf [0-51] | Constant Rate Factor (lower is higher quality, 18-28 recommended) |
| -preset | Encoding speed vs compression (ultrafast, medium, veryslow) |
| -c:a copy | Pass-through audio without re-encoding (saves time/quality) |
| Filter | Use When |
|-----------|----------|
| scale | Changing resolution (e.g., scale=1920:-1 for 1080p width) |
| crop | Removing edges (e.g., crop=w:h:x:y) |
| transpose | Rotating video (1=90deg CW, 2=90deg CCW) |
| fps | Changing frame rate (e.g., fps=30) |
| drawtext | Adding text overlays/watermarks |
| overlay | Picture-in-picture or adding image watermarks |
| fade | Adding fade-in/out effects (e.g., fade=in:0:30 for first 30 frames) |
| volume | Adjusting audio levels (e.g., volume=1.5 for 150% volume) |
| setpts | Changing video speed (e.g., setpts=0.5*PTS for double speed) |
| atempo | Changing audio speed without pitch shift (0.5 to 2.0) |
| Tool/Option | Use When |
|-----------|----------|
| ffprobe -v error -show_format -show_streams | Getting detailed technical info of a file |
| -metadata title="Name" | Setting global metadata tags |
| -map | Selecting specific streams (e.g., -map 0:v:0 -map 0:a:1) |
Use filter_complex when you need to process multiple inputs or create non-linear filter chains.
# Example: Adding a watermark at the bottom right
ffmpeg -i input.mp4 -i watermark.png -filter_complex "overlay=main_w-overlay_w-10:main_h-overlay_h-10" output.mp4
# Example: Vertical Stack (2 videos)
ffmpeg -i top.mp4 -i bottom.mp4 -filter_complex "vstack=inputs=2" output.mp4
# Example: Side-by-Side (2 videos)
ffmpeg -i left.mp4 -i right.mp4 -filter_complex "hstack=inputs=2" output.mp4
# Example: Grid (4 videos 2x2)
ffmpeg -i v1.mp4 -i v2.mp4 -i v3.mp4 -i v4.mp4 -filter_complex "[0:v][1:v]hstack=inputs=2[top];[2:v][3:v]hstack=inputs=2[bottom];[top][bottom]vstack=inputs=2" output.mp4
# Example: Fade Transition (Simple crossfade between two clips)
# Requires manual offset calculation, using xfade is better
ffmpeg -i input1.mp4 -i input2.mp4 -filter_complex "xfade=transition=fade:duration=1:offset=9" output.mp4
| Platform | Codec | Command |
|----------|-------|---------|
| NVIDIA (NVENC) | H.264 | -c:v h264_nvenc |
| Intel (QSV) | H.264 | -c:v h264_qsv |
| Apple (VideoToolbox) | H.265 | -c:v hevc_videotoolbox |
-map for complex files to ensure you get the right audio/subtitle tracks.-ss before -i for fast seeking (input seeking), or after -i for accurate seeking (output seeking).Generated Mar 1, 2026
Content creators and marketers use this skill to transcode, resize, and add watermarks or text overlays to videos for platforms like YouTube, Instagram, and TikTok. It enables efficient batch processing and format optimization to meet platform-specific requirements, enhancing engagement and reach.
Professionals in film and TV utilize this skill for complex editing tasks such as color grading, audio synchronization, and applying special effects using filtergraphs. It supports high-quality codecs like H.265 for compression and hardware acceleration to speed up rendering workflows.
Educational institutions and corporate trainers use this skill to convert, cut, and concatenate video lectures or tutorials into cohesive modules. It allows for adding subtitles, adjusting playback speed, and optimizing file sizes for online delivery across various devices.
Security firms and law enforcement agencies apply this skill to transcode and analyze large volumes of video footage from cameras. It enables tasks like extracting specific time segments, enhancing video quality, and converting formats for evidence storage and review.
Streaming services and event organizers use this skill to encode and stream video in real-time with hardware acceleration for low latency. It supports adaptive bitrate streaming, format conversion, and metadata manipulation to ensure smooth delivery across different networks.
Freelancers offer customized video processing services, such as transcoding, filtering, and watermarking, to clients like small businesses and content creators. They charge per project or hourly, leveraging this skill to deliver high-quality outputs efficiently and build a recurring client base.
A software-as-a-service company integrates this skill into a cloud-based platform that automates video tasks like resizing, format conversion, and metadata editing. Subscriptions are offered to media companies and enterprises, generating revenue through monthly or annual plans.
Consultants provide expertise in advanced FFmpeg usage, offering workshops and support to media production teams in industries like broadcasting and film. Revenue comes from training sessions, custom script development, and ongoing technical support contracts.
💬 Integration Tip
Integrate this skill by embedding FFmpeg commands into automation scripts or APIs, ensuring proper error handling for stream mapping and format compatibility to avoid processing failures.
Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.
Best practices for Remotion - Video creation in React
Best practices for Remotion - Video creation in React
Long-form AI video production: the frontier of multi-agent coordination. CellCog orchestrates 6-7 foundation models to produce up to 4-minute videos from a single prompt — scripted, filmed, voiced, lipsync'd, scored, and edited automatically. Create marketing videos, product demos, explainer videos, educational content, spokesperson videos, training materials, UGC content, news reports.
HeyGen AI video creation API. Use when: (1) Using Video Agent for one-shot prompt-to-video generation, (2) Generating AI avatar videos with /v2/video/generat...
Complete toolkit for programmatic video creation with Remotion + React. Covers animations, timing, rendering (CLI/Node.js/Lambda/Cloud Run), captions, 3D, charts, text effects, transitions, and media handling. Use when writing Remotion code, building video generation pipelines, or creating data-driven video templates.