transcript
Get transcripts from any YouTube video for summarization, research, translation, quoting, or content analysis. Use when the user shares a video link or asks "what did they say", "get the transcript", "transcribe this video", "summarize this video", or wants to analyze spoken content.
Install via ClawdBot CLI:
clawdbot install therohitdas/transcript
Fetch video transcripts via TranscriptAPI.com.
If $TRANSCRIPT_API_KEY is not set, help the user create an account (100 free credits, no card):
Step 1 - Register: Ask the user for their email.
node ./scripts/tapi-auth.js register --email USER_EMAIL
→ OTP sent to email. Ask user: _"Check your email for a 6-digit verification code."_
Step 2 - Verify: Once the user provides the OTP:
node ./scripts/tapi-auth.js verify --token TOKEN_FROM_STEP_1 --otp CODE
API key saved to ~/.openclaw/openclaw.json. See File Writes below for details. Existing file is backed up before modification.
Manual option: transcriptapi.com/signup → Dashboard → API Keys.
File Writes
The verify and save-key commands save the API key to ~/.openclaw/openclaw.json (sets skills.entries.transcriptapi.apiKey and enabled: true). Any existing file is backed up to ~/.openclaw/openclaw.json.bak before modification.
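To make the file write concrete, here is a minimal Python sketch of the same idea: back up the existing config, then set the nested keys. It is not the code in tapi-auth.js. The function name `save_api_key` is hypothetical, the file is assumed to be plain JSON, and placing `enabled` next to `apiKey` inside the `transcriptapi` entry is an assumption based on the description above.

```python
import json
import shutil
from pathlib import Path


def save_api_key(config_path: Path, api_key: str) -> dict:
    """Store the key under skills.entries.transcriptapi, backing up any existing file."""
    config = {}
    if config_path.exists():
        # Copy the current file to <name>.bak before changing anything.
        backup_path = config_path.with_name(config_path.name + ".bak")
        shutil.copy2(config_path, backup_path)
        config = json.loads(config_path.read_text(encoding="utf-8"))

    # Create the nested objects only when missing, so unrelated settings survive.
    entry = (
        config.setdefault("skills", {})
        .setdefault("entries", {})
        .setdefault("transcriptapi", {})
    )
    entry["apiKey"] = api_key
    entry["enabled"] = True  # assumption: "enabled" lives beside "apiKey"

    config_path.parent.mkdir(parents=True, exist_ok=True)
    config_path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return config
```

Calling `save_api_key(Path.home() / ".openclaw" / "openclaw.json", key)` would then leave other top-level settings untouched while adding or updating the `transcriptapi` entry.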
To use the API key in terminal/CLI outside the agent, add to your shell profile manually:
export TRANSCRIPT_API_KEY=
curl -s "https://transcriptapi.com/api/v2/youtube/transcript\
?video_url=VIDEO_URL&format=text&include_timestamp=true&send_metadata=true" \
-H "Authorization: Bearer $TRANSCRIPT_API_KEY"
| Param | Required | Default | Values |
| ------------------- | -------- | ------- | ------------------------------- |
| video_url | yes | - | YouTube URL or 11-char video ID |
| format | no | json | json, text |
| include_timestamp | no | true | true, false |
| send_metadata | no | false | true, false |
Accepts: full URLs (youtube.com/watch?v=ID), short URLs (youtu.be/ID), shorts (youtube.com/shorts/ID), or bare video IDs.
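The API accepts all of these forms directly, so no client-side parsing is required. If you want to validate or deduplicate links locally before spending a credit, a sketch like the following covers the four listed forms. `extract_video_id` is a hypothetical helper, and the 11-character ID pattern (letters, digits, `_`, `-`) is an assumption about YouTube IDs rather than something TranscriptAPI documents.

```python
import re
from typing import Optional
from urllib.parse import parse_qs, urlparse

# Assumption: a video ID is exactly 11 URL-safe characters.
VIDEO_ID = re.compile(r"[A-Za-z0-9_-]{11}")


def extract_video_id(value: str) -> Optional[str]:
    """Return the video ID from a watch URL, short URL, shorts URL, or bare ID."""
    value = value.strip()
    if VIDEO_ID.fullmatch(value):
        return value  # already a bare ID

    # urlparse only recognises the host when a scheme is present.
    parsed = urlparse(value if "://" in value else "https://" + value)
    host = (parsed.hostname or "").lower()
    if host.startswith("www."):
        host = host[4:]

    candidate = None
    if host == "youtu.be":
        candidate = parsed.path.lstrip("/").split("/")[0]
    elif host == "youtube.com":
        if parsed.path == "/watch":
            candidate = parse_qs(parsed.query).get("v", [None])[0]
        elif parsed.path.startswith("/shorts/"):
            candidate = parsed.path.split("/")[2]

    if candidate and VIDEO_ID.fullmatch(candidate):
        return candidate
    return None
```

Anything the helper does not recognise returns `None`, which is a convenient point to ask the user for a valid link instead of sending the request.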
Default: Always use format=text&include_timestamp=true&send_metadata=true unless user specifies otherwise.
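For callers that prefer Python to curl, the same request can be assembled with the standard library. This is a sketch under the parameters documented above; `build_transcript_request` is a hypothetical name, and the function only builds the request object without sending it.

```python
import urllib.request
from urllib.parse import urlencode

API_URL = "https://transcriptapi.com/api/v2/youtube/transcript"


def build_transcript_request(
    video: str,
    api_key: str,
    fmt: str = "text",
    include_timestamp: bool = True,
    send_metadata: bool = True,
) -> urllib.request.Request:
    """Build a GET request using the skill's recommended defaults."""
    query = urlencode(
        {
            "video_url": video,  # URL or bare 11-char ID; urlencode escapes it
            "format": fmt,
            "include_timestamp": str(include_timestamp).lower(),  # "true" / "false"
            "send_metadata": str(send_metadata).lower(),
        }
    )
    return urllib.request.Request(
        f"{API_URL}?{query}",
        headers={"Authorization": f"Bearer {api_key}"},
    )
```

Passing the result to `urllib.request.urlopen(request, timeout=30)` and decoding the body with `json.load` would perform the actual call; the timeout value here is an arbitrary choice.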
Response (format=json):
{
"video_id": "dQw4w9WgXcQ",
"language": "en",
"transcript": [
{ "text": "We're no strangers to love", "start": 18.0, "duration": 3.5 },
{ "text": "You know the rules and so do I", "start": 21.5, "duration": 2.8 }
],
"metadata": {
"title": "Rick Astley - Never Gonna Give You Up",
"author_name": "Rick Astley",
"author_url": "https://www.youtube.com/@RickAstley",
"thumbnail_url": "https://i.ytimg.com/vi/dQw4w9WgXcQ/maxresdefault.jpg"
}
}
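The JSON form is the easier one to work with when quoting a specific moment, because each segment carries numeric `start` and `duration` values. The sketch below renders segments as timestamped lines and looks up the segment covering a given second. The helper names are hypothetical, and truncating fractional seconds is an assumption chosen to match the sample timestamps in this document.

```python
from typing import List, Optional


def format_timestamp(seconds: float) -> str:
    """Render seconds as [HH:MM:SS], dropping any fractional part."""
    total = int(seconds)
    hours, rest = divmod(total, 3600)
    minutes, secs = divmod(rest, 60)
    return f"[{hours:02d}:{minutes:02d}:{secs:02d}]"


def segments_to_lines(response: dict) -> List[str]:
    """Turn a format=json response into one timestamped line per segment."""
    return [
        f"{format_timestamp(seg['start'])} {seg['text']}"
        for seg in response["transcript"]
    ]


def segment_at(response: dict, t: float) -> Optional[dict]:
    """Return the segment being spoken at time t (in seconds), if any."""
    for seg in response["transcript"]:
        if seg["start"] <= t < seg["start"] + seg["duration"]:
            return seg
    return None
```

With the sample response above, `segment_at(response, 20.0)` returns the first segment, since 20.0 falls between 18.0 and 21.5.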
Response (format=text):
{
"video_id": "dQw4w9WgXcQ",
"language": "en",
"transcript": "[00:00:18] We're no strangers to love\n[00:00:21] You know the rules...",
"metadata": {...}
}
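If you requested `format=text` but later need the timestamps as numbers, the transcript string can be split back into entries. This sketch assumes each line starts with a `[HH:MM:SS]` prefix as in the sample above; the function name is hypothetical, and lines without a prefix are treated as continuations of the previous entry.

```python
import re
from typing import List, Tuple

# Assumption: timestamps look like [HH:MM:SS] at the start of a line.
LINE = re.compile(r"^\[(\d{2,}):(\d{2}):(\d{2})\]\s*(.*)$")


def parse_timestamped_text(transcript: str) -> List[Tuple[int, str]]:
    """Split a format=text transcript into (start_seconds, text) pairs."""
    entries: List[Tuple[int, str]] = []
    for line in transcript.splitlines():
        match = LINE.match(line)
        if match:
            hours, minutes, seconds, text = match.groups()
            start = int(hours) * 3600 + int(minutes) * 60 + int(seconds)
            entries.append((start, text))
        elif entries and line.strip():
            # No timestamp on this line: attach it to the previous entry.
            start, text = entries[-1]
            entries[-1] = (start, f"{text} {line.strip()}")
    return entries
```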
| Code | Meaning | Action |
| ---- | ------------- | ----------------------------------- |
| 401 | Bad API key | Check key or re-setup |
| 402 | No credits | Top up at transcriptapi.com/billing |
| 404 | No transcript | Video may not have captions enabled |
| 408 | Timeout | Retry once after 2s |
| 429 | Rate limited | Wait and retry |
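The table translates into a small retry wrapper. The sketch below retries a 408 once after 2 seconds, waits and retries on 429, and raises with the suggested action for everything else. All names are hypothetical, `send` stands in for whatever function performs the HTTP call and returns `(status, body)`, and the 429 wait time and retry cap are assumptions, since the table only says "wait and retry".

```python
import time
from typing import Callable, Tuple

# Suggested actions, taken from the error table.
ACTIONS = {
    401: "Check key or re-setup",
    402: "Top up at transcriptapi.com/billing",
    404: "Video may not have captions enabled",
    408: "Retry once after 2s",
    429: "Wait and retry",
}


class TranscriptError(Exception):
    def __init__(self, status: int, action: str):
        super().__init__(f"HTTP {status}: {action}")
        self.status = status
        self.action = action


def fetch_with_retry(
    send: Callable[[], Tuple[int, dict]],
    sleep: Callable[[float], None] = time.sleep,
    rate_limit_wait: float = 10.0,  # assumption: not specified by the API docs
    max_rate_limit_retries: int = 3,  # assumption
) -> dict:
    """Call send() and apply the retry rules from the error table."""
    timeouts = 0
    rate_limited = 0
    while True:
        status, body = send()
        if status == 200:
            return body
        if status == 408 and timeouts < 1:
            timeouts += 1
            sleep(2)  # retry once after 2s
            continue
        if status == 429 and rate_limited < max_rate_limit_retries:
            rate_limited += 1
            sleep(rate_limit_wait)
            continue
        raise TranscriptError(status, ACTIONS.get(status, "Unexpected status"))
```

Taking `sleep` as a parameter keeps the wrapper easy to test without real delays, and the `action` attribute on the exception can be shown to the user as-is.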
Use format=json when you need precise timestamps for quoting specific moments.
Use include_timestamp=false for clean text suitable for translation or analysis.
Generated Mar 1, 2026
Researchers and students use the skill to transcribe educational or documentary YouTube videos for qualitative analysis, citation, or literature reviews. It enables extracting spoken content for coding themes, verifying quotes, and analyzing discourse without manual transcription.
Content creators and marketers transcribe videos to repurpose spoken content into blog posts, subtitles, or social media snippets. It aids in SEO optimization by generating text from video content and creating accessible materials for diverse audiences.
Language learners and translation agencies use the skill to obtain transcripts for practicing listening comprehension or translating video content. It supports creating multilingual subtitles and educational materials from YouTube videos efficiently.
Legal professionals and compliance officers transcribe video evidence, depositions, or public statements from YouTube for documentation and analysis. It ensures accurate records for case preparation, regulatory reviews, or archival purposes.
Journalists and fact-checkers use the skill to quickly transcribe news reports, interviews, or speeches from YouTube for verification and reporting. It enables extracting exact quotes and analyzing spoken information for accuracy and context.
Offer a free tier with limited credits to attract users, then charge for premium plans with higher quotas, faster processing, or advanced features like multilingual support. Revenue comes from subscription fees and pay-as-you-go credit purchases.
Provide custom API integrations for businesses in education, media, or legal sectors, offering dedicated support, SLA guarantees, and white-labeled solutions. Revenue is generated through licensing fees and enterprise contracts.
Partner with content platforms, transcription tools, or educational services to offer bundled solutions or referrals. Revenue comes from affiliate commissions, co-marketing deals, and revenue-sharing agreements based on user referrals.
Integration Tip
Ensure the TRANSCRIPT_API_KEY is securely stored in environment variables or configuration files, and handle errors like 402 for no credits by prompting users to top up their accounts.