assemblyai-transcribeTranscribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
Install via ClawdBot CLI:
clawdbot install tristanmanchester/assemblyai-transcribeGrade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://www.assemblyai.com/docsAudited Apr 16, 2026 · audit v1.0
Generated Mar 20, 2026
Transcribe international business meetings with multiple speakers, automatically detecting languages and providing speaker labels. Useful for global teams needing accurate records with speaker identification and language support.
Generate transcripts for podcasts with speaker diarization and sentiment analysis, enabling content creators to produce subtitles, extract topics, and analyze audience engagement through structured outputs.
Transcribe legal depositions with high accuracy, speaker role mapping, and entity extraction to identify key names and terms. Supports creating searchable records and summaries for case preparation.
Convert university lectures or online courses into transcripts with paragraph exports and subtitles, aiding accessibility and study materials. Language detection handles diverse student populations.
Analyze customer service calls by transcribing audio, extracting sentiments and entities, and using LLM Gateway for structured insights to improve service quality and identify common issues.
Offer a cloud-based platform where businesses pay monthly fees for AssemblyAI transcription, diarization, and analysis features. Revenue comes from tiered plans based on usage volume and advanced capabilities like translation.
Sell API access to developers and companies embedding speech-to-text into their applications, charging per minute of audio processed. Focus on easy integration with agent-friendly outputs for AI workflows.
Provide customized packages for industries like legal or media, including on-premise deployment, dedicated support, and compliance features. Revenue is generated through annual contracts and service fees.
💬 Integration Tip
Ensure Node.js 18+ is installed and set the ASSEMBLYAI_API_KEY environment variable; use the bundled CLI with --bundle-dir for agent-friendly outputs in workflows.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.