assemblyai-transcribe
Transcribe, diarise, translate, post-process, and structure audio/video with AssemblyAI. Use this skill when the user wants AssemblyAI specifically, needs hi...
Install via ClawdBot CLI:
clawdbot install tristanmanchester/assemblyai-transcribe
Grade: Good, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated Mar 20, 2026
Transcribe international business meetings with multiple speakers, automatically detecting languages and providing speaker labels. Useful for global teams needing accurate records with speaker identification and language support.
Generate transcripts for podcasts with speaker diarization and sentiment analysis, enabling content creators to produce subtitles, extract topics, and analyze audience engagement through structured outputs.
Transcribe legal depositions with high accuracy, speaker role mapping, and entity extraction to identify key names and terms. Supports creating searchable records and summaries for case preparation.
Convert university lectures or online courses into transcripts with paragraph exports and subtitles, aiding accessibility and study materials. Automatic language detection supports courses taught in different languages.
Analyze customer service calls by transcribing audio, extracting sentiments and entities, and using LLM Gateway for structured insights to improve service quality and identify common issues.
Offer a cloud-based platform where businesses pay monthly fees for AssemblyAI transcription, diarization, and analysis features. Revenue comes from tiered plans based on usage volume and advanced capabilities like translation.
Sell API access to developers and companies embedding speech-to-text into their applications, charging per minute of audio processed. Focus on easy integration with agent-friendly outputs for AI workflows.
Provide customized packages for industries like legal or media, including on-premise deployment, dedicated support, and compliance features. Revenue is generated through annual contracts and service fees.
💬 Integration Tip
Ensure Node.js 18+ is installed and set the ASSEMBLYAI_API_KEY environment variable; use the bundled CLI with --bundle-dir for agent-friendly outputs in workflows.
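The two prerequisites in the tip can be checked before any transcription work starts. The following is a minimal sketch, not part of the skill itself: the function name `checkPrerequisites` and its messages are hypothetical, and it only verifies the Node.js major version and the presence of `ASSEMBLYAI_API_KEY`. It does not call the bundled CLI or the AssemblyAI API.

```typescript
// Hypothetical preflight helper (illustrative only; not shipped with the skill).
// Takes the Node.js version string and an environment map as arguments so the
// check stays a pure function that is easy to test.
function checkPrerequisites(
  nodeVersion: string,
  env: { [key: string]: string | undefined }
): string[] {
  const problems: string[] = [];

  // Node reports versions like "v18.17.0"; parseInt reads the leading major number.
  const major = parseInt(nodeVersion.replace(/^v/, ""), 10);
  if (isNaN(major) || major < 18) {
    problems.push("Node.js 18+ is required, found " + nodeVersion);
  }

  // The API key must be present and not blank.
  const key = env["ASSEMBLYAI_API_KEY"];
  if (key === undefined || key.trim() === "") {
    problems.push("ASSEMBLYAI_API_KEY is not set");
  }

  return problems;
}
```

In a Node.js script this would typically be called as `checkPrerequisites(process.version, process.env)`, stopping with the returned messages if the list is not empty.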
Scored Apr 15, 2026
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Local text-to-speech via sherpa-onnx (offline, no cloud).
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
Transcribe audio files to text using local Whisper (Docker). Use when receiving voice messages, audio files (.mp3, .m4a, .ogg, .wav, .webm), or when asked to transcribe audio content.