gemini-stt
Transcribe audio files using Google's Gemini API or Vertex AI
Install via ClawdBot CLI:
clawdbot install araa47/gemini-stt
Grade: Good, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://ai.google.dev/gemini-api/docs/models
Uses known external API (expected, informational)
googleapis.com
Audited Apr 17, 2026 · audit v1.0
Generated Mar 1, 2026
Transcribe customer service calls from audio recordings for quality assurance and training. Enables analysis of customer interactions and agent performance. Useful for identifying common issues and improving service protocols.
Transcribe doctor-patient consultations or medical dictations into text for electronic health records. Helps streamline documentation, reduce administrative burden, and ensure accurate patient records. Supports compliance with healthcare regulations.
Convert recorded lectures or classroom discussions into text for accessibility and study materials. Assists students with disabilities and provides searchable notes for review. Enhances learning resources in online or hybrid education.
Transcribe court hearings, depositions, or client meetings for legal documentation and case preparation. Ensures accurate records for evidence and reduces manual transcription costs. Supports law firms in managing case files efficiently.
Generate transcripts for podcasts, videos, or live streams to create subtitles or closed captions. Improves accessibility for hearing-impaired audiences and enhances content reach. Useful for media producers and content creators.
Offer monthly or annual subscriptions for unlimited or tiered transcription usage. Target small businesses or individuals who need regular audio processing. Revenue comes from recurring fees based on usage limits or features.
Provide the skill as an API that charges per audio minute or file processed. Integrate with third-party platforms like CRM or project management tools. Revenue comes from transaction fees based on volume and processing time.
License the transcription technology to large organizations for internal use or resale. Customize branding and integrate with existing enterprise systems such as healthcare or legal software. Revenue comes from upfront licensing fees and ongoing support contracts.
💬 Integration Tip
Use environment variables for API keys to simplify deployment and ensure security in production environments.
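As a minimal sketch of the tip above, the helper below reads an API key from the environment and fails early if it is missing. The variable name `GEMINI_API_KEY` and the function name `load_api_key` are assumptions made for illustration; check the skill's own documentation for the name it actually expects.

```python
import os


def load_api_key(env_var: str = "GEMINI_API_KEY") -> str:
    """Return the API key stored in env_var, or raise if it is unset or blank.

    The default name GEMINI_API_KEY is an assumption for this sketch,
    not something confirmed by the skill's documentation.
    """
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(
            f"Set the {env_var} environment variable before running."
        )
    return key
```

Keeping the key out of source code means the same script can run in development and production with different credentials, and the key never lands in version control.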
Scored Apr 19, 2026
Use CodexBar CLI local cost usage to summarize per-model usage for Codex or Claude, including the current (most recent) model or a full model breakdown. Trigger when asked for model-level usage/cost data from codexbar, or when you need a scriptable per-model summary from codexbar cost JSON.
Gemini CLI for one-shot Q&A, summaries, and generation.
Manages free AI models from OpenRouter for OpenClaw. Automatically ranks models by quality, configures fallbacks for rate-limit handling, and updates openclaw.json. Use when the user mentions free AI, OpenRouter, model switching, rate limits, or wants to reduce AI costs.
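Use MiniMax MCP for image understanding and analysis. Trigger when: (1) the user asks to analyze a picture, understand an image, or describe image content; (2) objects, text, or scenes in an image need to be identified; (3) MiniMax's understand_image function is to be used.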