yandex-speechkit-stt
Speech recognition via the Yandex SpeechKit API for voice messages in Telegram. Use when the user sends voice messages and wants...
Install via ClawdBot CLI:
clawdbot install strydex/yandex-speechkit-stt
Grade: Fair, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://iam.api.cloud.yandex.net/iam/v1/tokens
Audited Apr 16, 2026 · audit v1.0
Generated Mar 20, 2026
Businesses can integrate this skill into their Telegram support bots to automatically transcribe voice messages from customers. This enables faster response times by converting spoken queries into text for agents or automated systems to process, improving efficiency in handling customer inquiries.
Content creators or media agencies can use this skill to transcribe voice notes sent via Telegram for podcasts, interviews, or meetings. It provides a quick way to generate text transcripts from audio, saving time on manual transcription and facilitating content repurposing.
Educational platforms can leverage this skill to transcribe student voice submissions in language learning apps on Telegram. It allows for automated feedback on pronunciation and fluency by converting spoken responses into text for analysis or grading by instructors.
Healthcare providers can use this skill in Telegram-based telemedicine services to transcribe voice messages from patients describing symptoms. This helps in creating accurate medical records and summaries, reducing documentation time for healthcare professionals.
Law firms or administrative offices can integrate this skill to transcribe voice memos sent via Telegram for case notes, meeting minutes, or client instructions. It streamlines the documentation process by converting audio into searchable text for legal records.
Offer this skill as part of a subscription-based service where users pay a monthly fee for a certain number of transcriptions. Target small businesses or individuals who need regular voice-to-text conversion via Telegram, with tiered pricing based on usage limits.
Monetize by providing this skill as an API that other developers can integrate into their applications, charging per transcription request. This model appeals to tech companies or startups building voice-enabled features, with revenue generated from API call fees.
License this skill as a white-label solution for large enterprises or organizations to embed into their internal communication tools on Telegram. Charge a one-time setup fee and ongoing support costs, targeting sectors like healthcare or customer service for customized deployments.
💬 Integration Tip
Configure your Yandex Cloud credentials, and install the required dependencies (ffmpeg and the Python packages the skill uses) so that audio format conversion works.
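The tip above can be illustrated with a minimal sketch in Python. This is not the skill's actual code. It only shows one plausible shape of the flow: convert the audio with ffmpeg, exchange a credential at the IAM token URL listed in the audit note, then send the audio for recognition. The recognition endpoint (`STT_URL`), the `yandexPassportOauthToken` request field, the `folderId` and `lang` query parameters, and all function names are assumptions made for illustration, based on the publicly documented SpeechKit v1 and IAM APIs. The ffmpeg command also assumes a build with `libopus` enabled.

```python
import json
import urllib.parse
import urllib.request

# URL taken from the audit note in this listing.
IAM_URL = "https://iam.api.cloud.yandex.net/iam/v1/tokens"
# Assumption: SpeechKit v1 synchronous recognition endpoint (not named in the listing).
STT_URL = "https://stt.api.cloud.yandex.net/speech/v1/stt:recognize"


def ffmpeg_command(src: str, dst: str) -> list[str]:
    """Build an ffmpeg command that re-encodes audio to mono 48 kHz OGG/Opus."""
    return ["ffmpeg", "-y", "-i", src, "-ac", "1", "-ar", "48000",
            "-c:a", "libopus", dst]


def build_iam_request(oauth_token: str) -> urllib.request.Request:
    """Prepare the POST that exchanges an OAuth token for an IAM token."""
    body = json.dumps({"yandexPassportOauthToken": oauth_token}).encode("utf-8")
    return urllib.request.Request(
        IAM_URL,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def build_stt_request(iam_token: str, folder_id: str, audio: bytes,
                      lang: str = "ru-RU") -> urllib.request.Request:
    """Prepare the POST that sends raw audio bytes for recognition."""
    query = urllib.parse.urlencode({"folderId": folder_id, "lang": lang})
    return urllib.request.Request(
        f"{STT_URL}?{query}",
        data=audio,
        headers={"Authorization": f"Bearer {iam_token}"},
        method="POST",
    )


def send_json(request: urllib.request.Request) -> dict:
    """Send a prepared request and decode the JSON response (needs network)."""
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

In use, the list from `ffmpeg_command` would be passed to `subprocess.run`, and each prepared request to `send_json`. Assuming the documented response shapes, the IAM token would be read from the `iamToken` field and the transcript from the `result` field. Keeping request construction separate from sending makes the URLs and headers easy to inspect without network access or real credentials.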
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.