qwen-asr
Transcribe audio files using Qwen ASR (千问STT). Use when the user sends voice messages and wants them converted to text.
Install via ClawdBot CLI:
clawdbot install al-one/qwen-asr
Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://github.com/aahl/qwen-asr2api
Audited Apr 16, 2026 · audit v1.0
Generated Mar 21, 2026
Users send voice messages in messaging apps and need them converted to text for accessibility or documentation. This skill can be integrated into chat platforms to automatically transcribe audio, aiding users with hearing impairments or those in noisy environments.
Professionals record meetings or interviews and require accurate transcripts for note-taking, legal compliance, or content creation. This skill converts the audio files into text transcripts, saving time and improving productivity in business settings.
Educators and students use audio lectures or podcasts that need transcription for study materials or accessibility purposes. This skill helps convert spoken content into text, supporting learning for individuals with different needs and enhancing educational resources.
Content creators produce videos or podcasts and require transcripts for subtitles, SEO optimization, or audience engagement. This skill automates the transcription process, reducing manual effort and speeding up content delivery in media and entertainment.
Companies record customer support calls for quality assurance and training. This skill transcribes these recordings into text for analysis, helping identify common issues, improve service, and maintain compliance records in customer service operations.
Offer basic transcription services for free with limited features, such as file size or processing speed. Charge for premium plans that include higher accuracy, faster processing, and advanced analytics, targeting small businesses and individual users.
License the transcription technology as an API to other developers and companies for integration into their applications. Generate revenue through usage-based pricing or flat-rate licensing fees, appealing to tech startups and enterprise clients.
Provide customized transcription solutions for large organizations in sectors like legal, healthcare, or media. Offer additional features such as data security, compliance support, and integration with existing workflows, with revenue from long-term contracts.
💬 Integration Tip
Ensure all dependencies like uv and aiohttp are installed via pip or brew, and test with sample audio files to verify functionality before deployment.
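The check described in the tip could be sketched as follows. This is a minimal example, not part of the skill itself: it assumes the skill needs the `uv` executable on PATH and the `aiohttp` module importable from the current interpreter, and the function name `missing_dependencies` is hypothetical. If the skill runs inside an environment managed by uv, `aiohttp` may live there instead, in which case the module check should be run from that environment.

```python
import importlib.util
import shutil


def missing_dependencies(commands=("uv",), modules=("aiohttp",)):
    """Return the names of CLI tools and Python modules that cannot be found.

    The default names are assumptions taken from the integration tip,
    not a verified list of what the skill requires.
    """
    missing = []
    for command in commands:
        # shutil.which returns None when the executable is not on PATH.
        if shutil.which(command) is None:
            missing.append(command)
    for module in modules:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module) is None:
            missing.append(module)
    return missing


if __name__ == "__main__":
    missing = missing_dependencies()
    if missing:
        print("Missing dependencies:", ", ".join(missing))
    else:
        print("All dependencies found.")
```

Running this before deployment only confirms that the tools are present; transcribing a short sample audio file is still needed to verify the skill end to end.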
Scored Apr 23, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud).
Start voice calls via the OpenClaw voice-call plugin.