aliyun-qwen-asr-realtime
Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...
Install via ClawdBot CLI:
clawdbot install cinience/aliyun-qwen-asr-realtime
Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Calls external URL not in known-safe list
https://help.aliyun.com/document_detail/2976098.html
Audited Apr 17, 2026 · audit v1.0
Generated May 21, 2026
Stream audio from a microphone during a live event or webinar and use the Aliyun Qwen ASR Realtime model to generate low-latency on-screen captions. This lets hearing-impaired attendees and non-native speakers follow along in real time.
Integrate the realtime ASR into a smart home device to process voice commands with minimal delay. Users can control lights, thermostats, or appliances by speaking naturally, and the assistant can respond with little perceptible lag.
Build a conversational AI agent that listens to a customer's speech in real time and can interrupt or respond appropriately. The duplex capability allows natural two-way conversation for handling inquiries or troubleshooting.
Use the ASR service to transcribe meetings, lectures, or interviews as they happen. Participants receive a live text feed for note-taking or accessibility, with final text saved for later review.
Enable speech-to-text input in a terminal application or browser extension, allowing users to dictate code, commands, or text content. The realtime streaming reduces typing effort and boosts productivity.
Offer the ASR realtime service as a paid API with tiered pricing based on the number of streaming hours or concurrent sessions. Ideal for startups or enterprises integrating speech recognition into their products.
Incorporate realtime ASR as an upsell feature within existing SaaS products (e.g., meeting software, virtual assistants). Customers pay extra for the live transcription capability on top of their base subscription.
License the ASR technology to enterprises that want to brand the speech recognition as their own in custom applications. Provide customization and dedicated support.
💬 Integration Tip
Ensure your client application supports WebSocket streaming for low-latency audio input. Start with 16 kHz mono PCM and small chunks (e.g., 100 ms of audio per message) for best responsiveness.
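As a rough illustration of the chunking arithmetic behind this tip, the sketch below computes how many bytes 100 ms of 16 kHz mono PCM occupies and slices a buffer into chunks of that size. It assumes 16-bit samples (2 bytes each), which the tip does not specify; the function names are hypothetical and not part of any Aliyun SDK. Sending each chunk over the WebSocket is left out, since the message format depends on the service documentation.

```python
from typing import Iterator

SAMPLE_RATE_HZ = 16_000  # 16 kHz, from the tip
CHANNELS = 1             # mono, from the tip
BYTES_PER_SAMPLE = 2     # assumption: 16-bit PCM
CHUNK_MS = 100           # chunk duration suggested by the tip


def chunk_size_bytes(
    sample_rate_hz: int = SAMPLE_RATE_HZ,
    channels: int = CHANNELS,
    bytes_per_sample: int = BYTES_PER_SAMPLE,
    chunk_ms: int = CHUNK_MS,
) -> int:
    """Return the number of bytes covering chunk_ms of raw PCM audio."""
    return sample_rate_hz * channels * bytes_per_sample * chunk_ms // 1000


def iter_chunks(pcm: bytes, size: int) -> Iterator[bytes]:
    """Yield consecutive slices of pcm, each at most size bytes long.

    The last slice may be shorter if the buffer length is not a
    multiple of size.
    """
    if size <= 0:
        raise ValueError("size must be positive")
    for start in range(0, len(pcm), size):
        yield pcm[start:start + size]


if __name__ == "__main__":
    size = chunk_size_bytes()
    one_second_of_silence = bytes(SAMPLE_RATE_HZ * CHANNELS * BYTES_PER_SAMPLE)
    chunks = list(iter_chunks(one_second_of_silence, size))
    print(f"chunk size: {size} bytes, chunks per second: {len(chunks)}")
```

Under these assumptions each chunk is 3200 bytes, so one second of audio becomes ten messages. Smaller chunks reduce the delay before the first partial result but increase per-message overhead.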
Scored May 21, 2026