🎤 Speech & Audio

speech-recognitionv1.0.1

Name: speech-recognition
Author: demo112

speech-recognition

demo112

通用语音识别 Skill。支持多种音频格式（ogg/mp3/wav/m4a），使用硅基流动 SenseVoice API 进行语音转文字。当用户发送语音消息、音频文件，或需要转录音频时触发。

stttranscriptiontts

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

1.3K

Stars

CreatedFeb 25, 2026

UpdatedFeb 25, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install demo112/speech-recognition

Skill Package2 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

B63/100

Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation16/35

· 12 installs (average)
· 1256 downloads (moderate demand)
· 2 stars

Documentation14/25

· SKILL.md present
· Moderate documentation (≥1500 chars)
· Detailed summary

Package Completeness8/15

· skillAssets present (1 files)

Security Analysis

💙 Low Risk

UNKNOWN_DATA_SINKhigh

Sends data to undocumented external endpoint (potential exfiltration)

POST → https://api.siliconflow.cn/v1/audio/transcriptions

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://api.siliconflow.cn/v1/audio/transcriptions

AI Analysis

The skill's external API call to siliconflow.cn is explicitly documented for its stated purpose of speech recognition, with clear privacy disclosure that audio is uploaded to their servers. No hidden instructions, credential harvesting, or obfuscation are present, but the data transfer to a third-party service warrants user awareness.

Audited Apr 16, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 20, 2026

Developers integrating audio processingBusinesses needing transcription servicesbeginner

💡 Application Scenarios

Customer Service Call TranscriptionCustomer Support

Transcribes customer service audio calls into text for analysis and record-keeping. Enables automated logging of inquiries and complaints, improving response tracking and compliance.

Meeting Minutes GenerationCorporate

Converts recorded meeting audio into written transcripts for documentation. Facilitates easy sharing of key points and action items among team members, enhancing productivity.

Educational Content AccessibilityEducation

Transcribes lecture recordings or educational podcasts into text to support students with hearing impairments or those who prefer reading. Aids in creating study materials and subtitles.

Media Production SubtitlingMedia and Entertainment

Generates text transcripts from audio in videos or podcasts for creating subtitles or closed captions. Helps content creators reach wider audiences and comply with accessibility standards.

Legal Deposition DocumentationLegal

Transcribes audio recordings of legal depositions or interviews into accurate text records. Supports legal professionals in case preparation and evidence organization.

💼 Business Models

API-as-a-ServiceUsage-based fees or monthly subscriptions

Offers the speech recognition API to developers on a pay-per-use or subscription basis. Charges based on audio duration or number of requests, providing scalable access for various applications.

Enterprise LicensingOne-time license fees or annual contracts

Sells customized licenses to businesses for integrating the skill into internal systems like CRM or collaboration tools. Includes support, customization, and volume discounts for large-scale deployments.

Freemium with Premium FeaturesUpgrades to premium tiers and add-on services

Provides basic transcription for free with limited usage, while charging for advanced features like higher accuracy, faster processing, or bulk file handling. Targets individual users and small teams.

💬 Integration Tip

Ensure API key is securely stored and handle audio format conversions using FFmpeg for compatibility.