alicloud-ai-audio-cosyvoice-voice-cloneUse when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...
Install via ClawdBot CLI:
clawdbot install cinience/alicloud-ai-audio-cosyvoice-voice-cloneGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
POST → https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization`Calls external URL not in known-safe list
https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization`AI Analysis
The skill interacts with documented Alibaba Cloud AI services (DashScope) for its stated purpose of voice cloning, which is consistent with legitimate functionality. While it sends audio data to external servers, these are official Alibaba Cloud endpoints, not unauthorized or hidden destinations. No credential harvesting, hidden instructions, or obfuscation were found in the provided definition.
Audited Apr 16, 2026 · audit v1.0
Generated Mar 21, 2026
Audiobook platforms can clone a user's favorite narrator's voice from a public sample to generate custom narrations for new titles. This enhances user engagement by offering familiar voice styles across different books, leveraging the voice_id for consistent TTS synthesis.
Companies can create branded voice assistants by cloning a spokesperson's voice from promotional audio, enabling personalized customer interactions via chatbots or IVR systems. The voice_id ensures uniform voice output across all digital touchpoints.
E-learning platforms use voice cloning to adapt course materials into regional dialects or languages by enrolling reference audio from local speakers. This supports multilingual education with authentic accents, using the target_model for synthesis in specific deployments.
Game developers clone voices from actors or public samples to generate dynamic dialogue for non-player characters, reducing recording costs. The voice_id allows real-time voice synthesis during gameplay, compatible with cosyvoice models for varied audio effects.
Healthcare providers assist patients with voice disorders by cloning their pre-impairment voice from archived recordings, enabling communication through synthesized speech. This relies on the enrollment API to create a personalized voice_id for daily use.
Offer a SaaS platform where users pay monthly fees to clone and manage custom voices, with tiered plans based on enrollment quotas and synthesis usage. Revenue is generated from recurring subscriptions and overage charges for API calls.
License the voice cloning technology to large enterprises for internal use in customer service or training, with upfront fees and annual support contracts. Revenue comes from one-time licensing deals and ongoing maintenance services.
Integrate the skill into an API marketplace where developers pay per enrollment and TTS call, with volume discounts for high usage. Revenue is driven by transaction fees and partnerships with cloud providers like Alibaba Cloud.
💬 Integration Tip
Ensure the target_model matches between enrollment and synthesis calls to avoid failures, and use the domestic endpoint for cosyvoice-v3.5-plus in China mainland deployments.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.