🎤 Speech & Audio

Voice Clone TTSv1.0.0

Name: Voice Clone TTS
Author: oliviapp8

voice-clone-tts

声纹克隆和语音合成。上传音频样本克隆声纹，用克隆声纹或预设声纹生成语音。支持多个后端：MiniMax、ElevenLabs、Fish Audio、Azure TTS、OpenAI TTS。支持情绪控制、语速调整、批量生成。触发词：语音合成、TTS、声纹克隆、voice clone、text to speech、配...

latest

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

599

Stars

CreatedMar 22, 2026

UpdatedMay 1, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install oliviapp8/voice-clone-tts

Skill Package2 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

B52/100

Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation5/35

· 229 downloads (low demand)
· 1 installs (minimal)

Documentation16/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Detailed summary

Package Completeness6/15

· skillAssets present (1 files)

Security Analysis

💙 Low Risk

UNKNOWN_DATA_SINKhigh

Sends data to undocumented external endpoint (potential exfiltration)

POST → https://api.minimax.chat/v1/text_to_speech

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://www.minimax.chat/

KNOWN_EXTERNALlow

Uses known external API (expected, informational)

api.openai.com

AI Analysis

The skill's external API calls (MiniMax, ElevenLabs, etc.) are consistent with its stated purpose of voice cloning and TTS. While it sends user audio/text data to third-party services, this is expected functionality, not unauthorized exfiltration. No hidden instructions, credential harvesting, or obfuscation were found in the provided definition.

💡

Usage Guide

Generated May 8, 2026

Content creators and influencers producing video or audio contentMarketing and advertising agencies automating voiceover productioneLearning and corporate training developers needing multilingual narrationintermediate

💡 Application Scenarios

Digital Human Video Production with Custom VoicesContent creation, marketing

Create video content with a consistent digital human avatar and a custom voice cloned from a short audio sample. The voice is synthesized per scene and synchronized with the avatar's mouth movements, enabling personalized spokesperson videos without reliance on platform-specific voice cloning.

Audiobook and Podcast NarrationPublishing, entertainment, education

Convert written content such as books, articles, or scripts into natural-sounding audio using cloned or preset voices. Supports batch generation with emotion and speed control for engaging listening experiences.

Multilingual Voiceover for Video and eLearningeLearning, corporate training, localization

Generate voiceovers in multiple languages by using backends like ElevenLabs or Azure TTS. Cloned voices can be used across languages, enabling consistent brand voice for global audiences.

Interactive Voice Applications (IVAs) with Custom TTSCustomer service, healthcare, finance

Integrate with chatbots or voice assistants to provide a unique, branded voice for responses. Clone a voice for personalized interaction or use preset voices for different personas.

Batch Dubbing of Video ScriptsMedia production, advertising

Automate the dubbing of video scenes by processing a script with scene-by-scene narration, emotions, and speeds. Produces a set of audio files ready for video editing or direct synchronization.

💼 Business Models

SaaS Subscription for Content CreatorsMonthly recurring fees from individual creators, with tiered pricing based on usage limits.

Offer a monthly subscription granting access to voice cloning, TTS synthesis, and batch generation with a limited number of characters or minutes. Premium tiers add advanced emotions, higher quality, and more backends.

Enterprise API Access for Media CompaniesPay-per-use or flat monthly fee based on API calls or processed audio minutes, with custom SLAs.

Provide API access for high-volume voice synthesis and cloning, integrating into existing production pipelines for dubbing studios, eLearning platforms, or video production houses.

White-Label Voice Cloning for Digital Human PlatformsLicensing fee per integration or revenue share from platform's voice feature usage.

License the voice cloning and TTS technology to digital human platforms that lack native voice cloning. The technology is integrated as a backend module, enabling the platform to offer custom voices to their users.

💬 Integration Tip

Automate the entire workflow by connecting the video-script-generator output to this skill for scenes and pipe the generated audio into digital-avatar or video-stitcher for seamless production.