text2speechSenseAudio Text-to-Speech (TTS) API for converting text to natural speech. Supports synchronous and SSE streaming modes, multiple voices, emotion control, sp...
Install via ClawdBot CLI:
clawdbot install scikkk/text2speechGrade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
POST → https://api.senseaudio.cn/v1/t2a_v2`Calls external URL not in known-safe list
https://senseaudio.cn/docs/text_to_speech_introductionAI Analysis
The skill's external API call (api.senseaudio.cn) is directly documented for its stated text-to-speech purpose, and authentication uses a dedicated, declared environment variable. No hidden instructions, credential harvesting, or obfuscation are present in the provided definition.
Audited Apr 18, 2026 · audit v1.0
Generated Mar 22, 2026
Automatically generate voiceovers for educational videos and online courses, supporting multiple languages and emotional tones to enhance engagement. Ideal for platforms needing scalable, natural-sounding audio for lessons and tutorials.
Convert written books or articles into audio format with customizable voices and streaming capabilities for efficient processing. Useful for publishers and content creators looking to expand into audio markets without extensive recording sessions.
Integrate TTS into interactive voice response systems to provide dynamic, natural-sounding prompts and information in multiple languages. Helps businesses improve customer experience with clear, adjustable speech for automated support.
Enable text-to-speech functionality in apps and devices to read out digital content like websites or documents, with speed and pitch controls for user comfort. Supports inclusive design for assistive technology developers.
Create promotional audio content for commercials, podcasts, or social media with emotion control and streaming for real-time adjustments. Allows marketers to produce high-quality, localized ads quickly and cost-effectively.
Offer the TTS API on a subscription or pay-per-use basis to developers and businesses, charging based on character usage or audio length. Generates recurring revenue from integrations in apps, websites, and enterprise systems.
License the TTS technology to other companies for embedding into their products, such as e-learning platforms or voice assistants, with custom branding and support. Provides upfront licensing fees and ongoing maintenance contracts.
Collaborate with media producers and publishers to generate audio content like audiobooks or podcasts, offering bulk processing and premium voice options. Revenue comes from project-based fees and royalties on distributed content.
💬 Integration Tip
Ensure the SENSEAUDIO_API_KEY is securely stored as an environment variable and test both streaming and non-streaming modes to optimize for latency and resource usage.
Scored Apr 19, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud)
Start voice calls via the OpenClaw voice-call plugin.