moodcastTransform any text into emotionally expressive audio with ambient soundscapes using ElevenLabs v3 audio tags and Sound Effects API
Install via ClawdBot CLI:
clawdbot install ashutosh887/moodcastTransform any text into emotionally expressive audio with ambient soundscapes. MoodCast analyzes your content, adds expressive delivery using ElevenLabs v3 audio tags, and layers matching ambient soundscapes.
Use MoodCast when the user wants to:
Trigger phrases: "read this dramatically", "make this sound good", "create audio for", "moodcast this", "read with emotion", "narrate this"
Slash command: /moodcast
Automatically analyzes text and inserts appropriate v3 audio tags:
[excited], [nervous], [angry], [sorrowful], [calm], [happy][whispers], [shouts], [rushed], [slows down][laughs], [sighs], [gasps], [giggles], [crying][pause], [breathes], [stammers], [hesitates]Creates matching background audio using Sound Effects API:
For conversations/scripts, assigns different voices to speakers with appropriate emotional delivery.
python3 {baseDir}/scripts/moodcast.py --text "Your text here"
python3 {baseDir}/scripts/moodcast.py --text "Your text here" --ambient "coffee shop background noise"
python3 {baseDir}/scripts/moodcast.py --text "Your text here" --output story.mp3
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood dramatic
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood calm
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood excited
python3 {baseDir}/scripts/moodcast.py --text "Your text" --mood scary
python3 {baseDir}/scripts/moodcast.py --list-voices
python3 {baseDir}/scripts/moodcast.py --text "Your text" --voice VOICE_ID --model eleven_v3 --output-format mp3_44100_128
The skill automatically detects and enhances:
| Text Pattern | Audio Tag Added |
|-------------|-----------------|
| "amazing", "incredible", "wow" | [excited] |
| "scared", "afraid", "terrified" | [nervous] |
| "angry", "furious", "hate" | [angry] |
| "sad", "sorry", "unfortunately" | [sorrowful] |
| "secret", "quiet", "between us" | [whispers] |
| "!" exclamations | [excited] |
| "..." trailing off | [pause] |
| "haha", "lol" | [laughs] |
| Questions | Natural rising intonation |
Input:
Breaking news! Scientists have discovered something incredible.
This could change everything we know about the universe...
I can't believe it.
Enhanced Output:
[excited] Breaking news! Scientists have discovered something incredible.
[pause] This could change everything we know about the universe...
[gasps] [whispers] I can't believe it.
Input:
It was a dark night. The old house creaked.
Something moved in the shadows...
"Who's there?" she whispered.
Enhanced Output:
[slows down] It was a dark night. [pause] The old house creaked.
[nervous] Something moved in the shadows...
[whispers] "Who's there?" she whispered.
ELEVENLABS_API_KEY (required) - Your ElevenLabs API keyMOODCAST_DEFAULT_VOICE (optional) - Default voice ID (defaults to CwhRBWXzGAHq8TQ4Fs17)MOODCAST_MODEL (optional) - Default model ID (defaults to eleven_v3)MOODCAST_OUTPUT_FORMAT (optional) - Default output format (defaults to mp3_44100_128)MOODCAST_AUTO_AMBIENT (optional) - Set to "true" for automatic ambient sounds when using --moodConfiguration Priority: CLI arguments override environment variables, which override hardcoded defaults.
[whispers] not [WHISPERS]Built by ashutosh887
Using ElevenLabs Text-to-Speech v3 + Sound Effects API
Created for #ClawdEleven Hackathon
Generated Mar 1, 2026
Media companies can use MoodCast to produce engaging audio versions of articles and news stories, adding emotional narration and ambient soundscapes to increase listener retention and shareability. This is ideal for turning written content into podcasts or social media clips.
Educational platforms can integrate MoodCast to narrate learning materials with expressive voices and background sounds, making lessons more immersive and accessible for students. It enhances engagement in online courses and interactive textbooks.
Game developers and storytellers can use MoodCast to generate dynamic voiceovers for characters and narratives, with emotional tags and ambient effects to create atmospheric audio experiences. This supports indie games and interactive fiction projects.
Marketing agencies can leverage MoodCast to produce compelling audio ads and promotional content, using emotional delivery and soundscapes to capture audience attention and convey brand messages effectively. It's suitable for radio spots and digital campaigns.
Organizations can employ MoodCast to convert text into expressive audio for visually impaired users, offering natural-sounding narration with emotional context to improve comprehension and enjoyment of written materials.
Offer MoodCast as a cloud API with tiered subscription plans based on usage credits, targeting developers and businesses needing scalable audio generation. Revenue comes from monthly fees and overage charges for high-volume users.
Provide a free version with limited features and credits, encouraging users to upgrade to premium plans for advanced capabilities like custom voices and unlimited ambient sounds. This model attracts individual creators and small teams.
License MoodCast technology to large companies for integration into their own platforms, such as e-learning systems or media apps, with custom branding and support. Revenue is generated through licensing fees and ongoing maintenance contracts.
đź’¬ Integration Tip
Ensure the ELEVENLABS_API_KEY is securely stored and use CLI arguments for quick testing before embedding into larger applications.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
End-to-end encrypted agent-to-agent private messaging via Moltbook dead drops. Use when agents need to communicate privately, exchange secrets, or coordinate without human visibility.
Text-to-speech via OpenAI Audio Speech API.