ima-ai-music-song-voice-generator
AI music generator and voice generator with Suno sonic v5, DouBao BGM, and DouBao Song. Generate AI music, songs with lyrics, background music, soundtracks,...
Install via ClawdBot CLI:
clawdbot install dai-shuo/ima-ai-music-song-voice-generator
Grade: Fair, based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Sends data to undocumented external endpoint (potential exfiltration)
Send → https://api.imastudio.com
Potentially destructive shell commands in tool definitions
rm -rf ~
Calls external URL not in known-safe list
https://imastudio.com
Audited Apr 17, 2026 · audit v1.0
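The audit findings above (a destructive shell command and URLs outside a known-safe list) can be reproduced locally with a simple text scan before installing a skill. The following is a minimal sketch, not the registry's actual audit logic: the `KNOWN_SAFE_HOSTS` set, the regular expressions, and the `scan_skill_text` function are all assumptions made for illustration.

```python
import re
from urllib.parse import urlparse

# Hypothetical allowlist: the registry's real "known-safe list" is not published here.
KNOWN_SAFE_HOSTS = {"github.com", "pypi.org"}

# Illustrative heuristic: recursive forced removal aimed at the home directory
# or an absolute path. A real audit would likely check many more patterns.
DESTRUCTIVE_PATTERNS = [
    re.compile(r"\brm\s+-(?:rf|fr)\s+(?:~|/|\$HOME)"),
]

# Matches http(s) URLs up to the next whitespace, quote, backtick, or bracket.
URL_PATTERN = re.compile(r"https?://[^\s`'\"<>)]+")


def scan_skill_text(text):
    """Return (finding_type, matched_text) tuples for suspicious content."""
    findings = []
    for pattern in DESTRUCTIVE_PATTERNS:
        for match in pattern.finditer(text):
            findings.append(("destructive-command", match.group(0)))
    for match in URL_PATTERN.finditer(text):
        url = match.group(0)
        host = urlparse(url).hostname or ""
        if host not in KNOWN_SAFE_HOSTS:
            findings.append(("unlisted-url", url))
    return findings


if __name__ == "__main__":
    sample = "Send -> https://api.imastudio.com\nrm -rf ~\n"
    for kind, detail in scan_skill_text(sample):
        print(kind, detail)
```

A scan like this only flags text for manual review. It cannot tell whether a flagged command is ever executed or whether a flagged endpoint is the skill's documented API.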
Generated May 23, 2026
Content creators can generate custom background music, soundtracks, or jingles for YouTube videos, social media clips, or presentations. Using Suno sonic v5 or DouBao BGM, they can create royalty-free music tailored to mood, genre, and tempo, enhancing viewer engagement without licensing issues.
Marketing teams can quickly produce catchy jingles for ads, brand anthems, or promotional audio. The skill supports custom lyrics, genre tags, and vocal styles, enabling rapid prototyping of audio branding assets for campaigns.
Indie game developers can generate ambient background loops, battle themes, or cinematic scores using DouBao BGM or Suno's instrumental mode. The ability to specify moods and tempos helps create immersive game audio without a composer.
While primarily for music, the skill can generate narration-style content with Suno's vocal capabilities (note: for pure TTS, use ima-tts-ai). This is useful for audiobooks, explainer videos, or e-learning modules requiring custom audio.
Individuals can generate custom songs with lyrics, vocal gender, and style tags for events like weddings, birthdays, or personal projects. The simple prompt-based interface makes AI composition accessible to non-musicians.
Offer a limited number of free music generations per month with watermarked or lower-quality output, then charge per generation or via subscription for full-resolution, royalty-free tracks. The service can be integrated as an API add-on for content platforms.
Provide the AI music generation as a white-label service for agencies, studios, or brands. They can rebrand the tool, set their pricing, and generate custom audio for clients, sharing revenue or licensing the technology.
Integrate the generator into social media apps or video editors where users create audio for free, with occasional ads or sponsored genres. Monetize through ad impressions and premium skip options.
💬 Integration Tip
Always read SKILL-DETAIL.md before the first generation call to avoid API parameter errors, and use the exact model_id from the model reference table.
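One way to follow this tip is to check the `model_id` against the reference table before making any generation call. The sketch below assumes the table has been copied into a dictionary by hand. The `MODEL_REFERENCE` entries are placeholder strings, not real identifiers, and `resolve_model_id` is a hypothetical helper rather than part of the skill.

```python
# Hypothetical stand-in for the model reference table in SKILL-DETAIL.md.
# The values are placeholders: copy the real model_id strings from that file.
MODEL_REFERENCE = {
    "Suno sonic v5": "example-suno-model-id",
    "DouBao BGM": "example-bgm-model-id",
    "DouBao Song": "example-song-model-id",
}


def resolve_model_id(model_id, table=MODEL_REFERENCE):
    """Return model_id unchanged if it matches a table entry exactly, else raise."""
    valid_ids = set(table.values())
    if model_id not in valid_ids:
        raise ValueError(
            f"Unknown model_id {model_id!r}; expected one of {sorted(valid_ids)}"
        )
    return model_id


if __name__ == "__main__":
    print(resolve_model_id("example-bgm-model-id"))
```

The comparison is deliberately exact and case-sensitive, so a near miss fails locally instead of reaching the API as a parameter error.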
Scored Jun 20, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Any-to-any AI sub-agent — research, images, video, audio, music, podcasts, avatars, voice cloning, documents, spreadsheets, dashboards, 3D models, diagrams,...
Speak responses aloud on macOS using the built-in `say` command when user input indicates Voice Wake/voice recognition (for example, messages starting with "User talked via voice recognition on <device>").
High-quality voice synthesis with 18 personas, 32 languages, sound effects, batch processing, and voice design using ElevenLabs API.