⚠️Install with caution. This skill has very few installs. Always review the source and verify it on clawhub.ai before installing. Community-built skills run with agent permissions — only install ones you trust.

🎤 Speech & Audio

AudioPodv1.2.3

Name: AudioPod
Author: Rakesh1002

audiopod

Rakesh1002

Use AudioPod AI's API for audio processing tasks including AI music generation (text-to-music, text-to-rap, instrumentals, samples, vocals), stem separation, text-to-speech, noise reduction, speech-to-text transcription, speaker separation, and media extraction. Use when the user needs to generate music/songs/rap from text, split a song into stems/vocals/instruments, generate speech from text, clean up noisy audio, transcribe audio/video, or extract audio from YouTube/URLs. Requires AUDIOPOD_API_KEY env var or pass api_key directly.

stttranscriptionttsyoutube

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

3.6K

Stars

CreatedJan 31, 2026

UpdatedFeb 27, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install Rakesh1002/audiopod

Skill Package3 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

B61/100

Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation10/35

· 1 installs (minimal)
· 2829 downloads (high demand)
· 3 stars

Documentation20/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Contains usage examples or trigger description
· Detailed summary

Package Completeness6/15

Security Analysis

💙 Low Risk

UNKNOWN_DATA_SINKhigh

Sends data to undocumented external endpoint (potential exfiltration)

POST → https://api.audiopod.ai/api/v1/music/text2music

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://audiopod.ai/auth/signup

AI Analysis

The skill's external API calls (audiopod.ai) are directly related to its stated purpose of audio processing and music generation, with no evidence of credential harvesting, hidden instructions, or obfuscation. The 'unknown data sink' signal is expected as it's the primary service endpoint, and user data transmission is authorized and necessary for functionality.

Audited Apr 17, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 21, 2026

Musicians and ProducersContent Creators and MarketersDevelopers and Tech Teamsbeginner

💡 Application Scenarios

Music Production and Content CreationMedia and Entertainment

Independent musicians and content creators can generate custom background music, instrumentals, or full songs from text prompts for videos, podcasts, or social media. This reduces licensing costs and enables rapid prototyping of musical ideas without requiring extensive audio engineering skills.

Audio Post-Production and RemixingMusic Production

Audio engineers and producers can use stem separation to isolate vocals, drums, or other instruments from existing tracks for remixes, karaoke versions, or sample extraction. This facilitates creative reuse and enhances mixing workflows in music production studios.

Accessibility and Transcription ServicesEducation and Technology

Organizations can transcribe audio and video content into text using speech-to-text, making media accessible for hearing-impaired audiences or enabling searchable archives. This is useful for educational institutions, corporate training, and media companies.

Marketing and AdvertisingAdvertising

Marketing agencies can generate custom voiceovers, jingles, or sound effects from text prompts for commercials, presentations, or branded content. This allows for quick iteration and localization of audio assets without hiring voice actors or composers.

Podcast and Video EnhancementDigital Media

Podcasters and videographers can clean up noisy recordings with noise reduction, add AI-generated intros or outros, and extract audio from YouTube URLs for analysis or repurposing. This improves production quality and streamlines editing processes.

💼 Business Models

Pay-as-You-Go API ServiceUsage-based fees (e.g., per minute of audio generated or transcribed)

Charge users based on usage metrics like audio duration processed or number of API calls, with tiered pricing for different tasks such as music generation or transcription. This model appeals to developers and businesses needing scalable, on-demand audio processing without upfront commitments.

Subscription for CreatorsRecurring subscription fees with tiered plans (e.g., basic, pro, enterprise)

Offer monthly or annual subscriptions with bundled credits for music generation, stem separation, and other features, targeting individual creators, small studios, or freelancers. Include premium support and higher rate limits to encourage long-term engagement and predictable revenue.

White-Label Enterprise SolutionsAnnual licensing fees or custom enterprise contracts

License the API to larger companies for integration into their own platforms, such as video editing software, e-learning tools, or social media apps. Provide custom pricing, dedicated infrastructure, and co-branding options to serve B2B clients with high-volume needs.

💬 Integration Tip

Start by setting the AUDIOPOD_API_KEY environment variable and using the Python or Node.js SDK for quick prototyping; test with free credits before scaling.