⚠️Install with caution. This skill has very few installs. Always review the source and verify it on clawhub.ai before installing. Community-built skills run with agent permissions — only install ones you trust.

🎤 Speech & Audio

Audiov1.0.1

Name: Audio
Author: ivangdavila

audio

ivangdavila

Process, enhance, and convert audio files with noise removal, normalization, format conversion, transcription, and podcast workflows.

latest

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

3.1K

Stars

CreatedFeb 13, 2026

UpdatedMay 17, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install ivangdavila/audio

Requires:

ffmpegffprobe

Skill Package5 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

B52/100

Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation10/35

· 832 downloads (moderate demand)
· 2 installs (very low)
· 2 stars

Documentation14/25

· SKILL.md present
· Moderate documentation (≥1500 chars)
· Detailed summary

Package Completeness6/15

· skillAssets present (4 files)

Security Analysis

⚠️ Medium Risk

UNKNOWN_DATA_SINKhigh

Sends data to undocumented external endpoint (potential exfiltration)

POST → https://api.assemblyai.com/v2/transcript

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://api.assemblyai.com/v2/transcript

AI Analysis

The skill definition explicitly states it does not access cloud services without user knowledge, but the rule-based signals found indicate a POST request to an external transcription API (AssemblyAI). This creates a conflict between documented scope and actual behavior, posing a privacy risk if user audio is sent without clear consent.

Audited Apr 16, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 20, 2026

Podcasters and audio producersContent creators and videographersDevelopers and tech enthusiastsintermediate

💡 Application Scenarios

Podcast ProductionMedia and Entertainment

Podcasters can use this skill to normalize audio levels to platform standards like Spotify's -16 LUFS, remove background noise, and convert files to formats like MP3 or AAC for distribution. It supports workflow steps from raw recording to final export, ensuring professional sound quality.

Video Content CreationDigital Media

Content creators can extract audio from videos, transcribe it into subtitles (SRT/VTT), and optimize audio quality for platforms like YouTube. This helps improve accessibility and engagement by providing clear, normalized audio and text transcripts.

Audio Archiving and ConversionEducation and Heritage

Libraries, museums, or individuals can convert legacy audio formats to modern ones like FLAC for lossless archiving or MP3 for sharing. The skill handles format conversion while preserving quality, making it useful for digitization projects.

Music Production and EditingMusic Industry

Musicians and producers can separate stems (e.g., vocals, drums) using Demucs, apply noise reduction, and normalize tracks for streaming platforms. This aids in remixing, mastering, and preparing music for distribution on services like Spotify.

Corporate Training and AccessibilityCorporate and Training

Businesses can transcribe training videos or meetings into text for documentation and compliance, while also enhancing audio clarity with noise removal. This supports accessibility initiatives and improves content usability for employees.

💼 Business Models

Freemium ServiceSubscription fees and tiered pricing

Offer basic audio processing (e.g., format conversion, noise removal) for free, with premium features like advanced transcription, stem separation, or batch processing available via subscription. This attracts a broad user base while monetizing power users.

B2B SaaS for Media CompaniesLicensing fees and annual contracts

License the skill as a white-label solution for podcast networks, video production studios, or streaming platforms to integrate audio processing into their workflows. Provide API access and custom integrations for automated processing.

Pay-per-Use MicroservicesUsage-based pricing and API call fees

Deploy the skill as a cloud-based API where users pay per task, such as per minute of transcription or per file processed. This model suits occasional users or developers needing on-demand audio enhancement without upfront costs.

💬 Integration Tip

Ensure ffmpeg and ffprobe are installed on the system; for advanced features like transcription, consider integrating Whisper API or local setup for better performance.