audio-quality-check
Analyze audio recording quality: echo detection, loudness, speech intelligibility, SNR, spectral analysis. Use when the user wants to check a recording's qu...
Install via ClawdBot CLI:
clawdbot install tenequm/audio-quality-check
Grade: Fair. Based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.
Generated May 5, 2026
A team leader wants to check if their conference call recordings have echo or poor speech intelligibility. The skill analyzes dual-track M4A files, detecting echo, measuring SNR, and providing PESQ/STOI scores to diagnose issues with audio processing.
A user is testing an audio processing app like Krisp and wants to see if it introduces echo or degrades speech quality. The skill computes PESQ and STOI between original and processed tracks, and runs autocorrelation to detect duplication.
A podcaster notices that their recordings sound 'hollow' or 'echoey'. Using the skill, they analyze loudness and spectral characteristics and run echo detection to determine whether room acoustics or a virtual audio cable is creating delayed copies of the signal.
A quality assurance manager wants to ensure call recordings meet speech quality standards. The skill provides EBU R128 loudness, SNR, and STOI metrics to flag recordings with poor intelligibility or excessive background noise.
A security analyst suspects a recording has duplicated audio segments (e.g., from a faulty DVR). The skill's autocorrelation analysis detects systematic peaks at consistent lags, indicating duplication.
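The autocorrelation check mentioned in the use cases above can be illustrated roughly as follows. This is a minimal sketch and not the skill's actual implementation: the function names, the 0.3 threshold, and the lag window are assumptions chosen for the demo, and it is written in pure Python to stay dependency-free (a real analysis would more likely use NumPy or SciPy on decoded audio). It flags a delayed copy when the normalized autocorrelation shows a strong peak away from lag zero.

```python
import random


def normalized_autocorrelation(samples, max_lag):
    """Return r[0..max_lag], scaled so that r[0] == 1.0 for a non-silent signal."""
    n = len(samples)
    mean = sum(samples) / n
    centered = [s - mean for s in samples]
    energy = sum(c * c for c in centered)
    if energy == 0:
        return [0.0] * (max_lag + 1)
    result = []
    for lag in range(max_lag + 1):
        acc = sum(centered[i] * centered[i + lag] for i in range(n - lag))
        result.append(acc / energy)
    return result


def find_echo_lag(samples, min_lag, max_lag, threshold=0.3):
    """Return (lag, strength) of the strongest peak in [min_lag, max_lag].

    Returns None when no peak reaches the (assumed) threshold. min_lag skips
    the trivial peak around lag 0, where any signal correlates with itself.
    """
    r = normalized_autocorrelation(samples, max_lag)
    best_lag = max(range(min_lag, max_lag + 1), key=lambda k: r[k])
    if r[best_lag] < threshold:
        return None
    return best_lag, r[best_lag]


# Synthetic demo: white noise plus a copy of itself delayed by 120 samples.
rng = random.Random(0)
dry = [rng.uniform(-1.0, 1.0) for _ in range(4000)]
delay = 120
echoed = [
    dry[i] + (0.6 * dry[i - delay] if i >= delay else 0.0)
    for i in range(len(dry))
]

print(find_echo_lag(echoed, 20, 300))  # strongest peak sits at the injected delay
print(find_echo_lag(dry, 20, 300))     # no peak above the threshold
```

Dividing the detected lag by the sample rate gives the delay in seconds. Real speech is far more self-correlated than white noise, so the threshold and minimum lag would need tuning on actual recordings.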
Provide a cloud service where users upload recordings and receive a detailed quality report. Includes recurring subscription tiers based on number of recordings and advanced metrics like PESQ.
Sell the skill as a package to enterprises for integration into their internal audio processing pipelines. Includes source code and support, with a perpetual license fee.
Offer expert analysis services for complex audio quality issues, including deep troubleshooting and optimization recommendations. Bill per project or hourly.
💬 Integration Tip
Wrap the skill as a REST API endpoint that accepts an audio file path and returns JSON metrics; integrate with CI/CD pipelines for automated testing of processed audio.
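One way to picture such a wrapper is the sketch below, which uses only the Python standard library. Everything in it is an assumption for illustration: the `/analyze?path=` route, the `analyze_recording` function, and the metrics it returns are hypothetical stand-ins (basic peak and RMS statistics for 16-bit PCM WAV), not the skill's real analysis or output schema.

```python
import json
import math
import struct
import wave
from http.server import BaseHTTPRequestHandler, HTTPServer
from urllib.parse import parse_qs, urlparse


def analyze_recording(path):
    """Hypothetical stand-in for the skill: basic stats for a 16-bit PCM WAV."""
    with wave.open(path, "rb") as wav:
        if wav.getsampwidth() != 2:
            raise ValueError("this sketch only handles 16-bit PCM WAV")
        rate = wav.getframerate()
        channels = wav.getnchannels()
        n_frames = wav.getnframes()
        raw = wav.readframes(n_frames)
    # Samples from all channels are interleaved and measured together.
    samples = struct.unpack("<%dh" % (len(raw) // 2), raw)
    peak = max((abs(s) for s in samples), default=0) / 32768.0
    rms = 0.0
    if samples:
        rms = math.sqrt(sum(s * s for s in samples) / len(samples)) / 32768.0
    return {
        "path": path,
        "duration_s": n_frames / rate,
        "channels": channels,
        "peak": round(peak, 4),
        "rms_dbfs": round(20 * math.log10(rms), 2) if rms > 0 else None,
    }


class MetricsHandler(BaseHTTPRequestHandler):
    def do_GET(self):
        url = urlparse(self.path)
        path = parse_qs(url.query).get("path", [None])[0]
        if url.path != "/analyze" or path is None:
            self._send(400, {"error": "use /analyze?path=<file>"})
            return
        try:
            self._send(200, analyze_recording(path))
        except (OSError, ValueError, wave.Error, EOFError) as exc:
            self._send(422, {"error": str(exc)})

    def _send(self, status, payload):
        body = json.dumps(payload).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(body)))
        self.end_headers()
        self.wfile.write(body)

    def log_message(self, format, *args):
        pass  # keep CI logs quiet


def make_server(port=0):
    """Bind to localhost only; port 0 lets the OS pick a free port."""
    return HTTPServer(("127.0.0.1", port), MetricsHandler)


# To run it: make_server(8000).serve_forever()
```

A CI job could then request `/analyze?path=...` for each processed file and fail the build when a returned metric crosses a chosen limit. Because the endpoint reads arbitrary local paths, the sketch binds to 127.0.0.1 rather than a public interface.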
Scored May 5, 2026
Local speech-to-text with the Whisper CLI (no API key).
ElevenLabs text-to-speech with mac-style say UX.
Transcribe audio via OpenAI Audio Transcriptions API (Whisper).
Text-to-speech conversion using node-edge-tts npm package for generating audio from text. Supports multiple voices, languages, speed adjustment, pitch control, and subtitle generation. Use when: (1) User requests audio/voice output with the "tts" trigger or keyword. (2) Content needs to be spoken rather than read (multitasking, accessibility, driving, cooking). (3) User wants a specific voice, speed, pitch, or format for TTS output.
Local text-to-speech via sherpa-onnx (offline, no cloud).
Start voice calls via the OpenClaw voice-call plugin.