🎤 Speech & Audio

MLX Audio Serverv0.2.2

Name: MLX Audio Server
Author: guoqiao

mlx-audio-server

Local 24x7 OpenAI-compatible API server for STT/TTS, powered by MLX on your Mac.

Apple SiliconMac miniMacBookapiasraudiocompatibleglmglm-asrglm-asr-nano-2512glm-asr-nano-2512-8bitlatestlocalmacOSmlxmlx-audiomlx-audio-serveropenaiopenai-compatibleserverspeech-to-textstttext-to-speechtranscriptiontts

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

3.1K

Stars

CreatedFeb 4, 2026

UpdatedMay 17, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install guoqiao/mlx-audio-server

Skill Package5 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

A67/100

Grade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation14/35

· 8 installs (low)
· 2221 downloads (high demand)

Documentation17/25

· SKILL.md present
· Moderate documentation (≥1500 chars)
· Contains usage examples or trigger description

Package Completeness11/15

· skillAssets present (4 files)
· Includes README/AGENTS doc

Security Analysis

💙 Low Risk

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://github.com/guoqiao/skills/blob/main/mlx-audio-server/mlx-audio-server/SK

Audited Apr 17, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 1, 2026

Developers and AI enthusiastsContent creators and media professionalsEducational institutions and researchersbeginner

💡 Application Scenarios

Local Podcast Transcription for Content CreatorsMedia and Entertainment

Podcasters and video creators can use this skill to transcribe audio or video files locally on their Mac without relying on cloud services, ensuring privacy and reducing costs. It's ideal for generating subtitles, show notes, or repurposing content into text-based formats like blog posts.

Accessibility Tool for Educational InstitutionsEducation

Schools and universities can deploy this on Mac mini servers to provide speech-to-text and text-to-speech services for students with disabilities, such as converting lecture recordings to text or creating audio versions of study materials. It offers a low-cost, on-premise solution that complies with data privacy regulations.

Voice-Enabled Prototyping for DevelopersSoftware Development

Developers building AI or voice applications can use this skill as a local, OpenAI-compatible API server to test speech recognition and synthesis features without internet dependency. It accelerates prototyping for apps like voice assistants, transcription tools, or interactive media on Apple Silicon devices.

Internal Meeting Transcription for Small BusinessesBusiness Services

Small teams can run this skill on a shared MacBook to transcribe internal meetings or customer calls locally, keeping sensitive discussions secure and avoiding subscription fees. The output can be used for minutes, action items, or training documentation.

💼 Business Models

Freemium Local API ServiceSubscription fees and one-time licenses

Offer a free version with basic STT/TTS models and charge for premium features like advanced models, higher accuracy, or commercial licensing. This targets developers and small businesses looking for cost-effective, privacy-focused alternatives to cloud APIs.

Bundled Hardware SolutionHardware sales and service fees

Partner with Apple resellers to pre-install this skill on Mac mini or MacBook devices sold as dedicated transcription or accessibility workstations. This provides an out-of-the-box solution for industries like education or healthcare, with support and maintenance contracts.

Custom Integration for EnterprisesProject-based fees and ongoing support

Provide consulting and integration services to large organizations needing tailored STT/TTS solutions, such as integrating with existing workflows or training custom models. This leverages the local, secure nature of the skill for compliance-heavy sectors like finance or legal.

💬 Integration Tip

Ensure ffmpeg and jq are installed via brew for audio processing, and use the provided scripts as examples to integrate STT/TTS into custom applications via the local API server.