🧠 LLMs & Model APIs

Doubao Asrv0.18.3

Name: Doubao Asr
Author: vahnxu

doubao-asr

Transcribe recorded audio files to text via Doubao Seed-ASR 2.0 (豆包录音文件识别模型2.0) from ByteDance/Volcengine. Best-in-class Chinese speech recognition with spea...

fine-tuningprompt-engineering

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

1.8K

Stars

CreatedFeb 25, 2026

UpdatedMay 10, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install vahnxu/doubao-asr

https://www.volcengine.com/docs/6561/1354868

Skill Package5 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

A69/100

Grade Good — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation13/35

· 5 installs (low)
· 808 downloads (moderate demand)
· 4 stars

Documentation20/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Contains usage examples or trigger description
· Detailed summary

Package Completeness11/15

Security Analysis

💙 Low Risk

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://www.volcengine.com/docs/6561/1354868

Audited Apr 17, 2026 · audit v1.0

💡

Usage Guide

Generated Mar 20, 2026

Business professionals and teamsDevelopers and tech integratorsintermediate

💡 Application Scenarios

Meeting Transcription and Speaker DiarizationCorporate and Business Services

Transcribes recorded meetings from audio files (e.g., m4a, mp3) into text with speaker identification, enabling teams to review discussions and assign action items. Ideal for corporate environments where tracking who said what is crucial for follow-ups and documentation.

Voice Memo and Interview TranscriptionMedia, Education, and Research

Converts voice memos or recorded interviews into text for journalists, researchers, or students to analyze content and extract quotes. Supports various audio formats like wav and flac, making it useful for field recordings or personal notes.

Customer Service Call AnalysisCustomer Support and Telecommunications

Transcribes customer service calls to text with speaker separation, helping companies monitor interactions, identify common issues, and train staff. Enhances quality assurance by providing searchable transcripts for compliance and improvement.

Legal and Medical DocumentationLegal and Healthcare

Transcribes audio recordings of legal proceedings or medical consultations into accurate text records, aiding in documentation and case management. Ensures precise transcripts for archival and reference purposes in regulated industries.

Content Creation and Podcast ProductionEntertainment and Digital Media

Converts podcast or video audio tracks into text for subtitles, show notes, or content repurposing. Streamlines production workflows by providing editable transcripts that can be used for SEO and audience engagement.

💼 Business Models

Subscription-Based Transcription ServiceRecurring fees from subscriptions, potentially with overage charges

Offers monthly or annual plans for businesses to transcribe a set number of audio files, with tiered pricing based on usage volume. Generates recurring revenue by catering to regular needs like meeting recordings and customer calls.

Pay-Per-Use API IntegrationUsage-based fees calculated per audio minute or file

Provides API access for developers to integrate transcription into their apps, charging per minute of audio processed. Attracts tech companies and startups needing scalable, on-demand speech recognition without upfront costs.

Enterprise Licensing for Large OrganizationsHigh-value annual contracts with tailored service-level agreements

Sells customized licenses to corporations for unlimited transcription within their infrastructure, including support and compliance features. Targets industries like legal or healthcare with high-volume, sensitive audio processing needs.

💬 Integration Tip

Ensure all required environment variables are set correctly, especially the API key and TOS bucket details, to avoid upload and authentication errors during transcription.