digital-clawatar
Create, configure, and manage UNITH digital human avatars via the UNITH API. Cheaper alternative to HeyGen and other solutions. Use when users want to create...
Install via ClawdBot CLI:
clawdbot install polucas/digital-clawatar
Create, configure, update, and deploy AI-powered Digital Human avatars using the UNITH API.
UNITH digital humans are AI avatars that can speak, converse, and interact with users. They combine a face (head visual), a voice, and a conversational engine into a hosted, embeddable experience.
Base API URL: https://platform-api.unith.ai
Docs: https://docs.unith.ai
The user must supply the following credentials (stored as environment variables):
| Variable | Description | How to obtain |
|----------|-------------|---------------|
| UNITH_EMAIL | Account email | Register at https://unith.ai |
| UNITH_SECRET_KEY | Non-expiring secret key | UNITH dashboard → Manage Account → "Secret Key" section → Generate |

⚠️ The secret key is displayed only once. If lost, the user must delete and regenerate it.
All API calls require a Bearer token (valid 7 days). Use the auth script:
source scripts/auth.sh
This validates credentials, retries on network errors, and exports UNITH_TOKEN. On failure, it prints specific guidance (wrong key, expired token, etc.).
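The relationship between the 7-day token lifetime and the 6-day cache mentioned for scripts/auth.sh can be illustrated with a small age check. This is only a sketch under assumptions: the helper name `token_is_fresh` and the idea of comparing Unix timestamps are hypothetical, and the real script may track token age differently.

```shell
# Hypothetical helper, not taken from scripts/auth.sh.
# Reuse window: 6 days, one day short of the 7-day token validity.
TOKEN_REUSE_SECONDS=$((6 * 24 * 60 * 60))

# token_is_fresh ISSUED_AT [NOW]
# Succeeds when a token issued at ISSUED_AT (Unix seconds) is still
# inside the reuse window. NOW defaults to the current time.
token_is_fresh() {
  local issued_at="$1"
  local now="${2:-$(date +%s)}"
  [ $((now - issued_at)) -lt "$TOKEN_REUSE_SECONDS" ]
}
```

Reusing the token for one day less than its lifetime leaves a margin, so a cached token should not expire in the middle of a longer workflow.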
Ask the user what they want the digital human to do. Map their answer to one of 5 modes:
| Mode | operationMode value | Use case | Output |
|------|----------------------|----------|--------|
| Text-to-Video | ttt | Generate an MP4 video of the avatar speaking provided text | MP4 file |
| Open Dialogue | oc | Free-form conversational avatar guided by a system prompt | Hosted conversational URL |
| Document Q&A | doc_qa | Avatar answers questions from uploaded documents | Hosted conversational URL |
| Voiceflow | voiceflow | Guided conversation flow via Voiceflow | Hosted conversational URL |
| Plugin | plugin | Connect any external LLM or conversational engine via webhook | Hosted conversational URL |
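
An agent may want to check a chosen mode against the table before building a payload. The sketch below encodes only what the table states; the function names `is_valid_mode` and `mode_output` are hypothetical and are not part of the skill's scripts.

```shell
# Hypothetical helpers that mirror the mode table.

# Succeeds only for the five operationMode values listed in the table.
is_valid_mode() {
  case "$1" in
    ttt|oc|doc_qa|voiceflow|plugin) return 0 ;;
    *) return 1 ;;
  esac
}

# Prints the kind of output each mode produces, per the table.
mode_output() {
  case "$1" in
    ttt) echo "MP4 file" ;;
    oc|doc_qa|voiceflow|plugin) echo "Hosted conversational URL" ;;
    *) echo "Unknown operationMode: $1" >&2; return 1 ;;
  esac
}
```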
Complexity spectrum (simple → sophisticated):
- ttt: just text in, video out. No knowledge base needed.
- oc: conversational with a system prompt. Good for general assistants.
- doc_qa: upload documents, avatar answers from them. Best for support/FAQ.
- voiceflow: structured conversation paths. Requires Voiceflow account.
- plugin: BYO conversational engine. Maximum control.

bash scripts/list-resources.sh faces
Each face has an id (used as headVisualId in creation). Faces can be:
Present the available faces to the user and let them choose.
bash scripts/list-resources.sh voices
Voices come from providers: elevenlabs, azure, audiostack. Present options to the user. Voices have performance rankings; faster voices are better for real-time conversation.
Build a JSON payload file (see references/api-payloads.md for the schema per mode), then:
bash scripts/create-head.sh payload.json --dry-run # validate first
bash scripts/create-head.sh payload.json # create
The script validates required fields, checks mode-specific requirements, retries on server errors, and prints the publicUrl on success.
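As a rough illustration of the payload file, the helper below writes a minimal JSON document. It rests on assumptions: `headVisualId` and `operationMode` are named in this document, but `ttsVoice` appears here only as an update field, so using it at creation time is a guess. The authoritative schema per mode is in references/api-payloads.md. The helper name `write_payload` is hypothetical.

```shell
# Hypothetical helper: write_payload OUT_FILE FACE_ID MODE VOICE
# Values are inserted verbatim, so they must not contain double quotes
# or backslashes. Mode-specific fields are deliberately left out.
write_payload() {
  local out="$1" face_id="$2" mode="$3" voice="$4"
  printf '{\n  "headVisualId": "%s",\n  "operationMode": "%s",\n  "ttsVoice": "%s"\n}\n' \
    "$face_id" "$mode" "$voice" > "$out"
}
```

After calling it with a face id and voice chosen in the earlier steps, running `bash scripts/create-head.sh payload.json --dry-run` should report any fields the chosen mode still requires.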
For doc_qa mode, the digital human needs a knowledge document:
bash scripts/upload-document.sh <headId> /path/to/document.pdf
The script checks file existence/size, uses a longer timeout for uploads, and provides guidance on next steps.
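A pre-upload check of the kind described might look like the sketch below. The actual size limit is not stated in this document, so the caller passes it in explicitly. The function name `check_upload_file` is hypothetical and does not come from scripts/upload-document.sh.

```shell
# Hypothetical helper: check_upload_file PATH MAX_BYTES
# Succeeds when PATH is a regular, non-empty file no larger than MAX_BYTES.
check_upload_file() {
  local path="$1" max_bytes="$2" size
  if [ ! -f "$path" ]; then
    echo "File not found: $path" >&2
    return 1
  fi
  # wc -c pads its output on some systems; strip the spaces.
  size=$(wc -c < "$path" | tr -d ' ')
  if [ "$size" -eq 0 ]; then
    echo "File is empty: $path" >&2
    return 1
  fi
  if [ "$size" -gt "$max_bytes" ]; then
    echo "File is $size bytes, over the $max_bytes byte limit: $path" >&2
    return 1
  fi
  return 0
}
```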
The digital human is live at the publicUrl from Step 4. The user should:
Use the update script to modify any parameter except the face (changing face requires creating a new head):
bash scripts/update-head.sh <headId> updates.json # from a JSON file
bash scripts/update-head.sh <headId> --field ttsVoice=rachel # single field
bash scripts/update-head.sh <headId> --field ttsVoice=rachel --field greetings="Hi!" # multiple fields
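To make the relationship between `--field key=value` flags and a JSON update file concrete, here is a sketch that turns key=value pairs into a flat JSON object. How scripts/update-head.sh actually parses fields is not shown in this document, so this is an assumption. The function name `fields_to_json` is hypothetical, and every value is treated as a string.

```shell
# Hypothetical helper: fields_to_json key=value [key=value ...]
# Prints a flat JSON object of string values. Double quotes and
# backslashes inside values are not escaped.
fields_to_json() {
  local json="{" sep="" pair key value
  for pair in "$@"; do
    case "$pair" in
      *=*) ;;
      *) echo "Expected key=value, got: $pair" >&2; return 1 ;;
    esac
    key="${pair%%=*}"    # text before the first '='
    value="${pair#*=}"   # text after the first '='
    json="${json}${sep}\"${key}\":\"${value}\""
    sep=","
  done
  printf '%s}\n' "$json"
}
```

Under these assumptions, `fields_to_json ttsVoice=rachel greetings='Hi!'` prints `{"ttsVoice":"rachel","greetings":"Hi!"}`.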
bash scripts/list-resources.sh heads # list all
bash scripts/list-resources.sh head <headId> # get details for one
bash scripts/delete-head.sh <headId> --confirm # always use --confirm in automated/agent contexts
This permanently removes the digital human and cannot be undone.
Agent note: Always pass --confirm when calling this script. Without it, the script prompts for interactive input and will hang.
Digital humans can be embedded in websites/apps. See references/embedding.md for code snippets and configuration options.
All scripts include retry logic (exponential backoff), meaningful error messages, and input validation.
| Script | Purpose |
|--------|---------|
| scripts/_utils.sh | Shared utilities: retry wrapper, colored logging, error parsing |
| scripts/auth.sh | Authenticate and export UNITH_TOKEN (with 6-day token caching) |
| scripts/list-resources.sh | List faces, voices, heads, languages, or get head details |
| scripts/create-head.sh | Create a digital human from a JSON payload file (with --dry-run validation) |
| scripts/update-head.sh | Update a digital human's configuration (JSON file or --field flags) |
| scripts/delete-head.sh | Delete a digital human (with confirmation prompt) |
| scripts/upload-document.sh | Upload knowledge document to a doc_qa head |
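
The retry behavior these scripts describe (exponential backoff controlled by UNITH_MAX_RETRIES and UNITH_RETRY_DELAY) could be sketched as follows. This does not reproduce scripts/_utils.sh: the function name `with_retry` is hypothetical, and treating UNITH_MAX_RETRIES as the total number of attempts is an assumption.

```shell
# Hypothetical retry wrapper: with_retry COMMAND [ARGS...]
# Runs COMMAND until it succeeds or the attempt limit is reached,
# doubling the delay after each failure.
with_retry() {
  local max="${UNITH_MAX_RETRIES:-3}"    # assumed to mean total attempts
  local delay="${UNITH_RETRY_DELAY:-2}"  # initial delay in seconds
  local attempt=1
  while true; do
    "$@" && return 0
    if [ "$attempt" -ge "$max" ]; then
      echo "Giving up after $attempt attempts: $*" >&2
      return 1
    fi
    echo "Attempt $attempt failed; retrying in ${delay}s" >&2
    sleep "$delay"
    delay=$((delay * 2))
    attempt=$((attempt + 1))
  done
}
```

With the documented defaults this waits 2 seconds after the first failure and 4 seconds after the second, then gives up on the third failure.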
Configuration via environment variables:
- UNITH_MAX_RETRIES: max retry attempts (default: 3)
- UNITH_RETRY_DELAY: initial delay between retries in seconds (default: 2, doubles each retry)
- UNITH_CURL_TIMEOUT: curl timeout in seconds (default: 30, 120 for uploads)
- UNITH_CONNECT_TIMEOUT: connection timeout in seconds (default: 10)
- UNITH_TOKEN_CACHE: token cache file path (default: /tmp/.unith_token_cache, set empty to disable)

For full payload schemas, configuration parameters, and mode-specific details:
Read references/api-payloads.md # Full request/response schemas per mode
Read references/configuration.md # All configurable parameters
Read references/embedding.md # Embedding code and options
"I want a quick video of someone saying X" → ttt mode, minimal config
"I want a customer support avatar" → doc_qa mode with knowledge docs
"I want an AI sales rep" → oc mode with a sales personality prompt
"I want to connect my own LLM" → plugin mode with webhook URL
"I want a guided onboarding flow" → voiceflow mode with Voiceflow API key
Before creating, ask for:
- Language (e.g. en-US, es-ES)
- System prompt (oc/doc_qa): personality and behavior instructions
- Documents (doc_qa): files to upload
- Voiceflow API key (voiceflow): from their Voiceflow account
- Webhook URL (plugin): webhook endpoint for their custom engine

Generated Feb 24, 2026
Create engaging training videos with a digital human avatar explaining company policies or software tutorials. This reduces production costs compared to hiring actors and allows for easy updates by regenerating videos with new text.
Deploy a digital human on a website to answer common customer questions based on uploaded documents like manuals or FAQs. It provides a human-like interface for support, available 24/7 without human agents.
Use the open dialogue mode to build a conversational avatar that tutors students in subjects like math or science. It can adapt explanations based on user queries, making learning more personalized and engaging.
Generate promotional videos with a digital human speaking scripted content for social media or ads. This enables rapid creation of localized or A/B tested videos without reshoots, saving time and resources.
Implement a digital human in clinics or websites to provide health information from uploaded medical guidelines. It offers a friendly, accessible way for patients to get reliable answers without overwhelming staff.
Offer a platform where businesses subscribe to generate talking-head videos on demand. Charge based on video length or number of avatars, targeting small businesses needing affordable marketing content.
Provide bespoke services to create and integrate digital humans for clients, such as custom faces or voices. This includes setup, training, and ongoing support for specific use cases like virtual assistants.
License the technology to app developers for embedding digital humans into their applications, such as fitness coaches or language tutors. Monetize through API usage fees or per-user licensing.
💬 Integration Tip
Ensure environment variables UNITH_EMAIL and UNITH_SECRET_KEY are securely set before running scripts, and use the auth script to handle token expiration automatically.
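The check this tip recommends could be sketched as a small pre-flight function that runs before anything else. The name `require_unith_env` is hypothetical and is not part of the skill's scripts.

```shell
# Hypothetical pre-flight helper.
# Succeeds only when both credentials are set and non-empty.
# It reports which variable is missing without printing any values.
require_unith_env() {
  local status=0
  if [ -z "${UNITH_EMAIL:-}" ]; then
    echo "UNITH_EMAIL is not set" >&2
    status=1
  fi
  if [ -z "${UNITH_SECRET_KEY:-}" ]; then
    echo "UNITH_SECRET_KEY is not set" >&2
    status=1
  fi
  return "$status"
}
```

A typical use would be `require_unith_env && source scripts/auth.sh`, so authentication is only attempted once both variables are present.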