alicloud-ai-text-document-mind
Use Document Mind (DocMind) via Node.js SDK to submit document parsing jobs and poll results. Designed for Claude Code/Codex document understanding workflows.
Install via ClawdBot CLI:
clawdbot install cinience/alicloud-ai-text-document-mind
Category: provider
Use DocMind to extract document structure, text, and layout with async jobs.
npm install @alicloud/docmind-api20220711 @alicloud/tea-util @alicloud/credentials
Environment variables:
ALICLOUD_ACCESS_KEY_ID
ALICLOUD_ACCESS_KEY_SECRET
ALICLOUD_REGION_ID (optional default; if unset, choose the most reasonable region for the task or ask the user)
const Client = require('@alicloud/docmind-api20220711');
const Credential = require('@alicloud/credentials');
const Util = require('@alicloud/tea-util');

const cred = new Credential.default();
const regionId = process.env.ALICLOUD_REGION_ID || 'cn-hangzhou'; // Example default; choose/ask if unset.
const client = new Client.default({
  endpoint: `docmind-api.${regionId}.aliyuncs.com`,
  accessKeyId: cred.credential.accessKeyId,
  accessKeySecret: cred.credential.accessKeySecret,
  type: 'access_key',
  regionId,
});

async function submitByUrl(fileUrl, fileName) {
  const req = new Client.SubmitDocStructureJobRequest();
  req.fileUrl = fileUrl;
  req.fileName = fileName;
  const resp = await client.submitDocStructureJob(req);
  return resp.body.data.id;
}

async function pollResult(jobId) {
  const req = new Client.GetDocStructureResultRequest();
  req.id = jobId;
  const resp = await client.getDocStructureResult(req);
  return resp.body;
}

(async () => {
  const jobId = await submitByUrl('https://example.com/example.pdf', 'example.pdf');
  console.log('jobId:', jobId);
  // Poll every 10s until completed.
  for (;;) {
    const result = await pollResult(jobId);
    if (result.completed) {
      console.log(result.status, result.data || result.message);
      break;
    }
    await new Promise((r) => setTimeout(r, 10000));
  }
})();
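The loop above polls until the job completes, with no upper bound on attempts. A minimal sketch of a bounded variant follows, assuming a hypothetical helper named pollUntilCompleted that is not part of the SDK. It accepts any async function that returns the response body, so it could wrap pollResult(jobId). The defaults of 10000 ms and 120 attempts are borrowed from the documented defaults of DOCMIND_POLL_INTERVAL_MS and DOCMIND_MAX_POLLS; whether quickstart.js is implemented this way is not confirmed here.

```javascript
// Hypothetical helper (not an SDK function): poll until `completed` is true
// or the poll budget is exhausted.
// `fetchResult` is any async function returning the response body,
// for example () => pollResult(jobId).
async function pollUntilCompleted(fetchResult, options = {}) {
  const intervalMs = options.intervalMs ?? 10000; // assumed default
  const maxPolls = options.maxPolls ?? 120; // assumed default
  for (let attempt = 1; attempt <= maxPolls; attempt += 1) {
    const result = await fetchResult();
    if (result && result.completed) {
      return result;
    }
    // Wait between attempts, but not after the final one.
    if (attempt < maxPolls) {
      await new Promise((resolve) => setTimeout(resolve, intervalMs));
    }
  }
  throw new Error(`Job not completed after ${maxPolls} polls`);
}
```

With the quickstart in scope, a call might look like `await pollUntilCompleted(() => pollResult(jobId), { maxPolls: 60 })`. Throwing on exhaustion lets the caller decide whether to retry or report the job as stuck.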
DOCMIND_FILE_URL="https://example.com/example.pdf" \
node skills/ai/text/alicloud-ai-text-document-mind/scripts/quickstart.js
Environment variables:
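DOCMIND_FILE_URL
DOCMIND_FILE_NAME (optional)
DOCMIND_POLL_INTERVAL_MS (optional, default 10000)
DOCMIND_MAX_POLLS (optional, default 120)
const fs = require('fs');

// Submit a local file as a stream instead of passing a URL.
// Wrapped in an async function because CommonJS does not allow top-level await.
async function submitLocalFile(filePath, fileName) {
  const advanceReq = new Client.SubmitDocStructureJobAdvanceRequest();
  advanceReq.fileUrlObject = fs.createReadStream(filePath);
  advanceReq.fileName = fileName;
  const runtime = new Util.RuntimeOptions({});
  const resp = await client.submitDocStructureJobAdvance(advanceReq, runtime);
  return resp.body.data.id;
}
// Example: submitLocalFile('./example.pdf', 'example.pdf')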
Poll until completed=true.
UrlNotLegal: URL not publicly accessible or malformed.
DocProcessing: job still running; keep polling.
Fail: check message and error code for root cause.
@alicloud/docmind-api20220711
references/sources.md
Generated Mar 1, 2026
Law firms and legal departments can use DocMind to parse contracts, court filings, and legal briefs to extract structured text and layout for review and analysis. This automates manual document reading, saving time and reducing errors in legal workflows.
Banks and financial institutions can submit financial reports, statements, and invoices to DocMind for automated text extraction and structure parsing. This enables faster data entry into accounting systems and supports compliance auditing by digitizing paper-based documents.
Healthcare providers can use DocMind to parse medical records, lab reports, and patient forms to extract structured data for electronic health records (EHR). This improves data accuracy and streamlines patient information management in clinical settings.
Universities and research institutions can process academic papers, theses, and research documents with DocMind to extract text, tables, and references for analysis and citation management. This aids in literature reviews and data compilation for scholarly work.
Insurance companies can submit claim forms, policy documents, and damage reports to DocMind to extract key information like dates, amounts, and descriptions. This automates claim processing, reduces manual data entry, and speeds up reimbursement workflows.
Offer DocMind as a cloud-based service with tiered subscription plans based on usage volume, such as number of documents processed per month. This provides recurring revenue and scales with customer demand for document automation.
Sell access to DocMind's API endpoints on a pay-per-use basis, charging per document processed or API call. This model attracts developers and businesses needing flexible, on-demand document parsing without long-term commitments.
Provide custom integration services where DocMind is embedded into existing enterprise systems like CRM or ERP software. Revenue comes from one-time setup fees, ongoing support contracts, and licensing for proprietary integrations.
💬 Integration Tip
Ensure Alibaba Cloud credentials are properly set via environment variables and handle async polling with error checks to manage job statuses effectively.
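The status checks mentioned in this tip can be sketched as a small decision function. This is a hedged illustration, not SDK behavior: nextAction and its return labels are hypothetical names, and the code assumes the markers UrlNotLegal, DocProcessing, and Fail may appear in either the status or the code field of the response body, since the exact field is not specified above.

```javascript
// Hypothetical helper: decide what to do with a polled response body.
// Assumption: the marker strings may show up in `status` or in `code`.
function nextAction(body) {
  const markers = [body.status, body.code];
  if (markers.includes('UrlNotLegal')) {
    return 'fix-url'; // URL not publicly accessible or malformed
  }
  if (!body.completed || markers.includes('DocProcessing')) {
    return 'keep-polling'; // job still running
  }
  if (markers.includes('Fail')) {
    return 'inspect-error'; // check message and error code for root cause
  }
  return 'done';
}
```

A polling loop could call nextAction on each response body and stop on anything other than 'keep-polling', logging body.message when the result is 'inspect-error'.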