🛠️ Utilities & Tools

MinerU PDF Parserv1.0.1

Name: MinerU PDF Parser
Author: EasonAI-5589

mineru

EasonAI-5589

用 MinerU API 解析 PDF/Word/PPT/图片为 Markdown，支持公式、表格、OCR。适用于论文解析、文档提取。

document-processing

Download Package View on ClawHub

Installs (all time)

191

Installs (current)

Downloads

5.5K

Stars

CreatedFeb 6, 2026

UpdatedFeb 28, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install EasonAI-5589/mineru

Skill Package1 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

B64/100

Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation18/35

· 19 installs (average)
· 2398 downloads (high demand)
· 4 stars

Documentation15/25

· SKILL.md present
· Detailed documentation (≥3000 chars)

Package Completeness6/15

· skillAssets present (0 files)

Security Analysis

💙 Low Risk

UNKNOWN_DATA_SINKhigh

Sends data to undocumented external endpoint (potential exfiltration)

POST → https://mineru.net/api/v4/extract/task

UNDOCUMENTED_EXTERNALlow

Calls external URL not in known-safe list

https://mineru.net/

KNOWN_EXTERNALlow

Uses known external API (expected, informational)

arxiv.org

AI Analysis

The skill interacts with a documented external API (MinerU) for its stated purpose of document parsing, which is consistent with its description. While it sends user documents to an external service, this is expected functionality for a document parsing tool, and the API endpoint is clearly documented as part of the service. No credential harvesting, hidden instructions, or obfuscation patterns were detected.

💡

Usage Guide

Generated Feb 23, 2026

Researchers and AcademicsLegal ProfessionalsBusiness Analystsbeginner

💡 Application Scenarios

Academic Research Paper AnalysisEducation and Research

Researchers can automatically parse arXiv PDFs into structured Markdown with LaTeX formulas and tables preserved, enabling quick literature reviews and data extraction without manual copying. This is ideal for summarizing papers, building knowledge bases, or preparing annotated bibliographies.

Legal Document ProcessingLegal Services

Law firms can convert scanned contracts, Word documents, or PDFs into searchable Markdown text while retaining complex layouts and tables, streamlining document review and analysis for cases or compliance checks. OCR support handles mixed-language content in legal materials.

Business Report GenerationFinance and Consulting

Companies can extract data from financial reports, PowerPoint presentations, and Word documents to create structured summaries or integrate content into databases, improving efficiency in reporting and decision-making processes. Batch processing allows handling multiple quarterly reports at once.

Content Digitization for ArchivesCultural Heritage

Libraries or museums can digitize historical documents, books, and images by converting them into Markdown with OCR, preserving formulas and tables for digital archives or online publications. This supports heritage preservation and accessibility initiatives.

Technical Documentation ConversionTechnology and Engineering

Engineering teams can parse technical manuals, diagrams in PDFs, or PPT slides into Markdown to update documentation, extract specifications, or feed into knowledge management systems, ensuring accurate retention of complex tables and formulas.

💼 Business Models

API-as-a-Service SubscriptionRecurring subscription fees

Offer tiered subscription plans based on usage quotas, such as number of pages or files processed per month, with premium tiers for higher concurrency or advanced features like VLM model access. Revenue comes from recurring payments from businesses and researchers.

Enterprise LicensingOne-time license fees plus annual support contracts

Provide custom licenses to large organizations for on-premise deployment or dedicated API instances, including support, training, and integration services. This targets industries like legal or finance with high-volume, secure document processing needs.

Pay-per-Use MicrotransactionsTransaction-based fees

Implement a usage-based pricing model where users pay per document or page processed, appealing to occasional users or small projects. Integrate with platforms like OpenClaw for seamless billing and low-barrier access to document parsing capabilities.

💬 Integration Tip

Set the MINERU_TOKEN environment variable in OpenClaw config for easy authentication, and use batch processing to optimize API quota when handling multiple files.