📄 Documents & Office

pdf-ocr-layoutv1.0.2

Name: pdf-ocr-layout
Author: baokui

pdf-ocr-layout

基于智谱 GLM-OCR、GLM-4.7 及 GLM-4.6V 的多模态文档深度解析工具。 Use when: - 需要高精度提取文档（PDF/图片）中的表格并转换为 Markdown 格式 - 需要从文档页面中自动裁剪并提取插图、图表为独立文件 - 需要对提取的图表进行深度语义理解（基于 GLM-4.6V 视觉分析） - 需要对提取的表格数据进行逻辑分析（基于 GLM-4.7 文本分析）核心架构： 1. 视觉提取：GLM-OCR 2. 语义理解：GLM-4.7 (纯文本/表格) + GLM-4.6V (多模态/图像)

latest

Download Package View on ClawHub

Installs (all time)

Installs (current)

Downloads

2.2K

Stars

CreatedFeb 10, 2026

UpdatedMay 17, 2026

Install & Quick Start

Install via ClawdBot CLI:

clawdbot install baokui/pdf-ocr-layout

Skill Package5 files

📋SKILL.mdmarkdown

Failed to load file.

Quality Score

B59/100

Grade Fair — based on market validation, documentation quality, package completeness, maintenance status, and authenticity signals.

Market Validation10/35

· 2 installs (very low)
· 1081 downloads (moderate demand)
· 1 stars

Documentation16/25

· SKILL.md present
· Detailed documentation (≥3000 chars)
· Detailed summary

Package Completeness8/15

· skillAssets present (4 files)

💡

Usage Guide

Generated Mar 1, 2026

Data AnalystsDocument Management SpecialistsAI Integration Developersintermediate

💡 Application Scenarios

Financial Report AnalysisFinance and Accounting

Extract and convert tables from quarterly financial PDF reports to Markdown for automated data entry into accounting systems, while analyzing charts for revenue trends using GLM-4.6V to generate insights on performance metrics.

Academic Research Paper ProcessingEducation and Academia

Process research papers in PDF format to extract tables of experimental data as Markdown for database integration, and analyze charts with GLM-4.6V to summarize visual findings in context of the full text for literature reviews.

Legal Document ReviewLegal Services

Analyze legal contracts or case documents to extract tables of terms or schedules as Markdown for contract management systems, and interpret diagrams or exhibits with GLM-4.6V to assess visual evidence in legal contexts.

Healthcare Report DigitizationHealthcare

Convert medical reports or lab results from scanned images to extract patient data tables as Markdown for electronic health records, and analyze medical charts or imaging results with GLM-4.6V to aid in diagnostic summaries.

Business Intelligence Dashboard CreationBusiness Consulting

Process business documents like sales reports to extract performance tables as Markdown for integration into BI tools, and analyze infographics with GLM-4.6V to generate automated insights on market trends and visual data representations.

💼 Business Models

SaaS SubscriptionRecurring subscription fees

Offer the tool as a cloud-based service with tiered pricing based on usage volume, targeting enterprises for automated document processing and analysis, generating recurring revenue through monthly or annual subscriptions.

API LicensingPer-call fees or enterprise licenses

License the API to software developers and integrators for embedding into custom applications, such as CRM or ERP systems, charging per API call or through enterprise licensing agreements for scalable deployment.

Consulting and CustomizationProject-based and maintenance fees

Provide tailored solutions and integration services for specific industries, offering customization of the pipeline for unique document formats and training support, with revenue from project-based fees and ongoing maintenance contracts.

💬 Integration Tip

Ensure the ZHIPU_API_KEY is securely configured and test with sample documents to validate output formats before full deployment in production environments.