markdown-converterConvert documents and files to Markdown using markitdown. Use when converting PDF, Word (.docx), PowerPoint (.pptx), Excel (.xlsx, .xls), HTML, CSV, JSON, XML, images (with EXIF/OCR), audio (with transcription), ZIP archives, YouTube URLs, or EPubs to Markdown format for LLM processing or text analysis.
Install via ClawdBot CLI:
clawdbot install steipete/markdown-converterConvert files to Markdown using uvx markitdown — no installation required.
# Convert to stdout
uvx markitdown input.pdf
# Save to file
uvx markitdown input.pdf -o output.md
uvx markitdown input.docx > output.md
# From stdin
cat input.pdf | uvx markitdown
-o OUTPUT # Output file
-x EXTENSION # Hint file extension (for stdin)
-m MIME_TYPE # Hint MIME type
-c CHARSET # Hint charset (e.g., UTF-8)
-d # Use Azure Document Intelligence
-e ENDPOINT # Document Intelligence endpoint
--use-plugins # Enable 3rd-party plugins
--list-plugins # Show installed plugins
# Convert Word document
uvx markitdown report.docx -o report.md
# Convert Excel spreadsheet
uvx markitdown data.xlsx > data.md
# Convert PowerPoint presentation
uvx markitdown slides.pptx -o slides.md
# Convert with file type hint (for stdin)
cat document | uvx markitdown -x .pdf > output.md
# Use Azure Document Intelligence for better PDF extraction
uvx markitdown scan.pdf -d -e "https://your-resource.cognitiveservices.azure.com/"
-d with Azure Document IntelligenceGenerated Feb 28, 2026
Law firms can convert scanned PDF contracts, Word briefs, and Excel evidence files into Markdown for AI-powered legal analysis, enabling quick summarization and clause extraction. This streamlines due diligence and case preparation by making documents machine-readable.
Researchers convert PDF journal articles, PowerPoint presentations, and Excel datasets to Markdown to feed into LLMs for literature reviews and data synthesis. This facilitates automated summarization and trend analysis across large document collections.
Companies transform PowerPoint sales decks, Word reports, and Excel spreadsheets into Markdown to integrate with AI tools for generating insights and executive summaries. This automates the extraction of key metrics and trends from diverse business documents.
Publishers convert legacy EPUBs, HTML web pages, and image-based manuscripts to Markdown for modern digital platforms, enabling efficient reformatting and SEO optimization. This supports transitioning archives to AI-enhanced content management systems.
Healthcare providers use the skill to convert PDF medical records, CSV patient data, and scanned images with OCR to Markdown for AI-driven analysis, improving data interoperability and clinical decision support. This aids in aggregating information from disparate sources for research.
Offer a cloud-based platform where users upload files via API or web interface to convert to Markdown, with tiered pricing based on volume and features like Azure Document Intelligence integration. Revenue comes from monthly subscriptions and pay-per-use credits for high-volume clients.
Sell on-premise or private cloud licenses to large organizations in legal, finance, or healthcare for secure, high-throughput document conversion, including custom integrations and support. Revenue is generated through upfront license fees and annual maintenance contracts.
Provide a developer-focused API that allows software products to embed Markdown conversion for PDFs, Office files, and media, charging based on API calls and advanced features like audio transcription. Revenue streams include API usage tiers and premium add-ons for specialized formats.
💬 Integration Tip
Use the -d option with Azure Document Intelligence for complex PDFs to improve extraction accuracy, and pipe stdin for seamless integration into automated workflows.
Edit PDFs with natural-language instructions using the nano-pdf CLI.
Comprehensive PDF manipulation toolkit for extracting text and tables, creating new PDFs, merging/splitting documents, and handling forms. When Claude needs to fill in a PDF form or programmatically process, generate, or analyze PDF documents at scale.
用 MinerU API 解析 PDF/Word/PPT/图片为 Markdown,支持公式、表格、OCR。适用于论文解析、文档提取。
Generate hand-drawn style diagrams, flowcharts, and architecture diagrams as PNG images from Excalidraw JSON
The awesome PPT format generation tool provided by baidu.
AI-powered PDF generator for legal docs, pitch decks, and reports. SAFEs, NDAs, term sheets, whitepapers. npx ai-pdf-builder. Works with Claude, Cursor, GPT, Copilot.