wechat-article-extractor-skillExtract metadata and content from WeChat Official Account articles. Use when user needs to parse WeChat article URLs (mp.weixin.qq.com), extract article info...
Install via ClawdBot CLI:
clawdbot install freestylefly/wechat-article-extractor-skillExtract metadata and content from WeChat Official Account (微信公众号) articles.
mp.weixin.qq.com)const { extract } = require('./scripts/extract.js');
const result = await extract('https://mp.weixin.qq.com/s?__biz=...');
// Returns: { done: true, code: 0, data: {...} }
const html = await fetch(url).then(r => r.text());
const result = await extract(html, { url: sourceUrl });
const result = await extract(url, {
shouldReturnContent: true, // Return HTML content (default: true)
shouldReturnRawMeta: false, // Return raw metadata (default: false)
shouldFollowTransferLink: true, // Follow migrated account links (default: true)
shouldExtractMpLinks: false, // Extract embedded mp.weixin links (default: false)
shouldExtractTags: false, // Extract article tags (default: false)
shouldExtractRepostMeta: false // Extract repost source info (default: false)
});
{
done: true,
code: 0,
data: {
// Account info
account_name: "公众号名称",
account_alias: "微信号",
account_avatar: "头像URL",
account_description: "功能介绍",
account_id: "原始ID",
account_biz: "biz参数",
account_biz_number: 1234567890,
account_qr_code: "二维码URL",
// Article info
msg_title: "文章标题",
msg_desc: "文章摘要",
msg_content: "HTML内容",
msg_cover: "封面图URL",
msg_author: "作者",
msg_type: "post", // post|video|image|voice|text|repost
msg_has_copyright: true,
msg_publish_time: Date,
msg_publish_time_str: "2024/01/15 10:30:00",
// Link params
msg_link: "文章链接",
msg_source_url: "阅读原文链接",
msg_sn: "sn参数",
msg_mid: 1234567890,
msg_idx: 1
}
}
{
done: false,
code: 1001,
msg: "无法获取文章信息"
}
| Code | Message | Description |
|------|---------|-------------|
| 1000 | 文章获取失败 | General failure |
| 1001 | 无法获取文章信息 | Missing title or publish time |
| 1002 | 请求失败 | HTTP request failed |
| 1003 | 响应为空 | Empty response |
| 1004 | 访问过于频繁 | Rate limited |
| 1005 | 脚本解析失败 | Script parsing error |
| 1006 | 公众号已迁移 | Account migrated |
| 2001 | 请提供文章内容或链接 | Missing input |
| 2002 | 链接已过期 | Link expired |
| 2003 | 内容涉嫌侵权 | Content removed (copyright) |
| 2004 | 无法获取迁移后的链接 | Migration link failed |
| 2005 | 内容已被发布者删除 | Content deleted by author |
| 2006 | 内容因违规无法查看 | Content blocked |
| 2007 | 内容发送失败 | Failed to send |
| 2008 | 系统出错 | System error |
| 2009 | 不支持的链接 | Unsupported URL |
| 2010 | 内容获取失败 | Content fetch failed |
| 2011 | 涉嫌过度营销 | Marketing/spam content |
| 2012 | 账号已被屏蔽 | Account blocked |
| 2013 | 账号已自主注销 | Account deleted |
| 2014 | 内容被投诉 | Content reported |
| 2015 | 账号处于迁移流程中 | Account migrating |
| 2016 | 冒名侵权 | Impersonation |
Required npm packages:
cheerio - HTML parsingdayjs - Date formattingrequest-promise - HTTP requestsqs - Query string parsinglodash.unescape - HTML entitiesweixin.sogou.com)Generated Mar 1, 2026
Media companies and PR agencies can use this skill to automatically extract and structure articles from WeChat Official Accounts for competitive analysis, trend tracking, and reporting. It enables efficient monitoring of industry news, competitor activities, and brand mentions across WeChat's vast ecosystem.
Researchers and academics can utilize this skill to gather and analyze WeChat articles as primary sources for studies on social media, communication, or cultural trends. It helps in extracting metadata and content systematically for qualitative or quantitative analysis, saving time on manual data entry.
Organizations and individuals can employ this skill to archive WeChat articles for compliance, record-keeping, or content backup purposes. It ensures that important posts, announcements, or marketing materials are preserved in a structured format, even if the original links expire or content is deleted.
Marketing teams can extract articles from WeChat to repurpose content for other platforms like blogs, newsletters, or social media. By parsing HTML and metadata, they can quickly reformat and adapt posts to reach broader audiences while maintaining accuracy and attribution.
Businesses with WeChat Official Accounts can use this skill to automatically extract and organize articles into a searchable knowledge base for customer support. It helps in providing quick access to FAQs, tutorials, and updates, improving response times and customer satisfaction.
Offer a cloud-based platform where users can input WeChat article URLs to receive structured data via API or dashboard. Charge based on usage tiers, such as number of extractions per month, with premium features like batch processing and advanced analytics.
Provide custom integration services for large organizations to embed this skill into their internal systems, such as CRM or content management platforms. Offer one-time setup fees and ongoing support contracts for tailored solutions and compliance with specific data handling requirements.
Develop a free web tool for basic article extraction to attract individual users and small businesses. Monetize through premium add-ons like historical data access, export to multiple formats, or integration with third-party apps, encouraging upgrades for advanced functionality.
💬 Integration Tip
Ensure API keys or authentication are handled securely, and implement error handling for common issues like rate limits or expired links to maintain reliability.
iMessage/SMS CLI for listing chats, history, watch, and sending.
Use when you need to control Discord from Clawdbot via the discord tool: send messages, react, post or upload stickers, upload emojis, run polls, manage threads/pins/search, fetch permissions or member/role/channel info, or handle moderation actions in Discord DMs or channels.
Use when you need to control Slack from Clawdbot via the slack tool, including reacting to messages or pinning/unpinning items in Slack channels or DMs.
Send WhatsApp messages to other people or search/sync WhatsApp history via the wacli CLI (not for normal user chats).
Build or update the BlueBubbles external channel plugin for Clawdbot (extension package, REST send/probe, webhook inbound).
OpenClaw skill for designing Telegram Bot API workflows and command-driven conversations using direct HTTPS requests (no SDKs).