research-paper-kbPersistent cross-session knowledge base for research papers. Ingest arXiv/DOI ā extract method, gap, threat level ā append to PAPERS.md. Never lose paper con...
Install via ClawdBot CLI:
clawdbot install IlkhamFY/research-paper-kbPersistent research paper knowledge base for AI agents.
Ingest any paper (arXiv URL, DOI, or title) and extract structured intelligence into a permanent PAPERS.md knowledge base that survives across sessions. Never lose context on a paper again.
Use this skill when the user:
https://arxiv.org/abs/2310.12345)10.1038/s41586-024-07156-8)PAPERS.md in the workspaceMEMORY.md with a pointer so future sessions know the KB existsAccept any of:
https://arxiv.org/abs/XXXX.XXXXX2310.12345 or 2310.12345v210.XXXX/...Normalize to arXiv ID or DOI before proceeding.
For arXiv papers ā fetch the abstract page:
https://arxiv.org/abs/<arxiv_id>
Extract: title, authors, date, abstract, subject categories.
Also fetch the Semantic Scholar API for structured metadata:
https://api.semanticscholar.org/graph/v1/paper/arXiv:<arxiv_id>?fields=title,authors,year,abstract,tldr,citationCount,influentialCitationCount,fieldsOfStudy
For DOI papers ā use Semantic Scholar:
https://api.semanticscholar.org/graph/v1/paper/<DOI>?fields=title,authors,year,abstract,tldr,citationCount,influentialCitationCount,externalIds
For title lookup:
https://api.semanticscholar.org/graph/v1/paper/search?query=<url_encoded_title>&fields=title,authors,year,abstract,externalIds&limit=1
From the abstract and any available full text, extract:
| Field | What to Extract |
|-------|----------------|
| Method | Core technical approach or contribution (1-2 sentences) |
| Gap they claim | What problem/limitation they say they're solving |
| Key results | Main quantitative or qualitative outcome |
| Overlap with user's work | Ask the user if context is unclear; or infer from prior PAPERS.md entries and MEMORY.md |
| Threat level | 1-5 scale: how much does this threaten the user's research? (1=unrelated, 5=directly competing) |
| Citation count | From Semantic Scholar |
| Related papers | Up to 3 highly-cited related papers from the same fetch |
Threat level guide:
Generate a clean BibTeX entry. Format:
For arXiv:
@article{<AuthorYEARkeyword>,
title = {<Full Title>},
author = {<Author1> and <Author2> and ...},
journal = {arXiv preprint arXiv:<id>},
year = {<year>},
url = {https://arxiv.org/abs/<id>},
note = {arXiv:<id>}
}
For published paper:
@article{<AuthorYEARkeyword>,
title = {<Full Title>},
author = {<Author1> and <Author2> and ...},
journal = {<venue>},
year = {<year>},
doi = {<doi>},
url = {https://doi.org/<doi>}
}
BibTeX key convention: FirstAuthorLastNameYearFirstContentWord (e.g., Smith2024diffusion)
Check if PAPERS.md exists in the workspace root. If not, create it with the header:
# PAPERS.md ā Research Paper Knowledge Base
> Auto-maintained by the `research-paper-kb` skill. Add papers with: "add this paper to my KB"
> Last updated: <date>
---
Append (never overwrite) the following entry template:
## [<Short Title>](<arxiv_or_doi_url>)
**Added:** <YYYY-MM-DD>
**Authors:** <Author1>, <Author2>, ...
**Venue:** <arXiv / Conference / Journal>
**Citations:** <N> (Semantic Scholar)
**Threat Level:** <1-5> ā <one-line reason>
### Method
<1-2 sentence description of the core technical contribution>
### Gap They Claim
<What problem/limitation they say they're solving>
### Key Results
<Main outcomes, benchmarks, or claims>
### Overlap With My Work
<How this relates to the user's research ā ask if unclear>
### Notes
<Any additional context the user provides, or leave blank>
### BibTeXbibtex
---
After writing to PAPERS.md, append or update the PAPERS.md pointer in MEMORY.md:
Find or create a section ## Research Paper KB:
## Research Paper KB
- PAPERS.md exists in workspace root ā <N> papers tracked as of <date>
- Latest addition: <Short title> (<threat level>/5)
- Run `research-paper-kb` to add more papers
If the section already exists, update the count and latest addition line.
Reply with a summary:
ā
Added to PAPERS.md
**[Paper Title]** (<year>)
- Authors: ...
- Threat level: X/5 ā <reason>
- BibTeX key: `AuthorYearWord`
PAPERS.md now has N papers. Run `show me my papers` to review.
When the user asks to review their paper KB:
PAPERS.md@article{...} blocks and present as a code blockWhen the user asks about a specific paper or topic:
PAPERS.md for matching title/author/keywords| Situation | Handling |
|-----------|----------|
| arXiv paper not found | Try Semantic Scholar title search; if still not found, ask user to confirm title |
| DOI behind paywall | Fetch abstract from DOI.org metadata (https://doi.org/ with Accept: application/json); note "full text unavailable" |
| Paper already in PAPERS.md | Detect by title/arXiv ID match; offer to update notes or threat level instead |
| User doesn't know their research area | Ask: "What's your research focus? I'll use this to assess overlap." Store in MEMORY.md |
| Semantic Scholar rate limit | Fall back to arXiv API: http://export.arxiv.org/api/query?id_list= |
This skill works best alongside:
citation-management ā for full BibTeX workflow and PubMed/Google Scholar searchbiorxiv-database ā for biology/life-science preprints (use to find papers to add)cs-research-methodology ā for identifying gaps and research proposals from your KBproactive-research (ClaWHub) ā can feed new papers into this KB automatically| File | Action |
|------|--------|
| PAPERS.md | Append new entry (create if missing) |
| MEMORY.md | Update ## Research Paper KB section |
Never modifies: SOUL.md, USER.md, AGENTS.md, TOOLS.md, or any project files.
User: "Add this to my KB: https://arxiv.org/abs/2310.06825"
Agent:
Jiang2023mistralname: research-paper-kb
version: 1.0.0
author: <your-github-handle>
category: Academic & Research
tags: [research, papers, arxiv, bibtex, knowledge-base, literature, academic, persistent-memory]
summary: Persistent cross-session knowledge base for research papers. Ingest arXiv/DOI ā extract method/gap/threat level ā append to PAPERS.md. Never lose paper context again.
requires: []
Generated Feb 23, 2026
A PhD student or postdoc uses this skill to systematically track and analyze relevant papers in their field. By ingesting arXiv or DOI links, they build a persistent knowledge base with structured summaries, threat levels, and BibTeX entries, ensuring they never lose context across research sessions and can quickly recall key insights.
A corporate R&D team employs this skill to monitor competitor publications and emerging technologies. They add papers via URLs or titles, extracting methods, gaps, and threat levels to assess competitive threats and identify opportunities for innovation, with all data stored in a shared PAPERS.md file for team access.
A researcher preparing a grant proposal uses this skill to compile and analyze relevant literature. By saving papers to the knowledge base, they automatically generate BibTeX entries and structured summaries, streamlining citation management and ensuring comprehensive coverage of related work for proposal submissions.
An academic group uses this skill to organize papers for journal clubs or seminars. Members add papers via triggers like 'save this paper', and the skill extracts key results and overlaps, creating a centralized PAPERS.md that facilitates discussion and follow-up across multiple sessions.
A tech startup founder leverages this skill to track academic papers relevant to their market. By ingesting papers on new methods or technologies, they assess threat levels and gaps to inform product development and competitive strategy, with persistent storage ensuring long-term insights are retained.
Offer a free tier with basic paper tracking and limited entries, and a paid subscription for advanced features like AI-powered gap analysis, team collaboration tools, and integration with reference managers. Revenue comes from monthly or annual subscriptions tailored to academic and corporate users.
Sell site-wide licenses to universities, research labs, or corporations, providing unlimited usage, custom integrations with institutional repositories, and dedicated support. Revenue is generated through one-time or recurring license fees based on the number of users or scale of deployment.
Monetize the underlying paper ingestion and extraction APIs, allowing developers to integrate the skill's capabilities into their own applications. Charge based on API call volume or offer tiered pricing plans, targeting edtech, publishing, and research tool developers.
š¬ Integration Tip
Ensure the workspace has write permissions for creating and updating PAPERS.md and MEMORY.md files, and verify internet access for API calls to arXiv and Semantic Scholar during paper ingestion.
Search, download, and summarize academic papers from arXiv. Built for AI/ML researchers.
Search and summarize papers from ArXiv. Use when the user asks for the latest research, specific topics on ArXiv, or a daily summary of AI papers.
Assistance with writing literature reviews by searching for academic sources via Semantic Scholar, OpenAlex, Crossref and PubMed APIs. Use when the user needs to find papers on a topic, get details for specific DOIs, or draft sections of a literature review with proper citations.
Baidu Scholar Search - Search Chinese and English academic literature (journals, conferences, papers, etc.)
Use this skill when users need to search academic papers, download research documents, extract citations, or gather scholarly information. Triggers include: requests to "find papers on", "search research about", "download academic articles", "get citations for", or any request involving academic databases like arXiv, PubMed, Semantic Scholar, or Google Scholar. Also use for literature reviews, bibliography generation, and research discovery. Requires OpenClawCLI installation from clawhub.ai.
Outcome-driven scientific publishing for AI agents. Publish research papers, hypotheses, and experiments with validated artifacts, structured claims, milestone tracking, and independent replications. Claim replication bounties, submit peer reviews, and collaborate with other AI researchers.