reddit-archive
Download and archive Reddit posts, including images, GIFs, and videos, from specified users or subreddits, with filtering and sorting options.
Install via ClawdBot CLI:
clawdbot install terellison/reddit-archive
_Download and archive Reddit posts (images, GIFs, videos) from users or subreddits._
This script automatically checks for and installs its dependencies on first run:
If missing, it will attempt to install them via pip install --user. You can also:
pip3 install requests yt-dlp
export YTDLP_PATH=/your/custom/path/yt-dlp
You want to archive content from Reddit, either from a specific user (u/username) or a subreddit (r/subname).
python3 ~/path/to/reddit_archive.py [options]
| Flag | Description | Default |
|------|-------------|---------|
| -u, --user | Reddit username (either this OR --subreddit required) | - |
| -s, --subreddit | Subreddit name (either this OR --user required) | - |
| -o, --output | Output directory | ~/temp/.reddit_ |
| --sort | Sort order: hot, new, rising, top, controversial | hot |
| --time | Time filter for top/controversial: hour, day, week, month, year, all | - |
| --after | Start date (YYYY-MM-DD) | No filter |
| --before | End date (YYYY-MM-DD) | No filter |
| --limit | Max posts to fetch (0 = unlimited) | 0 |
| --images | Download images (jpg, png, webp) | |
| --gifs | Download GIFs/videos (gfycat, redgifs, imgur) | |
| --skip-existing | Skip already-downloaded files | |
| --workers | Parallel download workers | 4 |
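The flag table above could be declared with `argparse` roughly as follows. This is a minimal sketch, not the script's actual parser: treating `--images`, `--gifs`, and `--skip-existing` as boolean switches, parsing dates into `date` objects, and leaving `--output` without a default are all assumptions made for illustration.

```python
import argparse
from datetime import date


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented flags."""
    parser = argparse.ArgumentParser(prog="reddit_archive.py")

    # Exactly one of --user / --subreddit must be given.
    target = parser.add_mutually_exclusive_group(required=True)
    target.add_argument("-u", "--user", help="Reddit username")
    target.add_argument("-s", "--subreddit", help="Subreddit name")

    # Default left unset here; the documented default is handled elsewhere.
    parser.add_argument("-o", "--output", help="Output directory")
    parser.add_argument(
        "--sort",
        choices=["hot", "new", "rising", "top", "controversial"],
        default="hot",
    )
    parser.add_argument(
        "--time",
        choices=["hour", "day", "week", "month", "year", "all"],
        help="Time filter for top/controversial",
    )
    # date.fromisoformat raises ValueError on bad input, which argparse
    # turns into a normal usage error.
    parser.add_argument("--after", type=date.fromisoformat, help="YYYY-MM-DD")
    parser.add_argument("--before", type=date.fromisoformat, help="YYYY-MM-DD")
    parser.add_argument("--limit", type=int, default=0, help="0 = unlimited")

    # Assumed to be on/off switches.
    parser.add_argument("--images", action="store_true")
    parser.add_argument("--gifs", action="store_true")
    parser.add_argument("--skip-existing", action="store_true")
    parser.add_argument("--workers", type=int, default=4)
    return parser
```

Using a mutually exclusive group with `required=True` lets `argparse` enforce the "either this OR the other" rule for `--user` and `--subreddit` without extra validation code.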
# All posts from a user
python3 reddit_archive.py -u someuser
# Subreddit with date range
python3 reddit_archive.py -s orlando --after 2025-01-01 --before 2025-12-31
# Top 10 most upvoted posts of all time from a subreddit
python3 reddit_archive.py -s funny --sort top --time all --limit 10
# New posts only
python3 reddit_archive.py -s orlando --sort new
# GIFs only, specific user
python3 reddit_archive.py -u someguy --gifs
# Custom output dir
python3 reddit_archive.py -u someuser -o ~/Downloads/reddit_archive
Downloads are saved to the output directory with the following structure:
output_directory/
├── Pictures/
│   ├── {target}_{post_id}.jpg
│   ├── {target}_{post_id}.png
│   └── ...
└── Videos/
    ├── {target}_{post_id}.mp4
    └── ...
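The layout above can be expressed as a small path helper. This is a sketch under stated assumptions: the names `destination_for` and `should_download` are hypothetical, and routing every non-image extension to `Videos/` is a guess based on the tree shown, not confirmed behavior of the script.

```python
from pathlib import Path

# Image extensions taken from the --images flag description (plus .jpeg).
IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}


def destination_for(output_dir, target, post_id, ext):
    """Return the path {output_dir}/{Pictures|Videos}/{target}_{post_id}{ext}."""
    ext = ext.lower()
    if not ext.startswith("."):
        ext = "." + ext
    # Assumption: anything that is not a known image type goes to Videos/.
    folder = "Pictures" if ext in IMAGE_EXTS else "Videos"
    return Path(output_dir) / folder / f"{target}_{post_id}{ext}"


def should_download(path, skip_existing):
    """With skip_existing set, a file already on disk is not fetched again."""
    return not (skip_existing and Path(path).exists())
```

For example, `destination_for("out", "funny", "abc123", "jpg")` yields `out/Pictures/funny_abc123.jpg` on a POSIX system.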
The skill is organized as:
reddit-archive/
├── SKILL.md                 # This file
└── scripts/
    ├── reddit_archive.py    # Main downloader script
    └── requirements.txt     # Python dependencies
requests with a proper User-Agent header
(/user/{name}/submitted.json or /r/{name}/hot.json)
media_metadata
yt-dlp
(created_utc)
Generated Feb 25, 2026
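The endpoint patterns and the `created_utc` field mentioned above suggest how a listing URL and a date filter might be built. The sketch below is illustrative only: `listing_url` and `in_date_range` are hypothetical names, the inclusive date bounds are an assumption, and no network request is made. The query parameters `sort`, `t`, `limit`, and `after` are the ones Reddit's public listing endpoints accept.

```python
from datetime import date, datetime, timezone
from urllib.parse import urlencode

BASE = "https://www.reddit.com"


def listing_url(user=None, subreddit=None, sort="hot",
                time_filter=None, limit=100, after_token=None):
    """Build a JSON listing URL for either a user or a subreddit."""
    if bool(user) == bool(subreddit):
        raise ValueError("give exactly one of user or subreddit")
    if user:
        # User submissions take the sort order as a query parameter.
        path = f"/user/{user}/submitted.json"
        params = {"sort": sort}
    else:
        # Subreddit listings put the sort order in the path.
        path = f"/r/{subreddit}/{sort}.json"
        params = {}
    if time_filter:
        params["t"] = time_filter
    params["limit"] = limit
    if after_token:
        # Pagination cursor returned by the previous page.
        params["after"] = after_token
    return f"{BASE}{path}?{urlencode(params)}"


def in_date_range(created_utc, after=None, before=None):
    """Check a post's created_utc (epoch seconds) against optional dates.

    Bounds are treated as inclusive calendar days in UTC (an assumption).
    """
    day = datetime.fromtimestamp(created_utc, tz=timezone.utc).date()
    if after is not None and day < after:
        return False
    if before is not None and day > before:
        return False
    return True
```

A real request to these URLs should send a descriptive User-Agent header, as noted above.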
Marketing agencies can use this skill to archive Reddit posts from specific subreddits or users for competitive analysis, trend tracking, and content curation. By downloading images, GIFs, and videos, agencies can build a repository of user-generated content to inform campaigns and monitor brand mentions. Date filtering allows analysis of content over specific periods, aiding in performance evaluation.
Researchers studying online behavior can archive Reddit data from subreddits to analyze visual content trends, meme propagation, or community engagement. The skill's ability to filter by date, sort order, and content type enables systematic data collection for qualitative or quantitative studies. This supports research in sociology, media studies, or digital humanities without manual scraping.
Content creators, such as YouTubers or bloggers, can archive Reddit posts from relevant subreddits to gather inspiration, source materials, or track viral content. By downloading media files into organized directories, they can efficiently access assets for video editing, articles, or social media posts. The skip-existing feature prevents redundant downloads, saving time.
Corporations can use this skill to archive Reddit posts related to their brand, products, or industry for compliance, legal evidence, or audit trails. By setting date ranges and limiting posts, legal teams can capture specific instances of user feedback, complaints, or intellectual property issues. This aids in risk management and regulatory adherence.
Moderators of online platforms or forums can archive Reddit content from users or subreddits for backup purposes, content migration, or moderation reference. The skill's parallel workers and media type options allow efficient downloading of large volumes of posts, helping maintain community archives or transition to new systems.
Offer a cloud-based service where users input Reddit targets and receive archived content via a web interface or API, with automated scheduling and storage. Revenue comes from subscription tiers based on data volume, frequency, or advanced features like analytics. This model targets businesses needing regular archival without local setup.
Provide consulting services to integrate this skill into clients' existing workflows, such as marketing dashboards or research tools, with customization for specific use cases. Revenue is generated through project-based fees or retainer contracts for ongoing support and updates. This model suits organizations lacking technical expertise.
Distribute the skill as a free open-source tool with basic functionality, while offering premium features like batch processing, enhanced rate limiting, or priority support for a fee. Revenue streams include one-time purchases for advanced versions or donations from power users. This model builds a community while monetizing advanced needs.
💬 Integration Tip
Integrate this skill into automated workflows by scheduling script runs via cron jobs or task schedulers, and ensure proper environment variables are set for dependencies like yt-dlp to avoid installation issues.
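A scheduled run along these lines could be wrapped in a short Python launcher. This is a sketch, assuming the script path, subreddit, and output directory shown are placeholders you would replace; the helper names are hypothetical, and only flags from the table above are used.

```python
import os
import subprocess
import sys


def build_command(script, subreddit, output_dir):
    """Assemble the archive command using documented flags only."""
    return [
        sys.executable, str(script),
        "-s", subreddit,
        "--sort", "new",
        "--skip-existing",
        "-o", str(output_dir),
    ]


def build_env(ytdlp_path=None):
    """Copy the current environment and optionally pin YTDLP_PATH.

    Cron jobs start with a minimal environment, so setting the variable
    explicitly avoids depending on the interactive shell's profile.
    """
    env = dict(os.environ)
    if ytdlp_path:
        env["YTDLP_PATH"] = str(ytdlp_path)
    return env


def run_archive(script, subreddit, output_dir, ytdlp_path=None):
    """Run the archiver once and return its exit code."""
    cmd = build_command(script, subreddit, output_dir)
    return subprocess.run(cmd, env=build_env(ytdlp_path), check=False).returncode


# Example crontab entry (hypothetical paths), running daily at 03:00:
#   0 3 * * * /usr/bin/python3 /path/to/run_archive.py
```

Calling `run_archive("reddit_archive.py", "orlando", "archive", ytdlp_path="/your/custom/path/yt-dlp")` from the launcher script would then perform one incremental run.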