mikwielgus / forum-dl
Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
☆80Updated 7 months ago
Alternatives and similar repositories for forum-dl:
Users that are interested in forum-dl are comparing it to the libraries listed below
- A self-hosted bookmark database with full-text page content search☆89Updated last year
- Reddit archiver☆164Updated last year
- A Python Reddit API Wrapper (PRAW) script to download all of the accessible wiki pages of a Reddit subreddit☆47Updated 4 months ago
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)☆126Updated 2 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆52Updated this week
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆115Updated last month
- Creates a complete full text historical archive for an RSS or ATOM feed.☆113Updated this week
- Chrome Extension for Hacker News and Reddit Links☆31Updated last year
- Selenium Open Source Search Engine & crawler☆102Updated this week
- Downloads content from reddit☆19Updated last year
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.☆20Updated 3 months ago
- A gui to control and manage snapcast written in python☆12Updated 3 weeks ago
- WIP - scripts for analyzing the (in)security of Chrome extensions☆26Updated 10 months ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆148Updated 2 weeks ago
- Self-hostable link database☆78Updated this week
- Home of the official docker image for ArchiveBox☆50Updated 2 months ago
- Get news from foreign RSS feeds translated, summarized, and spoken to you on demand.☆55Updated 2 months ago
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆65Updated last week
- Command-line program for organizing and managing ebook collections. It is a Python port from the original shell scripts ebook-tools☆23Updated 9 months ago
- Clean up websites to be reader friendly☆19Updated last year
- Tube Archivist Companion for your Browser☆166Updated this week
- ☆36Updated 2 months ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆39Updated 5 months ago
- 🦛 scrapes websites and generates rss feeds☆53Updated last week
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 …☆63Updated last month
- Database of Internet places. Mostly domains☆82Updated this week
- A privacy focused, self-hosted podcatcher.☆36Updated last month
- Quick and dirty script to suck down the pr0n from Reddit before it's too late☆83Updated last year