mikwielgus / forum-dlLinks
Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
☆105Updated last year
Alternatives and similar repositories for forum-dl
Users that are interested in forum-dl are comparing it to the libraries listed below
Sorting:
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆102Updated last month
- A self-hosted bookmark database with full-text page content search☆96Updated 5 months ago
- Reddit archiver☆180Updated last year
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆18Updated last year
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆73Updated 8 months ago
- Downloads content from reddit☆22Updated 2 years ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆130Updated 2 months ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.☆376Updated 6 months ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆133Updated 7 months ago
- Reddit takeout: export your account data as JSON: comments, submissions, upvotes etc. 🦖☆180Updated 4 months ago
- Grabbing everything from reddit.☆62Updated last year
- Docker Container for grab-site☆12Updated last year
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.☆22Updated last year
- The Temboz RSS/Atom feed reader☆83Updated 2 years ago
- Rexit - Liberate your Reddit Chats. This tool will export your reddit chats into a plethora of formats☆31Updated last month
- A server to collect & archive websites that also supports video downloads☆86Updated 2 years ago
- A Python Reddit API Wrapper (PRAW) script to download all of the accessible wiki pages of a Reddit subreddit☆51Updated last year
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)☆114Updated 2 months ago
- Our minimalist Certbot alternative that uses the Porkbun API to download and install web server SSL certificates☆43Updated 2 years ago
- Home of the official docker image for ArchiveBox☆53Updated 11 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆52Updated this week
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.☆92Updated 5 years ago
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)☆153Updated last week
- A list of things related to software, literature, and other content for 🕣 Memento☆102Updated last year
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 …☆87Updated 3 months ago
- [mirror] Backup a list of github starred repositories for the specified user.☆142Updated 2 years ago
- Roffline allows you to browse Reddit offline☆82Updated last year
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆59Updated last year
- Convert (E-Mail) messages into RSS feeds☆68Updated 6 years ago
- A self-hosted, drop-in replacement for Pushover that can use XMPP, as well as a wide variety of other services as the delivery method whi…☆49Updated 2 months ago