mikwielgus / forum-dlLinks
Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
☆108Updated last year
Alternatives and similar repositories for forum-dl
Users that are interested in forum-dl are comparing it to the libraries listed below
Sorting:
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆131Updated 2 weeks ago
- Reddit archiver☆189Updated last year
- A self-hosted bookmark database with full-text page content search☆96Updated 8 months ago
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆107Updated 3 months ago
- A web scraping suite to efficiently load .epub files onto your Kindle.☆26Updated 2 months ago
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)☆180Updated this week
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆75Updated last week
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆138Updated 9 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆19Updated 2 months ago
- Self-hostable link database and RSS reader☆138Updated this week
- Rexit - Liberate your Reddit Chats. This tool will export your reddit chats into a plethora of formats☆31Updated 3 months ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆42Updated last year
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 …☆90Updated 3 weeks ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.☆408Updated 8 months ago
- FUPS: Forum user-post scraper☆21Updated 2 months ago
- Reddit takeout: export your account data as JSON: comments, submissions, upvotes etc. 🦖☆181Updated this week
- Home of the official docker image for ArchiveBox☆53Updated last year
- A self-hosted, drop-in replacement for Pushover (https://codeberg.org/mrus/overpush)☆59Updated 2 weeks ago
- A server to collect & archive websites that also supports video downloads☆84Updated 2 years ago
- A Python Reddit API Wrapper (PRAW) script to download all of the accessible wiki pages of a Reddit subreddit☆53Updated last year
- Grabbing everything from reddit.☆61Updated last year
- Docker Container for grab-site☆13Updated last year
- Scripts to build and boot warrior virtual machine containing Docker☆122Updated 9 months ago
- Roffline allows you to browse Reddit offline☆81Updated 2 years ago
- Simple podcast downloader☆37Updated 7 months ago
- Tool to index and serve HTML files. Powered by Datasette.☆111Updated 3 years ago
- Personal WayBack Machine☆129Updated 6 years ago
- Pinimatic is a socially integrated self-hosted, Pinterest inspired by Wookmark and built on top of Django. Now pre-configured for Heroku…☆26Updated 8 years ago
- The Temboz RSS/Atom feed reader☆84Updated 2 years ago
- Creates a complete full text historical archive for an RSS or ATOM feed.☆130Updated 3 weeks ago