mikwielgus / forum-dl
Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC
☆74Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for forum-dl
- Downloads content from reddit☆18Updated last year
- Chrome Extension for Hacker News and Reddit Links☆26Updated last year
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆44Updated this week
- Selenium Open Source Search Engine & crawler☆38Updated last week
- A self-hosted bookmark database with full-text page content search☆80Updated last year
- Creates a complete full text historical archive for an RSS or ATOM feed.☆104Updated this week
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)☆112Updated this week
- Reddit archiver☆153Updated 9 months ago
- A Python Reddit API Wrapper (PRAW) script to download all of the accessible wiki pages of a Reddit subreddit☆45Updated 3 weeks ago
- Simple print styles for saving Twitter threads as PDFs.☆30Updated last year
- A simple script to generate an RSS feed for self-hosted audio/videos that can be used with Apple Podcast, Amazon Podcasts and more.☆29Updated 2 months ago
- Self-hostable link archive☆70Updated this week
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.☆244Updated 3 months ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.☆18Updated last week
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆117Updated this week
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆105Updated last month
- searchmysite.net is an open source search engine and search as a service☆76Updated last week
- Grabbing everything from reddit.☆59Updated 8 months ago
- A privacy focused, self-hosted podcatcher.☆35Updated 11 months ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆99Updated 2 months ago
- Archive all your favorite podcasts☆118Updated this week
- ☆78Updated 2 years ago
- script that generates an rss feed out of websites that don't have one☆29Updated 5 years ago
- Lipupini is a public domain platform for organizing computer files like images, videos, sounds and writings that you might want to displa…☆18Updated 5 months ago
- Personal WayBack Machine☆122Updated 4 years ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆15Updated last month
- Command line tool written in Go for sorting and categorizing personal files like screenshots, recordings, logs and more.☆19Updated 2 years ago
- WIP - scripts for analyzing the (in)security of Chrome extensions☆25Updated 7 months ago
- 🦛 scrapes websites and generates rss feeds☆52Updated 9 months ago
- Convert (E-Mail) messages into RSS feeds☆58Updated 5 years ago