mikwielgus / forum-dl
Scrape posts and threads from forums, news aggregators, and mail archives; export to JSONL, mailbox, or WARC
☆105 · Updated last year
Alternatives and similar repositories for forum-dl
Users who are interested in forum-dl are comparing it to the repositories listed below
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing… ☆105 · Updated 2 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆75 · Updated 3 weeks ago
- Reddit archiver ☆186 · Updated last year
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆130 · Updated 4 months ago
- A self-hosted bookmark database with full-text page content search ☆96 · Updated 7 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A… ☆18 · Updated last month
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives) ☆177 · Updated this week
- Grabbing everything from reddit. ☆61 · Updated last year
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox. ☆396 · Updated 8 months ago
- A list of things related to software, literature, and other content for 🕣 Memento ☆103 · Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. ☆54 · Updated last month
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now ☆136 · Updated 9 months ago
- Reddit takeout: export your account data as JSON: comments, submissions, upvotes etc. 🦖 ☆180 · Updated 5 months ago
- Home of the official docker image for ArchiveBox ☆52 · Updated last year
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆15 · Updated 5 years ago
- The Temboz RSS/Atom feed reader ☆84 · Updated 2 years ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported. ☆23 · Updated last year
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆186 · Updated 4 months ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆58 · Updated last year
- Self-hostable link database and RSS reader ☆135 · Updated this week
- A server to collect & archive websites that also supports video downloads ☆84 · Updated 2 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page… ☆40 · Updated last year
- Maubot plugin list ☆22 · Updated 2 months ago
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 … ☆89 · Updated last week
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...) ☆119 · Updated this week
- A modified version of searx (the privacy-respecting metasearch engine) to only search an allowlist of sites, to build functionality simil… ☆19 · Updated 4 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en… ☆19 · Updated 5 months ago
- A Python Reddit API Wrapper (PRAW) script to download all of the accessible wiki pages of a Reddit subreddit ☆52 · Updated last year
- Fetch all your bookmarked tweets and make them accessible through a webinterface. ☆29 · Updated 2 years ago
- Reverse Incremental Rclone Backups ☆15 · Updated 2 years ago