Own-Data-Privateer / hoardy-webLinks
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
☆106Updated 2 months ago
Alternatives and similar repositories for hoardy-web
Users that are interested in hoardy-web are comparing it to the libraries listed below
Sorting:
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)☆179Updated 2 weeks ago
- Self-hostable link database and RSS reader☆137Updated this week
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Updated last year
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆185Updated 4 months ago
- Tool to index and serve HTML files. Powered by Datasette.☆111Updated 3 years ago
- Creates a complete full text historical archive for an RSS or ATOM feed.☆130Updated last week
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆19Updated 6 months ago
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 …☆89Updated 2 weeks ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …☆380Updated this week
- Chrome Extension for Hacker News and Reddit Links☆45Updated 2 years ago
- A self-hosted bookmark database with full-text page content search☆96Updated 7 months ago
- A Library Genesis and OPDS "bridge". You can search and download books on LibGen via KOReaders OPDS Search. Most read lists from GoodRea…☆39Updated last year
- A platform for building and distributing JS bookmarklets created from GitHub gists☆72Updated 9 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆75Updated last month
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.☆23Updated last year
- Bookmarked archived links☆25Updated this week
- A set of scripts that connect various apps to Raindrop.io☆17Updated 8 months ago
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)☆120Updated last week
- Full text search all your browsing history using Postgres + WASM☆141Updated 8 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆18Updated last month
- Chrome extension that adds to your browsing experience by showing you relevant discussions about your current web page from Hacker News a…☆96Updated 3 years ago
- Web extension for Firefox and Chrome that shows a popup with a list of your Omnivore articles to quickly open or archive (similar to the …☆68Updated last year
- A list of things related to software, literature, and other content for 🕣 Memento☆104Updated last week
- Host-free RSS reader in your browser.☆19Updated 5 months ago
- Gets your upvoted posts from Hacker News and imports them to raindrop.io☆26Updated 2 years ago
- A userscript to click "show more" links to expand all the text on a page, without slowing things down too much☆102Updated 8 months ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆42Updated last year
- Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC☆107Updated last year
- Export your Github activity: events, repositories, stars, etc.☆55Updated 5 months ago
- Securely collect browsing history over browsers.☆97Updated 4 years ago