Own-Data-Privateer / hoardy-web
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
☆90 Updated last month
Alternatives and similar repositories for hoardy-web
Users who are interested in hoardy-web are comparing it to the repositories listed below.
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives) ☆143 Updated last week
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch… ☆19 Updated last year
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en… ☆19 Updated 2 months ago
- Self-hostable link database ☆119 Updated this week
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆170 Updated last week
- A self-hosted bookmark database with full-text page content search ☆95 Updated 3 months ago
- Creates a complete full text historical archive for an RSS or ATOM feed. ☆124 Updated last month
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...) ☆110 Updated last week
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page… ☆40 Updated 11 months ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more … ☆328 Updated last week
- A list of things related to software, literature, and other content for 🕣 Memento ☆99 Updated last year
- Gets your upvoted posts from Hacker News and imports them to raindrop.io ☆26 Updated 2 years ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported. ☆22 Updated 10 months ago
- ⬇️ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). 🎭 … ☆82 Updated 3 weeks ago
- 📜 A CLI toolkit for extracting and working with your digital history ☆172 Updated last year
- Full text search all your browsing history using Postgres + WASM ☆136 Updated 4 months ago
- Human Programming Interface - a way to unify, access and interact with all of my personal data [my modules] ☆85 Updated last week
- Chrome Extension for Hacker News and Reddit Links ☆37 Updated 2 years ago
- Tool to index and serve HTML files. Powered by Datasette. ☆105 Updated 3 years ago
- A platform for building and distributing JS bookmarklets created from GitHub gists ☆71 Updated 5 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆70 Updated 5 months ago
- Web page archive tool ☆27 Updated 8 months ago
- Web extension for Firefox and Chrome that shows a popup with a list of your Omnivore articles to quickly open or archive (similar to the … ☆72 Updated 9 months ago
- A set of scripts that connect various apps to Raindrop.io ☆19 Updated 4 months ago
- This is the HeadQuarters of my digital info. HPI library got me inspired and I'm trying to play with the idea on a smaller scale for myse… ☆21 Updated last year
- Securely collect browsing history over browsers. ☆96 Updated 4 years ago
- A Collection of Awesome Personal Search Engines and Related Projects ☆19 Updated 2 years ago
- A userscript to click "show more" links to expand all the text on a page, without slowing things down too much ☆98 Updated 4 months ago
- A language/format/parser for manual logging of quantified self data, easily editable by humans ☆12 Updated last year
- Export your Github activity: events, repositories, stars, etc. ☆52 Updated last month