Own-Data-Privateer / hoardy-webLinks
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
β88Updated last month
Alternatives and similar repositories for hoardy-web
Users that are interested in hoardy-web are comparing it to the libraries listed below
Sorting:
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)β143Updated 9 months ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β169Updated 2 weeks ago
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Updated last year
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Updated last month
- A self-hosted bookmark database with full-text page content searchβ94Updated 3 months ago
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)β107Updated 3 weeks ago
- Self-hostable link databaseβ106Updated this week
- β¬οΈ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). π β¦β80Updated this week
- A platform for building and distributing JS bookmarklets created from GitHub gistsβ70Updated 4 months ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.β22Updated 9 months ago
- A set of scripts that connect various apps to Raindrop.ioβ17Updated 4 months ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β314Updated this week
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from Aβ¦β18Updated 10 months ago
- Tool to index and serve HTML files. Powered by Datasette.β104Updated 3 years ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β40Updated 11 months ago
- π An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.β58Updated last year
- A modern, keyboard-driven RSS feed reader that brings back the magic of Google Reader with AI! πβ19Updated 7 months ago
- Full text search all your browsing history using Postgres + WASMβ134Updated 4 months ago
- Chrome Extension for Hacker News and Reddit Linksβ37Updated 2 years ago
- Web page archive toolβ26Updated 7 months ago
- Human Programming Interface - a way to unify, access and interact with all of my personal data [my modules]β84Updated 5 months ago
- Gets your upvoted posts from Hacker News and imports them to raindrop.ioβ25Updated 2 years ago
- Synchronize your Mastodon bookmarks to bookmarking services.β13Updated last week
- Creates a complete full text historical archive for an RSS or ATOM feed.β123Updated 3 weeks ago
- π A CLI toolkit for extracting and working with your digital historyβ171Updated last year
- Where knowledge grows.β20Updated 9 months ago
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- BookFusion Calibre Pluginβ20Updated 2 years ago
- A program to generate epub file from articles saved in your Omnivore library and optionally send it to your eReader using emailβ107Updated 9 months ago
- Add all of your starred repos to raindrop.ioβ16Updated 4 years ago