Own-Data-Privateer / hoardy-webLinks
Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing, replay, mirroring, data scraping, and/or indexing. Your own personal private Wayback Machine that can also archive HTTP POST requests and responses, as well as most other HTTP-level data.
β95Updated 2 months ago
Alternatives and similar repositories for hoardy-web
Users that are interested in hoardy-web are comparing it to the libraries listed below
Sorting:
- backup and parse your browser history databases (chrome, firefox, safari, and other chrome/firefox derivatives)β145Updated 3 weeks ago
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β174Updated last month
- π§© Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser enβ¦β19Updated 2 months ago
- Self-hostable link databaseβ120Updated this week
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Archβ¦β19Updated last year
- A self-hosted bookmark database with full-text page content searchβ96Updated 4 months ago
- Chrome Extension for Hacker News and Reddit Linksβ43Updated 2 years ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.β358Updated 5 months ago
- β¬οΈ A simple all-in-one CLI tool to download EVERYTHING from a URL (like youtube-dl/yt-dlp, forum-dl, gallery-dl, simpler ArchiveBox). π β¦β84Updated last month
- Web page archive toolβ27Updated 2 weeks ago
- Gets your upvoted posts from Hacker News and imports them to raindrop.ioβ26Updated 2 years ago
- Full text search all your browsing history using Postgres + WASMβ138Updated 5 months ago
- Creates a complete full text historical archive for an RSS or ATOM feed.β124Updated 2 months ago
- Export userdata from your reddit accounts. Submissions, comments, saved, upvoted contents are supported.β22Updated 11 months ago
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β336Updated this week
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- A set of scripts that connect various apps to Raindrop.ioβ17Updated 5 months ago
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each pageβ¦β40Updated last year
- Host-free RSS reader in your browser.β19Updated 2 months ago
- Web extension for Firefox and Chrome that shows a popup with a list of your Omnivore articles to quickly open or archive (similar to the β¦β72Updated 10 months ago
- A library/CLI tool to parse data out of your Google Takeout (History, Activity, Youtube, Locations, etc...)β111Updated 3 weeks ago
- Tool to index and serve HTML files. Powered by Datasette.β107Updated 3 years ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from Aβ¦β18Updated last year
- A userscript to click "show more" links to expand all the text on a page, without slowing things down too muchβ98Updated 5 months ago
- A platform for building and distributing JS bookmarklets created from GitHub gistsβ68Updated 6 months ago
- π A CLI toolkit for extracting and working with your digital historyβ175Updated last year
- Quickly generate an RSS feed from any websiteβ66Updated last year
- Chrome extension that adds to your browsing experience by showing you relevant discussions about your current web page from Hacker News aβ¦β99Updated 3 years ago
- Export your Github activity: events, repositories, stars, etc.β52Updated 2 months ago
- A modern, keyboard-driven RSS feed reader that brings back the magic of Google Reader with AI! πβ20Updated 9 months ago