webrecorder / archiveweb.page-site
The ArchiveWeb.page Site
☆31Updated 4 months ago
Alternatives and similar repositories for archiveweb.page-site:
Users that are interested in archiveweb.page-site are comparing it to the libraries listed below
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆15Updated 3 years ago
- Encode/decode binary data over a live streaming video in real time.☆13Updated last year
- DigestBox takes any webpage URL (news article, video link, comment thread, etc.) and gives you just the raw content. It's powered by Arch…☆19Updated last year
- Javascript/Node wrapper around Mozilla's Readability library so that ArchiveBox can call it as a oneshot CLI command to extract each page…☆40Updated 7 months ago
- Homebrew formula for the ArchiveBox self-hosted internet archiving solution.☆28Updated 6 months ago
- Webrecorder Automated In-Page Behavior Framework☆13Updated 4 years ago
- Official Python package for ArchiveBox, the self-hosted internet archiving solution.☆13Updated 6 months ago
- Clean a series of links, resolving redirects and finding Wayback results if page is gone. Originally written to aid with importing from A…☆18Updated 6 months ago
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewing…☆76Updated 3 weeks ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆15Updated 4 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆17Updated last month
- ArchiveBoxMatic: configure ArchiveBox with the simplicity of a yaml file.☆14Updated 4 years ago
- Home of the official apt/deb package for Ubuntu/Debian-based systems.☆17Updated 6 months ago
- Archiving public telegram messages.☆12Updated 3 months ago
- Tool to index and serve HTML files. Powered by Datasette.☆98Updated 3 years ago
- rsstodolist Firefox and Chrome addon (using Web Extension API)☆13Updated 2 years ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆56Updated last month
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆120Updated 3 months ago
- Proxies third-party PDF files and HTML pages with the Hypothesis client embedded, so you can annotate them☆23Updated 2 weeks ago
- Python script to extract news from RSS feeds and save it as json.☆18Updated 2 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆53Updated 8 months ago
- A dockerized, queued high fidelity web archiver based on Squidwarc☆58Updated 9 months ago
- rsstodolist is an URL oriented to-read-list based on an RSS XML feed☆16Updated this week
- A prototype server to swarm multiple DATs for Webrecorder☆14Updated 5 years ago
- Whole Feedbin stack in a container.☆28Updated last week
- Export your Github activity: events, repositories, stars, etc.☆53Updated last year
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format☆51Updated 6 years ago
- Share files to the decentralized, unblockable torrent swarm and share links using Handshake domains. Decentralized internet is here.☆34Updated 3 years ago
- DNS Stamps library for Go☆18Updated last year
- Modified version of the original yarr(yet another RSS reader)☆9Updated last year