webrecorder / browsertrix
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
☆277 · Updated this week
Alternatives and similar repositories for browsertrix
Users who are interested in browsertrix are comparing it to the libraries listed below.
- High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆160 · Updated last month
- wabac.js - Web Archive Browsing Augmentation Client ☆108 · Updated last week
- Specifications developed and maintained by the Webrecorder community. ☆131 · Updated 4 months ago
- A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivity ☆95 · Updated 6 years ago
- brozzler - distributed browser-based web crawler ☆713 · Updated this week
- Converts WARC files to static HTML ☆44 · Updated 11 months ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox. ☆318 · Updated last month
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents) ☆160 · Updated last week
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages. ☆32 · Updated 3 weeks ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework. ☆113 · Updated this week
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder) ☆446 · Updated 4 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head ☆170 · Updated 5 years ago
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs. ☆54 · Updated last year
- Web Archiving Integration Layer: One-Click User Instigated Preservation ☆373 · Updated 2 months ago
- A list of things related to software, literature, and other content for Memento ☆98 · Updated last year
- A simple Python wrapper and command-line interface for archive.org's "Save Page Now" capturing service ☆179 · Updated 7 months ago
- Make a ZIM file from any Web site and surf offline! ☆527 · Updated last month
- Tool and library for handling Web ARChive (WARC) files. ☆159 · Updated 7 months ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC) ☆85 · Updated last month
- ☆45 · Updated last year
- Centralised repository for WARC usage specifications. ☆111 · Updated 6 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) format ☆27 · Updated last year
- Indelible links ☆466 · Updated last week
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆58 · Updated 2 months ago
- WARC and ARC indexing and discovery tools. ☆124 · Updated 2 months ago
- Command line tool for digging into WARC files ☆40 · Updated 3 weeks ago
- The OpenWayback Development ☆498 · Updated last year
- NPM package and CLI tool for saving a web page as a single HTML file ☆48 · Updated last week
- A Memento Aggregator CLI and Server in Go ☆65 · Updated 3 months ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆123 · Updated 5 months ago