webrecorder / browsertrixLinks
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
β336Updated this week
Alternatives and similar repositories for browsertrix
Users that are interested in browsertrix are comparing it to the libraries listed below
Sorting:
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β174Updated last month
- Serverless replay of web archives directly in the browserβ848Updated this week
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.β358Updated 5 months ago
- Specifications developed and maintained by the Webrecorder community.β136Updated 8 months ago
- wabac.js - Web Archive Browsing Augmentation Clientβ114Updated this week
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β131Updated last week
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewingβ¦β95Updated 2 months ago
- Core Python Web Archiving Toolkit for replay and recording of web archivesβ1,562Updated this week
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β37Updated 5 months ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ97Updated 7 years ago
- Web archive index server based on RocksDBβ35Updated last month
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- A tool for detecting viruses and NSFW material in WARC filesβ17Updated last year
- Converts WARC files to static HTMLβ49Updated 2 weeks ago
- β53Updated last year
- brozzler - distributed browser-based web crawlerβ742Updated last week
- Web Archiving Integration Layer: One-Click User Instigated Preservationβ379Updated 6 months ago
- Chrome extension to "Create WARC files from any webpage"β223Updated last year
- Indelible linksβ485Updated last week
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in tβ¦β128Updated 2 months ago
- Creates a complete full text historical archive for an RSS or ATOM feed.β124Updated 2 months ago
- Static Site Generator for Viewing Web Archives (in WACZ) formatβ28Updated 2 years ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM fileβ396Updated this week
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β54Updated last year
- A Memento Aggregator CLI and Server in Goβ68Updated 7 months ago
- Make a ZIM file from any Web site and surf offline!β614Updated 3 months ago
- Command line tool for digging into WARC filesβ46Updated last week
- Own webarchive serviceβ163Updated 5 months ago
- The OpenWayback Developmentβ505Updated last year
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β165Updated last month