webrecorder / browsertrixLinks
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
β371Updated this week
Alternatives and similar repositories for browsertrix
Users that are interested in browsertrix are comparing it to the libraries listed below
Sorting:
- Run a high-fidelity browser-based web archiving crawler in a single Docker containerβ935Updated last week
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β184Updated 3 months ago
- Serverless replay of web archives directly in the browserβ880Updated 3 weeks ago
- Specifications developed and maintained by the Webrecorder community.β138Updated 2 months ago
- Core Python Web Archiving Toolkit for replay and recording of web archivesβ1,593Updated last month
- wabac.js - Web Archive Browsing Augmentation Clientβ117Updated 3 weeks ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.β393Updated 7 months ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ97Updated 7 years ago
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewingβ¦β105Updated 2 months ago
- Converts WARC files to static HTMLβ49Updated 3 months ago
- Indelible linksβ490Updated last week
- A list of things related to software, literature, and other content for π£ Mementoβ102Updated last year
- Offline Internet Archive projectβ308Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β54Updated last month
- A Memento Aggregator CLI and Server in Goβ72Updated 9 months ago
- Chrome extension to "Create WARC files from any webpage"β226Updated 3 weeks ago
- brozzler - distributed browser-based web crawlerβ765Updated this week
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β90Updated 8 months ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β132Updated last week
- The OpenWayback Developmentβ507Updated last year
- A Tool To Push Web Resources Into Web Archivesβ425Updated last year
- Web Archiving Integration Layer: One-Click User Instigated Preservationβ385Updated 9 months ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β39Updated last month
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in tβ¦β131Updated last month
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β168Updated 4 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ75Updated last week
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)β445Updated 5 years ago
- Command line tool for digging into WARC filesβ49Updated last week
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β55Updated 2 years ago
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a headβ172Updated 5 years ago