webrecorder / browsertrixLinks
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
β298Updated this week
Alternatives and similar repositories for browsertrix
Users that are interested in browsertrix are comparing it to the libraries listed below
Sorting:
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β162Updated last month
- Run a high-fidelity browser-based web archiving crawler in a single Docker containerβ824Updated last week
- Specifications developed and maintained by the Webrecorder community.β132Updated 6 months ago
- Serverless replay of web archives directly in the browserβ815Updated 3 weeks ago
- wabac.js - Web Archive Browsing Augmentation Clientβ109Updated 2 weeks ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.β323Updated 2 months ago
- Core Python Web Archiving Toolkit for replay and recording of web archivesβ1,527Updated 2 months ago
- Web Archiving Integration Layer: One-Click User Instigated Preservationβ376Updated 4 months ago
- A Tool To Push Web Resources Into Web Archivesβ421Updated last year
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β117Updated this week
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β34Updated 2 months ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ95Updated 6 years ago
- Indelible linksβ470Updated 3 weeks ago
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β54Updated last year
- Converts WARC files to static HTMLβ46Updated last year
- A tool for detecting viruses and NSFW material in WARC filesβ15Updated 11 months ago
- β51Updated last year
- A list of things related to software, literature, and other content for π£ Memento