webrecorder / browsertrixLinks
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
β352Updated this week
Alternatives and similar repositories for browsertrix
Users that are interested in browsertrix are comparing it to the libraries listed below
Sorting:
- Run a high-fidelity browser-based web archiving crawler in a single Docker containerβ915Updated this week
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β179Updated 2 months ago
- wabac.js - Web Archive Browsing Augmentation Clientβ114Updated this week
- Specifications developed and maintained by the Webrecorder community.β136Updated last month
- Serverless replay of web archives directly in the browserβ856Updated 2 weeks ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.β376Updated 6 months ago
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ98Updated 7 years ago
- A list of things related to software, literature, and other content for π£ Mementoβ102Updated last year
- Converts WARC files to static HTMLβ49Updated 2 months ago
- Web Archiving Integration Layer: One-Click User Instigated Preservationβ381Updated 8 months ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β131Updated last week
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewingβ¦β102Updated last month
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β51Updated last week
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β55Updated last year
- A tool for detecting viruses and NSFW material in WARC filesβ17Updated last year
- Centralised repository for WARC usage specifications.β118Updated last month
- Indelible linksβ489Updated last week
- β54Updated last year
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ72Updated 7 months ago
- A Tool To Push Web Resources Into Web Archivesβ423Updated last year
- Web archive index server based on RocksDBβ36Updated 2 weeks ago
- Convert Directories, Files and ZIP Files to Web Archives (WARC)β89Updated 6 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β167Updated 2 months ago
- Please note that the warc-indexer tool & code is now supported by NetArchiveSuite. The 'warc-indexer' directory and code that exists in tβ¦β130Updated 3 months ago
- A Memento Aggregator CLI and Server in Goβ70Updated 8 months ago
- Python 3 tools for downloading and preserving wikisβ122Updated 4 months ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM fileβ407Updated this week
- Command line tool for digging into WARC filesβ47Updated this week
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β38Updated 6 months ago
- The OpenWayback Developmentβ506Updated last year