webrecorder / browsertrixLinks
Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more accessible for all!
β314Updated this week
Alternatives and similar repositories for browsertrix
Users that are interested in browsertrix are comparing it to the libraries listed below
Sorting:
- π¨ High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.β169Updated 2 weeks ago
- wabac.js - Web Archive Browsing Augmentation Clientβ114Updated last week
- Specifications developed and maintained by the Webrecorder community.β136Updated 7 months ago
- Official ArchiveBox browser extension: automatically/manually preserve your browsing history using ArchiveBox.β348Updated 3 months ago
- Converts WARC files to static HTMLβ47Updated last year
- Chrome extension to "Create WARC files from any webpage"β222Updated last year
- A Tool To Push Web Resources Into Web Archivesβ422Updated last year
- β52Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.β47Updated last week
- A list of things related to software, literature, and other content for π£ Mementoβ99Updated last year
- Web archive index server based on RocksDBβ34Updated last month
- Web Archiving Integration Layer: One-Click User Instigated Preservationβ377Updated 5 months ago
- Experimental proxy and wrapper for safely embedding Web Archives (warc, warc.gz, wacz) into web pages.β36Updated 3 months ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.β131Updated last month
- π A compilation of research relevant to Data Together's efforts tackling the general problem of data resilience & interactivityβ96Updated 6 years ago
- Passively capture, archive, and hoard your web browsing history, including the contents of the pages you visit, for later offline viewingβ¦β88Updated last month
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.β127Updated 7 months ago
- Python 3 tools for downloading and preserving wikisβ118Updated last month
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)β448Updated 4 years ago
- π A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, Bβ¦β375Updated 3 months ago
- A Memento Aggregator CLI and Server in Goβ68Updated 5 months ago
- A tool for detecting viruses and NSFW material in WARC filesβ15Updated last year
- Make a ZIM file from any Web site and surf offline!β586Updated 2 months ago
- Centralised repository for WARC usage specifications.β115Updated 9 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)β162Updated last week
- Indelible linksβ476Updated this week
- Command line tool to convert a file in the WARC format to a file in the ZIM formatβ65Updated 5 months ago
- Command line tool for digging into WARC filesβ45Updated this week
- (Experimental) High-fidelity capture of Twitter threads as sealed PDFs.β54Updated last year
- A simple Python wrapper and command-line interface for archive.orgβs "Save Page Now" capturing serviceβ182Updated 10 months ago