internetarchive / brozzler
brozzler - distributed browser-based web crawler
☆669Updated this week
Related projects ⓘ
Alternatives and complementary repositories for brozzler
- Core Python Web Archiving Toolkit for replay and recording of web archives☆1,396Updated this week
- WARC writing MITM HTTP/S proxy☆380Updated this week
- The OpenWayback Development☆484Updated 10 months ago
- Wget-compatible web downloader and crawler.☆555Updated 6 months ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆640Updated this week
- Collect and revisit web pages.☆1,482Updated last year
- Squidwarc is a high fidelity, user scriptable, archival crawler that uses Chrome or Chromium with or without a head☆169Updated 4 years ago
- Chrome extension to "Create WARC files from any webpage"