An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)
☆25 · Oct 9, 2017 · Updated 8 years ago
Alternatives and similar repositories for Web2Warc
Users that are interested in Web2Warc are comparing it to the libraries listed below.
- A prototype server to swarm multiple DATs for Webrecorder ☆14 · Apr 27, 2019 · Updated 6 years ago
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed… ☆158 · Oct 8, 2025 · Updated 6 months ago
- A dockerized, queued high fidelity web archiver based on Squidwarc ☆62 · Jul 9, 2024 · Updated last year
- ☆14 · Feb 28, 2017 · Updated 9 years ago
- Web Archives for Historical Research ☆13 · Jun 12, 2017 · Updated 8 years ago
- A client for the Archive-It And Webrecorder WASAPI Data Transfer API ☆16 · Oct 18, 2019 · Updated 6 years ago
- ☆27 · Oct 14, 2022 · Updated 3 years ago
- WASAPI data transfer APIs ☆50 · Apr 23, 2022 · Updated 3 years ago
- A Memento Aggregator CLI and Server in Go ☆78 · Mar 4, 2025 · Updated last year
- A service that provides archive-aware oEmbed-compatible embeddable surrogates (social cards, thumbnails, etc.) for archived web pages (me… ☆14 · Nov 15, 2021 · Updated 4 years ago
- A golang library to work with WARC files from the common crawl ☆15 · Feb 20, 2018 · Updated 8 years ago
- Parallelized web crawler written in Golang ☆15 · Oct 2, 2018 · Updated 7 years ago
- ☆17 · Mar 31, 2025 · Updated last year
- Golang WARC (Web ARChive) Library