Command line tool to convert a file in the WARC format to a file in the ZIM format
☆79Mar 16, 2026Updated last week
Alternatives and similar repositories for warc2zim
Users that are interested in warc2zim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Nov 21, 2025Updated 4 months ago
- Make a ZIM file from any Web site and surf offline!☆742Mar 17, 2026Updated last week
- Turns a collection of documents into a browsable ZIM file☆27Dec 18, 2025Updated 3 months ago
- ArchiveWeb.page Express!☆14Nov 1, 2024Updated last year
- Web archive index server based on RocksDB☆38Mar 2, 2026Updated 3 weeks ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Command line tool for digging into WARC files☆51Feb 27, 2026Updated last month
- Convert Directories, Files and ZIP Files to Web Archives (WARC)☆97Apr 22, 2025Updated 11 months ago
- ☆10Dec 3, 2025Updated 3 months ago
- Various ZIM command line tools☆196Feb 27, 2026Updated last month
- Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives☆16Jun 10, 2021Updated 4 years ago
- 🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…☆19Jul 11, 2025Updated 8 months ago
- Run a high-fidelity browser-based web archiving crawler in a single Docker container☆1,002Updated this week
- ☆11Jul 20, 2023Updated 2 years ago
- Collection of Python code to re-use across Python-based scrapers☆25Mar 20, 2026Updated last week
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆56Feb 10, 2026Updated last month
- A web archives reader☆116Feb 12, 2026Updated last month
- Mounts WARC files on Windows☆16Apr 20, 2019Updated 6 years ago
- CDXJ Indexing of WARC/ARCs☆33Dec 10, 2024Updated last year
- JS Streaming WARC IO optimized for Browser and Node☆53Mar 20, 2026Updated last week
- Farm operated by bots to grow and harvest new zim files☆189Updated this week
- Reference implementation of the ZIM specification☆228Updated this week
- Web archiving using Google Chrome☆46Dec 30, 2019Updated 6 years ago
- A set of Docker images for streaming a remote desktop video and audio☆27May 15, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆17Oct 2, 2025Updated 5 months ago
- Dataset for the ACL 2015 paper : Learning to Explain Entity Relationships in Knowledge Graphs☆11Oct 22, 2015Updated 10 years ago
- ☆12Jan 18, 2016Updated 10 years ago
- A command line utility for listing and searching snapshots in web archives☆17Dec 21, 2023Updated 2 years ago
- MS Marco Entity Annotations Disambiguation☆13May 19, 2023Updated 2 years ago
- Backend, IA-specific tools for crawling and processing the scholarly web. Content ends up in https://fatcat.wiki☆28Jul 31, 2024Updated last year
- An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed…☆158Oct 8, 2025Updated 5 months ago
- 📌 replaces mutable tags or branch names by commit shas in your GitHub actions☆14Jun 18, 2024Updated last year
- ☆60Apr 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)☆25Oct 9, 2017Updated 8 years ago
- Simple CertificateAuthority and host certificate creation, useful for man-in-the-middle HTTPS proxy☆25Sep 29, 2022Updated 3 years ago
- WASAPI data transfer APIs☆49Apr 23, 2022Updated 3 years ago
- Nondestructive warc-in-tar to warc conversion☆27Apr 21, 2013Updated 12 years ago
- Specifications developed and maintained by the Webrecorder community.☆140Oct 16, 2025Updated 5 months ago
- Download, parse, and filter data from Literotica. Data-ready for The-Pile.☆11Sep 18, 2020Updated 5 years ago
- Ad-hoc light weight SPARQL endpoint from a file, using Python Flask and RDFlib☆15Oct 24, 2016Updated 9 years ago