openzim/warc2zim

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/openzim/warc2zim)

openzim / warc2zim

Command line tool to convert a file in the WARC format to a file in the ZIM format

☆87

Alternatives and similar repositories for warc2zim

Users that are interested in warc2zim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ukwa / ukwa-pywb
View on GitHub
☆11Nov 21, 2025Updated 8 months ago
openzim / zimit
View on GitHub
Make a ZIM file from any Web site and surf offline!
☆815Updated this week
openzim / nautilus
View on GitHub
Turns a collection of documents into a browsable ZIM file
☆30Jun 15, 2026Updated last month
nla / outbackcdx
View on GitHub
Web archive index server based on RocksDB
☆43Jul 9, 2026Updated 2 weeks ago
openzim / zimit-frontend
View on GitHub
Zimit Public Web UI
☆25Jul 6, 2026Updated 2 weeks ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
harvard-lil / waczerciser
View on GitHub
Create and edit WARC and WACZ files
☆29Dec 6, 2024Updated last year
webrecorder / warcit
View on GitHub
Convert Directories, Files and ZIP Files to Web Archives (WARC)
☆100Apr 22, 2025Updated last year
openzim / zimfarm
View on GitHub
Farm operated by bots to grow and harvest new zim files
☆196Updated this week
webrecorder / pywb-remote-browsers
View on GitHub
Docker Compose based system for running remote browsers (including Flash and Java support) connected to web archives
☆16Jun 10, 2021Updated 5 years ago
openzim / zim-tools
View on GitHub
Various ZIM command line tools
☆211Jul 17, 2026Updated last week
renevoorburg / robustify.js
View on GitHub
A javascript for fighting link rot and content drift using link decoration and web archives.
☆17Oct 31, 2024Updated last year
webrecorder / browsertrix-crawler
View on GitHub
Run a high-fidelity browser-based web archiving crawler in a single Docker container
☆1,092Updated this week
webrecorder / browsertrix-behaviors
View on GitHub
Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.
☆58Updated this week
kiwix / container-images
View on GitHub
☆28Updated this week
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Rhizome-Conifer / conifer-deploy
View on GitHub
Conifer setup and deployment via Ansible
☆12Jun 15, 2020Updated 6 years ago
ArchiveBox / abx-spec-behaviors
View on GitHub
🧩 Proposal to allow user scripts like "expand comments", "hide popups", "fill out this form", etc. to be reusable across pure browser en…
☆20Jul 11, 2025Updated last year
openzim / python-scraperlib
View on GitHub
Collection of Python code to re-use across Python-based scrapers
☆30Updated this week
reprozip-news-apps / reprozip-web
View on GitHub
ReproZip for the Preservation of Web Applications
☆17May 6, 2024Updated 2 years ago
antiufo / Shaman.Dokan.Warc
View on GitHub
Mounts WARC files on Windows
☆16Apr 20, 2019Updated 7 years ago
webrecorder / cdxj-indexer
View on GitHub
CDXJ Indexing of WARC/ARCs
☆35May 11, 2026Updated 2 months ago
webrecorder / warcio.js
View on GitHub
JS Streaming WARC IO optimized for Browser and Node
☆55Updated this week
birros / web-archives
View on GitHub
A web archives reader
☆119Feb 12, 2026Updated 5 months ago
openzim / libzim
View on GitHub
Reference implementation of the ZIM specification
☆252Jul 11, 2026Updated 2 weeks ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
webis-de / wasp
View on GitHub
☆28Jun 30, 2026Updated 3 weeks ago
edsu / memento-cli
View on GitHub
A command line utility for listing and searching snapshots in web archives
☆20Jun 4, 2026Updated last month
helgeho / ArchiveSpark
View on GitHub
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed…
☆161Oct 8, 2025Updated 9 months ago
informagi / mmead
View on GitHub
MS Marco Entity Annotations Disambiguation
☆14May 19, 2023Updated 3 years ago
webrecorder / web-archive-site-mirror
View on GitHub
☆17Apr 16, 2026Updated 3 months ago
maturban / WARCMerge
View on GitHub
Merging WARCs into a single WARC file
☆15Aug 29, 2014Updated 11 years ago
oduwsdl / MemGator
View on GitHub
A Memento Aggregator CLI and Server in Go
☆80Apr 9, 2026Updated 3 months ago
ArchiveBox / debian-archivebox
View on GitHub
Home of the official apt/deb package for Ubuntu/Debian-based systems.
☆17Updated this week
helgeho / Web2Warc
View on GitHub
An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)
☆26Oct 9, 2017Updated 8 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alard / megawarc
View on GitHub
Nondestructive warc-in-tar to warc conversion
☆27Apr 21, 2013Updated 13 years ago
ikreymer / certauth
View on GitHub
Simple CertificateAuthority and host certificate creation, useful for man-in-the-middle HTTPS proxy
☆25Sep 29, 2022Updated 3 years ago
WASAPI-Community / data-transfer-apis
View on GitHub
WASAPI data transfer APIs
☆50Apr 23, 2022Updated 4 years ago
brentr / viless
View on GitHub
Tiny vi text editor clone with enough features to be truly useful
☆16Feb 14, 2024Updated 2 years ago
jasiek / dockerized-kiwix-server
View on GitHub
Your own wikipedia server in a box.
☆23Oct 3, 2018Updated 7 years ago
nareike / adhs
View on GitHub
Ad-hoc light weight SPARQL endpoint from a file, using Python Flask and RDFlib
☆15Oct 24, 2016Updated 9 years ago
anjackson / sliver
View on GitHub
A tool for collection archival slivers of the web and web archives
☆19Jun 1, 2026Updated last month