pirate / wikipedia-mirror
π Guide and tools to run a full offline mirror of Wikipedia.org with three different approaches: Nginx caching proxy, Kiwix + ZIM dump, and MediaWiki/XOWA + XML dump
β359Updated 3 years ago
Related projects β
Alternatives and complementary repositories for wikipedia-mirror
- Make a ZIM file from any Web site and surf offline!β359Updated this week
- A Dockerfile for the ArchiveTeam Warriorβ306Updated this week
- π A Docker Compose bundle to run on servers with spare CPU, RAM, disk, and bandwidth to help the world. Includes Tor, ArchiveWarrior, Bβ¦β320Updated 6 months ago
- Command line Kiwix tools: kiwix-serve, kiwix-manage, ...β468Updated last month
- ArchiveBot, an IRC bot for archiving websitesβ359Updated last month
- Scripts to build and boot warrior virtual machine containing Dockerβ114Updated 11 months ago
- Mediawiki scraper: all your wiki articles in one highly compressed ZIM fileβ292Updated this week
- Offline Internet Archive projectβ273Updated 9 months ago
- The OpenWayback Developmentβ486Updated 10 months ago
- Library of Alexandria (LoA in short) is a project that aims to collect and archive documents from the internet.β112Updated 4 months ago
- Various ZIM command line toolsβ133Updated 2 weeks ago
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case.β87Updated 4 years ago
- Web Archiving Integration Layer: One-Click User Instigated Preservationβ350Updated last month
- Lightning-fast file system indexer and search toolβ887Updated 2 months ago
- Create a ZIM file from a Youtube channel/username/playlistβ52Updated 2 weeks ago
- Wget-compatible web downloader and crawler.β557Updated 6 months ago
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder)β438Updated 4 years ago
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patternsβ1,402Updated 4 months ago
- A Tool To Push Web Resources Into Web Archivesβ410Updated 9 months ago
- Downloads and hosts posts from a subreddit/subreddits of your choiceβ55Updated 6 years ago
- Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2024, WikiTeam has preserved more thβ¦β729Updated 2 months ago
- Chrome extension to "Create WARC files from any webpage"β211Updated 11 months ago
- A script to check `download.kiwix.org` for updates to your local ZIM library.β83Updated 2 months ago
- Create a searx instance using Dockerβ406Updated 2 years ago
- InterPlanetary Wayback: A distributed and persistent archive replay system using IPFSβ617Updated last week
- An Awesome List for getting started with web archivingβ2,057Updated 2 weeks ago
- USENET-inspired, uncensorable, decentralized internet discussion system running on IPFS & OrbitDBβ720Updated 3 months ago
- Core Python Web Archiving Toolkit for replay and recording of web archivesβ1,410Updated last week
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more β¦β201Updated this week
- Tool and library for handling Web ARChive (WARC) files.β150Updated last month