Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to the tiniest wikis. As of 2026, WikiTeam has preserved more than 600,000 wikis.
☆826 · Jan 10, 2026 · Updated 2 months ago
Alternatives and similar repositories for wikiteam
Users who are interested in wikiteam are comparing it to the libraries listed below.
- git svn clone of https://code.google.com/p/wikiteam/ ☆13 · Mar 6, 2016 · Updated 10 years ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file ☆440 · Mar 20, 2026 · Updated last week
- Wget-compatible web downloader and crawler. ☆603 · Apr 29, 2024 · Updated last year
- The archivist's web crawler: WARC output, dashboard for all crawls, dynamic ignore patterns ☆1,561 · May 23, 2025 · Updated 10 months ago
- Scraping bhinneka.com, just for fun ☆14 · Jan 28, 2013 · Updated 13 years ago
- A static web site generator using MediaWiki. ☆14 · Dec 27, 2021 · Updated 4 years ago
- Downloads all pages from a MediaWiki install to local text files. ☆12 · Jan 23, 2024 · Updated 2 years ago
- archiving MediaWikis (and uploading wikidump to the Internet Archive) ☆87 · Dec 24, 2025 · Updated 3 months ago
- Tools for tracking changes to various Genealogy websites collection of record collections ☆18 · Updated this week
- Use yt-dlp to download video/metadata and upload to the Internet Archive. ☆480 · Mar 15, 2026 · Updated last week
- A Python and Command-Line Interface to Archive.org ☆1,844 · Feb 24, 2026 · Updated last month
- Core Python Web Archiving Toolkit for replay and recording of web archives ☆1,637 · Updated this week
- Convert MediaWiki XML backup into structured raw text file tree ☆16 · Sep 18, 2015 · Updated 10 years ago
- The little things give you away... A collection of various small helper stuff – Mirror repo only, no longer kept in sync, refer to gitea.… ☆24 · Sep 11, 2020 · Updated 5 years ago
- ArchiveBot, an IRC bot for archiving websites ☆408 · Aug 6, 2025 · Updated 7 months ago
- Last Writer Slicing: data provenance tracking for concurrent program debugging & analysis ☆13 · Nov 14, 2014 · Updated 11 years ago
- An Awesome List for getting started with web archiving ☆2,512 · Mar 18, 2026 · Updated last week
- List of data-hoarding related tools ☆1,287 · Sep 14, 2023 · Updated 2 years ago
- Webrecorder Player for Desktop (OSX/Windows/Linux). (Built with Electron + Webrecorder) ☆448 · Sep 17, 2020 · Updated 5 years ago
- Framework of tools and libraries for building and running bots on Wikipedia ☆25 · Feb 21, 2026 · Updated last month
- The OpenWayback Development ☆516 · Jan 3, 2024 · Updated 2 years ago
- A tool for archiving DokuWiki ☆28 · Jan 30, 2026 · Updated last month
- Exporting MediaWiki content to HTML ☆34 · Sep 30, 2023 · Updated 2 years ago
- A command line client for MediaWiki wikis. ☆14 · Jan 28, 2025 · Updated last year
- Download an entire website from the Wayback Machine. ☆5,834 · Feb 8, 2024 · Updated 2 years ago
- Gate between Git and Mediawiki ☆187 · Jan 27, 2026 · Updated 2 months ago
- WARC writing MITM HTTP/S proxy ☆447 · Feb 3, 2026 · Updated last month
- Import entities from another Wikibase instance (e.g. Wikidata) ☆13 · May 21, 2023 · Updated 2 years ago
- Deploy MediaWiki on AWS using Elastic Beanstalk ☆10 · Oct 19, 2017 · Updated 8 years ago
- Boot scripts for the ArchiveTeam Warrior 2 ☆26 · Jul 5, 2025 · Updated 8 months ago
- Centralised repository for WARC usage specifications. ☆125 · Oct 12, 2025 · Updated 5 months ago
- A server to collect & archive websites that also supports video downloads ☆84 · Feb 11, 2023 · Updated 3 years ago
- Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. ☆3,208 · Mar 10, 2026 · Updated 2 weeks ago
- Wget with Lua extension ☆24 · Dec 17, 2015 · Updated 10 years ago
- Bash script to retrieve modules from modarchive.org ☆11 · Jan 24, 2021 · Updated 5 years ago
- Makes Wikibase data available in Semantic MediaWiki ☆18 · Apr 3, 2025 · Updated 11 months ago
- 🌻 The collaborative editing software that runs Wikipedia. Mirror from https://gerrit.wikimedia.org/g/mediawiki/core. See https://mediawi… ☆5,006 · Updated this week
- Archiving GitHub ☆11 · Aug 5, 2025 · Updated 7 months ago
- A configurable, reusable tracker with dashboard ☆36 · Dec 15, 2023 · Updated 2 years ago