mediawiki-client-tools / mediawiki-dump-generator
Python 3 tools for downloading and preserving wikis
☆105Updated 4 months ago
Alternatives and similar repositories for mediawiki-dump-generator:
Users that are interested in mediawiki-dump-generator are comparing it to the libraries listed below
- archiving MediaWikis (and uploading wikidump to the Internet Archive)☆37Updated this week
- Archiving public telegram messages.☆12Updated last month
- A command line tool to archive a git repository from GitHub to the Internet Archive.☆92Updated 4 years ago
- Converts WARC files to static HTML☆43Updated 7 months ago
- Command line tool for digging into WARC files☆38Updated this week
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆117Updated last week
- Archiving imgur.☆64Updated 3 months ago
- A tool for archiving DokuWiki☆21Updated last month
- Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2024, WikiTeam has preserved more th…☆751Updated 5 months ago
- A tool for detecting viruses and NSFW material in WARC files☆11Updated 6 months ago
- Tool and library for handling Web ARChive (WARC) files.☆155Updated 4 months ago
- ☆41Updated 10 months ago
- fork for nitter for keeping nitter working on xcancel.com☆76Updated 3 weeks ago
- Specifications developed and maintained by the Webrecorder community.☆128Updated last month
- CDXJ Indexing of WARC/ARCs☆25Updated 2 months ago
- Warrior virtual machine appliance (version 4)☆24Updated 2 weeks ago
- A search interface and wayback machine for the UKWA Solr based warc-indexer framework.☆108Updated last month
- wabac.js - Web Archive Browsing Augmentation Client☆106Updated last week
- Use yt-dlp to download video/metadata and upload to the Internet Archive.☆435Updated last month
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more …☆238Updated this week
- Record Discord traffic via mitmproxy and export chatlogs to JSON or HTML.☆78Updated 9 months ago
- A collection of tools for archiving and analysing the internet.☆72Updated 2 years ago
- Discord archiver☆60Updated last year
- Centralised repository for WARC usage specifications.☆106Updated 3 months ago
- Archiving all metadata from YouTube (everything except videos themselves due to size)☆24Updated last month
- Nondestructive warc-in-tar to warc conversion☆26Updated 11 years ago
- ☆10Updated 3 years ago
- Web archive index server based on RocksDB☆34Updated 3 months ago
- ☆27Updated 2 years ago
- Scripts to build and boot warrior virtual machine containing Docker☆114Updated last year