mediawiki-client-tools / mediawiki-dump-generator
Python 3 tools for downloading and preserving wikis
☆118 Updated 3 weeks ago
Alternatives and similar repositories for mediawiki-dump-generator
Users who are interested in mediawiki-dump-generator are comparing it to the repositories listed below
- archiving MediaWikis (and uploading wikidump to the Internet Archive) ☆58 Updated last week
- A command line tool to archive a git repository from GitHub to the Internet Archive. ☆94 Updated 4 years ago
- Archiving public telegram messages. ☆13 Updated this week
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now ☆131 Updated 4 months ago
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆125 Updated 7 months ago
- Scripts to build and boot warrior virtual machine containing Docker ☆119 Updated 4 months ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file ☆382 Updated last week
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more … ☆309 Updated this week
- A tool for archiving DokuWiki ☆21 Updated last week
- Record Discord traffic via mitmproxy and export chatlogs to JSON or HTML. ☆100 Updated 2 months ago
- Specifications developed and maintained by the Webrecorder community. ☆132 Updated 7 months ago
- A collection of tools for archiving and analysing the internet. ☆77 Updated 3 years ago
- Archiving imgur. ☆62 Updated this week
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆64 Updated 4 months ago
- ☆51 Updated last year
- Grabbing everything from reddit. ☆61 Updated last year
- Discord archiver ☆64 Updated last year
- Archiving all metadata from YouTube (everything except videos themselves due to size) ☆27 Updated 3 weeks ago
- Web frontend to browse the SponsorBlock database written with Django ☆46 Updated 6 months ago
- 🕸 A frontend for the Wayback Machine which works on old browsers ☆104 Updated 2 weeks ago
- Tool and library for handling Web ARChive (WARC) files. ☆162 Updated 10 months ago
- Alternative Twitter front-end ☆159 Updated last year
- Wombat.js client-side rewriting library ☆101 Updated 3 weeks ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service ☆180 Updated 9 months ago
- Wayback Machine Downloader. 🔥 Download your entire archived websites from the Internet Archive Wayback Machine. ☆97 Updated 3 years ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web. ☆166 Updated last month
- Centralised repository for WARC usage specifications. ☆115 Updated 8 months ago
- A cross-browser extension for Twitter to bring @WhatIsAWomanBot features into your browser. ☆22 Updated 9 months ago
- A simple 88x31 button maker ☆45 Updated 2 years ago
- nitter is back - archive of fork for nitter for keeping nitter working on xcancel.com ☆72 Updated 5 months ago