mediawiki-client-tools / mediawiki-dump-generator
Python 3 tools for downloading and preserving wikis
☆102Updated 3 months ago
Alternatives and similar repositories for mediawiki-dump-generator:
Users that are interested in mediawiki-dump-generator are comparing it to the libraries listed below
- A tool for archiving DokuWiki☆21Updated 2 weeks ago
- Archiving public telegram messages.☆12Updated 2 weeks ago
- Archiving imgur.☆64Updated 2 months ago
- Scripts to build and boot warrior virtual machine containing Docker☆114Updated last year
- go-ia is a command-line interface for interacting with archive.org written in Go.☆61Updated 3 years ago
- Converts WARC files to static HTML☆43Updated 6 months ago
- A simple Python wrapper and command-line interface for archive.org’s "Save Page Now" capturing service☆170Updated 3 months ago
- Bash scripts which interact with Internet Archive Wayback Machine's Save Page Now☆111Updated 3 months ago
- Tool and library for handling Web ARChive (WARC) files.☆153Updated 3 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆48Updated last week
- A command line tool to archive a git repository from GitHub to the Internet Archive.☆91Updated 3 years ago
- ☆41Updated 9 months ago
- 🕸 A frontend for the Wayback Machine which works on old browsers☆89Updated 2 months ago
- Tools for downloading and preserving wikis. We archive wikis, from Wikipedia to tiniest wikis. As of 2024, WikiTeam has preserved more th…☆740Updated 4 months ago
- 🍨 High-fidelity, browser-based, single-page web archiving library and CLI for witnessing the web.☆124Updated this week
- Record Discord traffic via mitmproxy and export chatlogs to JSON or HTML.☆74Updated 8 months ago
- A collection of tools for archiving and analysing the internet.☆71Updated 2 years ago
- Command line tool written in Go for sorting and categorizing personal files like screenshots, recordings, logs and more.☆19Updated 2 years ago
- A tool for detecting viruses and NSFW material in WARC files☆11Updated 5 months ago
- Command line tool for digging into WARC files☆37Updated this week
- A Wikipedia gadget to a browser extension to display article contribution information. Powered by WikiWho.☆49Updated last month
- Wombat.js client-side rewriting library☆88Updated last month
- A Dockerfile for the ArchiveTeam Warrior☆309Updated 2 months ago
- A configurable, reusable tracker with dashboard☆34Updated last year
- fork for nitter for keeping nitter working on xcancel.com☆61Updated 3 weeks ago
- Specifications developed and maintained by the Webrecorder community.☆126Updated last week
- A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarc☆27Updated 3 years ago
- A static web site generator using MediaWiki.☆13Updated 3 years ago
- Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC☆78Updated 6 months ago