openzim / zimfarm
Farm operated by bots to grow and harvest new zim files
☆128 Updated this week
Alternatives and similar repositories for zimfarm
Users who are interested in zimfarm are comparing it to the repositories listed below.
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file ☆407 Updated this week
- Various ZIM command line tools ☆179 Updated last week
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆72 Updated 7 months ago
- Kiwix & openZIM build engine ☆108 Updated last week
- Create a ZIM file from a Youtube channel/username/playlist ☆80 Updated last week
- We back up a lot of stuff from around the web; now it's time to back up the Internet Archive, just in case. ☆92 Updated 5 years ago
- creates ZIM files for Kiwix from arbitrary websites with wget and some nifty tricks (doesn't need ServiceWorkers) ☆101 Updated 4 months ago
- [ARCHIVED] Kiwix Hotspot Image Creator (Desktop) for Windows/macOS/Linux ☆71 Updated last year
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication. ☆129 Updated 2 months ago
- freeyourstuff.cc - universal content liberation ☆81 Updated 2 years ago
- A list of things related to software, literature, and other content for 🕣 Memento ☆102 Updated last year
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler. ☆51 Updated last week
- ☆90 Updated 6 months ago
- A command line tool to archive a git repository from GitHub to the Internet Archive. ☆91 Updated 4 years ago
- Collection of Python code to re-use across Python-based scrapers ☆24 Updated this week
- Kiwix Catalog BitTorrent Seeder Companion ☆14 Updated 2 months ago
- Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC ☆105 Updated last year
- This is the web portal for Snikket Chat services. To learn more about what Snikket Chat services are, check the website. ☆43 Updated last week
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents) ☆167 Updated 2 months ago
- Want a new ZIM file? Propose ZIM content improvements or fixes? Here you are! ☆61 Updated 2 weeks ago
- Nondestructive warc-in-tar to warc conversion ☆27 Updated 12 years ago
- The ArchiveWeb.page Site ☆30 Updated last week
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format ☆54 Updated 7 years ago
- A bot written in Python3 that mirrors YouTube channels to PeerTube channels as videos are released in a YouTube channel. ☆141 Updated 2 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project ☆55 Updated 3 months ago
- View the history of public and world readable Matrix rooms ☆78 Updated last year
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved ☆30 Updated last month
- Browsertrix is the hosted, high-fidelity, browser-based crawling service from Webrecorder designed to make web archiving easier and more … ☆352 Updated this week
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces. ☆59 Updated last year
- Lokal, offline first, content and services - for and by local communities. ☆50 Updated this week