openzim / zimfarmLinks
Farm operated by bots to grow and harvest new zim files
☆111Updated this week
Alternatives and similar repositories for zimfarm
Users that are interested in zimfarm are comparing it to the libraries listed below
Sorting:
- Various ZIM command line tools☆171Updated 2 months ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆65Updated 5 months ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file☆385Updated this week
- Kiwix & openZIM build engine☆103Updated last month
- Create a ZIM file from a Youtube channel/username/playlist☆76Updated this week
- Reference implementation of the ZIM specification☆199Updated 3 weeks ago
- Common code base for all Kiwix ports☆148Updated this week
- Libzim binding for Python: read/write ZIM files in Python☆92Updated 4 months ago
- creates ZIM files for Kiwix from arbitrary websites with wget and some nifty tricks (doesn't need ServiceWorkers)☆91Updated 2 months ago
- This is the web portal for Snikket Chat services. To learn more about what Snikket Chat services are, check the website.☆40Updated 2 weeks ago
- A command line tool to archive a git repository from GitHub to the Internet Archive.☆91Updated 4 years ago
- Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC☆97Updated last year
- [ARCHIVED] Kiwix Hotspot Image Creator (Desktop) for Windows/macOS/Linux☆72Updated last year
- A list of things related to software, literature, and other content for 🕣 Memento☆99Updated last year
- Wget-AT is a modern Wget with Lua hooks, Zstandard (+dictionary) WARC compression and URL-agnostic deduplication.☆127Updated 7 months ago
- ☆89Updated 4 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆47Updated this week
- ArchiveBot, an IRC bot for archiving websites☆395Updated 2 weeks ago
- StackExchange websites to ZIM scraper☆231Updated 2 months ago
- Nondestructive warc-in-tar to warc conversion☆27Updated 12 years ago
- 🎭 An introduction to the Internet Archiving ecosystem, tooling, and some of the ethical dilemmas that the community faces.☆58Updated last year
- Offline Internet Archive project☆289Updated last year
- freeyourstuff.cc - universal content liberation☆80Updated 2 years ago
- Want a new ZIM file? Propose ZIM content improvements or fixes? Here you are!☆59Updated last month
- wpull fork with fixes and faster parsing using html5-parser; used by grab-site; should go away when wpull is similarly improved☆30Updated last year
- View the history of public and world readable Matrix rooms☆78Updated last year
- The ArchiveWeb.page Site☆31Updated 8 months ago
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆162Updated last week
- Collection of Python code to re-use across Python-based scrapers☆25Updated 3 months ago
- A listing of world wide web archives, for humans and machines using Web Archive Manifest (WAM) yaml format☆53Updated 2 years ago