openzim / python-libzimLinks
Libzim binding for Python: read/write ZIM files in Python
☆87Updated last month
Alternatives and similar repositories for python-libzim
Users that are interested in python-libzim are comparing it to the libraries listed below
Sorting:
- Create a ZIM file from a Youtube channel/username/playlist☆68Updated last month
- Various ZIM command line tools☆161Updated last month
- Standalone version of Django's feedgenerator module☆52Updated last year
- A Python implementation of Lunr.js 🌖☆195Updated 2 months ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- Farm operated by bots to grow and harvest new zim files☆105Updated this week
- A sentence segmentation library with wide language support optimized for speed and utility.☆65Updated 9 months ago
- An experimental Python parser for MediaWiki syntax with a focus on extensibility and comprehensibility☆61Updated 2 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB)☆16Updated 6 months ago
- Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+☆13Updated last year
- Translate HTML using Argos Translate☆50Updated last year
- Kiwix & openZIM build engine☆101Updated this week
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆40Updated 2 months ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- Python package for WikiMedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo…☆101Updated 2 weeks ago
- ZODB Client-Server framework☆43Updated last month
- A fastcgi handler for Python's `socketserver` classes☆19Updated 2 years ago
- A set of utilities for processing MediaWiki XML dump data.☆53Updated 3 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text.☆54Updated 5 months ago
- Centralised repository for WARC usage specifications.☆111Updated 6 months ago
- ISO 639 library for Python☆33Updated 9 months ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file☆358Updated this week
- Common code base for all Kiwix ports☆139Updated last week
- Fast Neural Machine Translation in C++ - development repository☆19Updated last year
- Streaming WARC/ARC library for fast web archive IO☆415Updated 5 months ago
- Pelican plugin that adds site search capability☆52Updated 8 months ago
- A robust web archive analytics toolkit☆108Updated 2 months ago
- Python difflib with parts reimplemented in C☆38Updated 4 months ago
- A Memento Aggregator CLI and Server in Go☆65Updated 3 months ago
- A polite and user-friendly downloader for Common Crawl data☆46Updated 3 weeks ago