openzim / python-libzimLinks
Libzim binding for Python: read/write ZIM files in Python
☆90Updated 2 months ago
Alternatives and similar repositories for python-libzim
Users that are interested in python-libzim are comparing it to the libraries listed below
Sorting:
- Collection of Python code to re-use across Python-based scrapers☆24Updated 2 months ago
- Farm operated by bots to grow and harvest new zim files☆108Updated this week
- Create a ZIM file from a Youtube channel/username/playlist☆74Updated 2 weeks ago
- Reference implementation of the ZIM specification☆191Updated 3 weeks ago
- Various ZIM command line tools☆166Updated 3 weeks ago
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆61Updated 3 months ago
- Standalone version of Django's feedgenerator module☆52Updated last year
- An easy to use offline reader for ZIM files right in your browser!☆80Updated last year
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- An experimental Python parser for MediaWiki syntax with a focus on extensibility and comprehensibility☆61Updated 2 years ago
- Kiwix & openZIM build engine☆102Updated 2 weeks ago
- StackExchange websites to ZIM scraper☆229Updated 3 weeks ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆43Updated last week
- A framework for quick web archiving; canonical repository: https://gitea.arpa.li/JustAnotherArchivist/qwarc☆28Updated 4 years ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 2 years ago
- Generation of bilingual dictionaries from Wiktionary/dbnary data for the WikDict project☆51Updated 8 months ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file☆369Updated this week
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆41Updated 3 weeks ago
- Python API for PDF documents☆123Updated 10 months ago
- A Python binding of SQLite Full Text Search Tokenizer☆48Updated 2 months ago
- modulegraph determines a dependency graph between Python modules primarily by bytecode analysis for import statements. modulegraph …☆45Updated last year
- A toolchain of tasks for sequencing and fingerprinting book fulltext☆46Updated 10 months ago
- Training scripts for Argos Translate☆133Updated last month
- Fast and robust date extraction from web pages, with Python or on the command-line☆133Updated 6 months ago
- A polite and user-friendly downloader for Common Crawl data☆50Updated last week
- Simplified, fast RSS parsing library in Python☆141Updated 11 months ago
- A modern CSS selector implementation for BeautifulSoup☆244Updated 2 months ago
- A python package for grapheme aware string handling☆112Updated 3 years ago
- Simple bencode parser (for Python 2, Python 3 and PyPy)☆55Updated 2 years ago
- A super-lightweight IPC (Inter-Process Communication) protocol over TCP socket.☆24Updated 3 years ago