openzim / python-libzim
Libzim binding for Python: read/write ZIM files in Python
☆89 · Updated 2 months ago
Alternatives and similar repositories for python-libzim
Users who are interested in python-libzim compare it to the libraries listed below.
- Create a ZIM file from a YouTube channel/username/playlist ☆69 · Updated 2 months ago
- Collection of Python code to re-use across Python-based scrapers ☆24 · Updated last month
- Translate HTML using Argos Translate ☆51 · Updated last year
- Command line tool to convert a file in the WARC format to a file in the ZIM format ☆58 · Updated 3 months ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB) ☆16 · Updated 7 months ago
- Loadable spellfix1 extension for sqlite as python package ☆26 · Updated last year
- Python package for Wikimedia dump processing (Wiktionary, Wikipedia etc). Wikitext parsing, template expansion, Lua module execution. Fo… ☆102 · Updated last month
- Training scripts for Argos Translate ☆131 · Updated 2 weeks ago
- Fast Neural Machine Translation in C++ - development repository ☆19 · Updated last year
- A PDF classifier ensemble with REST API service ☆23 · Updated 4 years ago
- Miscellaneous scripts to gather and process data of wikis. ☆21 · Updated 2 years ago
- Standalone version of Django's feedgenerator module ☆52 · Updated last year
- An easy to use offline reader for ZIM files right in your browser! ☆80 · Updated last year
- Internet-in-a-Box (IIAB) Maps are like Google Maps but better, for schools especially, as they work offline (including satellite photos!)… ☆26 · Updated 2 years ago
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆207 · Updated last week
- ActivityPub server without JavaScript, designed for simplicity and accessibility. Includes calendar, news and sharing economy features to… ☆72 · Updated last week
- Next-generation Punkt sentence boundary detection with zero dependencies ☆17 · Updated 2 months ago
- A toolchain of tasks for sequencing and fingerprinting book fulltext ☆45 · Updated 10 months ago
- ZODB Client-Server framework ☆43 · Updated last month
- Automatically exported from code.google.com/p/guess-language ☆53 · Updated last year
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia ☆41 · Updated last week
- A Memento Client Library in Python ☆26 · Updated 7 years ago
- Python API for PDF documents ☆122 · Updated 9 months ago
- A package for removing tracing parameters from URLs. This package supports automatically updating filtering rules from Adguard. ☆15 · Updated 2 years ago
- A Python binding of SQLite Full Text Search Tokenizer ☆48 · Updated 2 months ago
- Wombat.js client-side rewriting library ☆97 · Updated last month
- A set of utilities for processing MediaWiki XML dump data. ☆54 · Updated 4 months ago
- Diff Match Patch is a high-performance library in multiple languages that manipulates plain text. ☆54 · Updated 5 months ago
- Faster, modernized fork of the language identification tool langid.py ☆56 · Updated 7 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ☆52 · Updated 3 years ago