openzim / python-libzimLinks
Libzim binding for Python: read/write ZIM files in Python
☆94Updated last week
Alternatives and similar repositories for python-libzim
Users that are interested in python-libzim are comparing it to the libraries listed below
Sorting:
- Create a ZIM file from a Youtube channel/username/playlist☆81Updated this week
- Various ZIM command line tools☆179Updated 2 weeks ago
- Python wrapper for the MediaWiki API to access and parse data from Wikipedia☆42Updated 2 months ago
- A set of utilities for processing MediaWiki XML dump data.☆58Updated 9 months ago
- Automated behaviors that run in browser to interact with complex sites automatically. Used by ArchiveWeb.page and Browsertrix Crawler.☆52Updated this week
- Command line tool to convert a file in the WARC format to a file in the ZIM format☆73Updated 8 months ago
- Atom, RSS and JSON feed parser for Python 3☆117Updated 3 years ago
- Collection of Python code to re-use across Python-based scrapers☆24Updated last week
- Scraper for downloading the entire ebooks repository of project Gutenberg☆153Updated this week
- An experimental Python parser for MediaWiki syntax with a focus on extensibility and comprehensibility☆60Updated 3 years ago
- Pure python implementation of identifying files based off their magic numbers☆221Updated 4 months ago
- Standalone version of Django's feedgenerator module☆55Updated 3 months ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- search interface for scholarly works☆85Updated last year
- modulegraph determines a dependency graph between Python modules primarily by bytecode analysis for import statements. modulegraph …☆46Updated 2 years ago
- A modern CSS selector implementation for BeautifulSoup☆250Updated 2 months ago
- Python client library to interface with the MediaWiki API☆338Updated last week
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python.☆225Updated last week
- Fast and robust date extraction from web pages, with Python or on the command-line☆142Updated 2 weeks ago
- Python library for reading and writing warc files☆245Updated 3 years ago
- fasttext with wheels and no external dependency, but only the predict method (<1MB)☆18Updated 11 months ago
- SQLite3 DB-API 2.0 driver from Python 3, packaged separately, with improvements☆223Updated 7 months ago
- Python API for PDF documents☆125Updated last year
- Training scripts for Argos Translate☆146Updated this week
- Command line tools and libraries for handling and manipulating WARC files (and HTTP contents)☆167Updated 3 months ago
- MediaWiki scraper: all your wiki articles in one highly compressed ZIM file☆409Updated this week
- A Python implementation of Lunr.js 🌖☆201Updated 8 months ago
- A python package for grapheme aware string handling☆114Updated 3 years ago
- ISO 639 library for Python☆35Updated last year
- Fast PDF generation and compression. Deals with millions of pages daily.☆125Updated 2 months ago