henu / bigjson
Python library that reads JSON files of any size.
☆194 · Updated last year
Related projects
Alternatives and complementary repositories for bigjson
- A fast streaming JSON parser for Python that generates SAX-like events using yajl ☆221 · Updated last month
- Python library to simplify working with jsonlines and ndjson data ☆273 · Updated 3 months ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data… ☆88 · Updated 11 months ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database… ☆53 · Updated last year
- Abydos NLP/IR library for Python ☆183 · Updated 2 years ago
- Fast Autocomplete: When Elasticsearch suggestions are not fast and flexible enough ☆271 · Updated last year
- Common interface for data container classes ☆62 · Updated 3 weeks ago
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆149 · Updated last year
- Convert SQL to SQLAlchemy expressions ☆157 · Updated last year
- Python binding to the Poppler-cpp PDF library ☆97 · Updated 2 months ago
- ☆165 · Updated 5 months ago
- ☆446 · Updated this week
- URLExtract is a Python class for collecting (extracting) URLs from given text based on locating TLDs. ☆246 · Updated 8 months ago
- Simple, easy-to-use throttler for asyncio. ☆118 · Updated 2 years ago
- Extract text from HTML ☆130 · Updated 4 years ago
- Find parts of long text or data, allowing for some changes/typos. ☆311 · Updated 3 months ago
- Iterative JSON parser with Pythonic interfaces ☆842 · Updated 2 weeks ago
- A Python library for working with and comparing language codes. ☆339 · Updated 7 months ago
- Parse numbers written in natural language ☆109 · Updated 3 weeks ago
- Helpers to use cachetools with async functions ☆94 · Updated 7 months ago
- A purely-functional HTML builder for Python. Think JSX rather than templates. ☆93 · Updated 3 months ago
- Python library implementing a trie data structure. ☆38 · Updated 7 months ago
- Python stemming library using Snowball stemmers ☆245 · Updated last month
- Language detection using spaCy and fastText ☆54 · Updated 10 months ago
- An open-source package for Python to clean raw text data ☆69 · Updated last year
- Extract city and country mentions from text, like GeoText, but using FlashText (an Aho-Corasick implementation) instead of regex. ☆60 · Updated this week
- A mutable set that remembers the order of its entries. One of Python's missing data types. ☆215 · Updated 3 months ago
- Library to populate items using XPath and CSS with a convenient API ☆45 · Updated 3 weeks ago
- A Python binding of SQLite Full Text Search Tokenizer ☆45 · Updated last month
- Pythonic search engine based on PyLucene. ☆120 · Updated 3 weeks ago