henu / bigjson
Python library that reads JSON files of any size.
β197Updated 2 years ago
Alternatives and similar repositories for bigjson:
Users that are interested in bigjson are comparing it to the libraries listed below
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ68Updated 2 weeks ago
- A fast streaming JSON parser for Python that generates SAX-like events using yajlβ221Updated 4 months ago
- A Python implementation of Lunr.js πβ196Updated last month
- Simple redis cache for Python functionsβ106Updated 5 months ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified databaseβ¦β56Updated 2 years ago
- Python interface to Apache PDFBox command-line tools.β75Updated 2 years ago
- Language detection using Spacy and Fasttextβ55Updated last year
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)β65Updated last year
- Accurately find/replace/remove emojis in text stringsβ160Updated last year
- Parse natural language time expressions in pythonβ131Updated 2 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.β96Updated last year
- Multi-Langauge Identificationβ29Updated 6 months ago
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β449Updated last month
- Simple, easy-to-use throttler for asyncio.β120Updated 2 years ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enoughβ278Updated 2 years ago
- β70Updated 2 years ago
- Implementation of phonetic algorithm in pythonβ40Updated 6 years ago
- Build and upload fastText Python wheels to PyPIβ23Updated last year
- Python 3 library for reading and writing warc filesβ21Updated 7 years ago
- ndjson with the same interface as the builtin json moduleβ68Updated 2 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β149Updated last year
- An open-source package for python to clean raw text dataβ69Updated last year
- Postal code geocoding and distance calculationβ240Updated last month
- Abydos NLP/IR library for Pythonβ184Updated 2 years ago
- Use ML-Annotate to label data for machine learning purposesβ107Updated 4 years ago
- A Cython implementation of the affine gap string distanceβ57Updated 2 years ago
- Make every function async and await-able.β98Updated 2 years ago
- Original, standard and customisable versions of the Jaro-Winkler functions.β32Updated 2 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarityβ65Updated last year
- Pythonic search engine based on PyLucene.β125Updated 2 months ago