henu / bigjson
Python library that reads JSON files of any size.
☆198Updated 2 years ago
Alternatives and similar repositories for bigjson:
Users that are interested in bigjson are comparing it to the libraries listed below
- A fast streaming JSON parser for Python that generates SAX-like events using yajl☆222Updated 7 months ago
- Pythonic search engine based on PyLucene.☆126Updated 5 months ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Parse natural language time expressions in python☆130Updated 2 years ago
- ☆169Updated last month
- Extract text from HTML☆135Updated 4 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆71Updated this week
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…☆58Updated 2 years ago
- python library to simplify working with jsonlines and ndjson data☆293Updated 9 months ago
- ☆68Updated last year
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆151Updated 3 months ago
- A Python implementation of Lunr.js 🌖☆195Updated last month
- Library for unit extraction - fork of quantulum for python3☆138Updated 10 months ago
- Accurately find/replace/remove emojis in text strings☆161Updated last year
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆93Updated 2 months ago
- Guess gender from first name in Python 2 and 3☆133Updated 2 years ago
- Prebuilt .whl files for MacOS + Linux of the Facebook FAISS library☆56Updated 3 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆465Updated 3 months ago
- An open-source package for python to clean raw text data☆69Updated last year
- ☆70Updated 2 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆70Updated last year
- Wikidata client library for Python☆355Updated 9 months ago
- A flexible utility for flattening and unflattening dict-like objects in Python.☆182Updated 3 years ago
- Python package for lexicon; Trie and DAWG implementation.☆55Updated 5 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆66Updated 2 years ago
- Super lightweight function registries for your library☆179Updated 11 months ago
- Plac: Parsing the Command Line the Easy Way☆297Updated last month
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated last week
- xmlsjon converts XML into Python dictionary structures (trees, like in JSON) and vice-versa.☆123Updated 2 years ago
- Python library to infer date format from examples☆43Updated 3 years ago