henu / bigjsonLinks
Python library that reads JSON files of any size.
☆196Updated 2 years ago
Alternatives and similar repositories for bigjson
Users that are interested in bigjson are comparing it to the libraries listed below
Sorting:
- Language detection using Spacy and Fasttext☆57Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- ☆174Updated 6 months ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough☆286Updated last month
- Accurately find/replace/remove emojis in text strings☆162Updated last year
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆472Updated 8 months ago
- Pythonic search engine based on PyLucene.☆130Updated last month
- python library to simplify working with jsonlines and ndjson data☆302Updated last year
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆98Updated 2 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆67Updated 2 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…☆58Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Parse natural language time expressions in python☆131Updated 2 years ago
- Abydos NLP/IR library for Python☆191Updated 2 years ago
- Super lightweight function registries for your library☆180Updated last year
- A Python module to convert natural language numerics into ints and floats.☆231Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆75Updated last week
- Lightning Fast Language Prediction 🚀☆167Updated last month
- Find parts of long text or data, allowing for some changes/typos.☆330Updated 4 months ago
- A Python implementation of Lunr.js 🌖☆200Updated 7 months ago
- Library for unit extraction - fork of quantulum for python3☆142Updated last year
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆62Updated this week
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- A purely-functional HTML builder for Python. Think JSX rather than templates.☆101Updated 9 months ago
- ☆70Updated 2 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆74Updated last year
- Python 3 library for reading and writing warc files☆21Updated 7 years ago
- Python package for lexicon; Trie and DAWG implementation.☆55Updated 10 months ago
- A fast and memory-optimized string library for heavy-text manipulation in Python☆250Updated 5 years ago
- Bounded Process&Thread Pool Executor☆63Updated last year