henu / bigjsonLinks
Python library that reads JSON files of any size.
☆197Updated 2 years ago
Alternatives and similar repositories for bigjson
Users that are interested in bigjson are comparing it to the libraries listed below
Sorting:
- ☆178Updated 10 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- Pythonic search engine based on PyLucene.☆132Updated last month
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆68Updated 2 years ago
- Language detection using Spacy and Fasttext☆57Updated 2 years ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough☆287Updated 5 months ago
- Parse natural language time expressions in python☆131Updated 3 years ago
- A Python implementation of Lunr.js 🌖☆204Updated 11 months ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆482Updated 2 months ago
- Find parts of long text or data, allowing for some changes/typos.☆338Updated 2 months ago
- Python 3 library for reading and writing warc files☆21Updated 8 years ago
- Convert number words (eg. twenty one) to numeric digits (21)☆180Updated 2 years ago
- Guess gender from first name in Python 2 and 3☆139Updated 8 months ago
- python library to simplify working with jsonlines and ndjson data☆307Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated 2 weeks ago
- Accurately find/replace/remove emojis in text strings☆163Updated 2 years ago
- Abydos NLP/IR library for Python☆194Updated 3 years ago
- Lightning Fast Language Prediction 🚀☆167Updated 5 months ago
- Python API for PDF documents☆124Updated last year
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Parse numbers written in natural language☆126Updated last year
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated 2 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…☆59Updated 3 years ago
- Super lightweight function registries for your library☆181Updated last year
- Python package for lexicon; Trie and DAWG implementation.☆56Updated last year
- Price and currency parsing utility☆27Updated 2 years ago
- A flexible utility for flattening and unflattening dict-like objects in Python.☆185Updated 3 years ago
- 🍣 A lightweight console printing and formatting toolkit☆468Updated last year