henu / bigjsonLinks
Python library that reads JSON files of any size.
☆196Updated 2 years ago
Alternatives and similar repositories for bigjson
Users that are interested in bigjson are comparing it to the libraries listed below
Sorting:
- Pythonic search engine based on PyLucene.☆131Updated last week
- Language detection using Spacy and Fasttext☆57Updated 2 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database…☆58Updated 2 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆68Updated 2 years ago
- ☆176Updated 9 months ago
- python library to simplify working with jsonlines and ndjson data☆306Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆155Updated 2 years ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enough☆287Updated 4 months ago
- Accurately find/replace/remove emojis in text strings☆163Updated 2 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆480Updated last month
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated last month
- Python 3 library for reading and writing warc files☆21Updated 7 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆68Updated 3 years ago
- Find parts of long text or data, allowing for some changes/typos.☆334Updated last month
- Library for unit extraction - fork of quantulum for python3☆145Updated last year
- Lightning Fast Language Prediction 🚀☆167Updated 4 months ago
- A Python implementation of Lunr.js 🌖☆202Updated 9 months ago
- Abydos NLP/IR library for Python☆193Updated 3 years ago
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity☆76Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy data…☆98Updated 10 months ago
- Parse natural language time expressions in python☆131Updated 3 years ago
- A purely-functional HTML builder for Python. Think JSX rather than templates.☆102Updated 11 months ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆142Updated last year
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- Convert number words (eg. twenty one) to numeric digits (21)☆180Updated 2 years ago
- 🍣 A lightweight console printing and formatting toolkit☆467Updated last year
- Find strings/words in text; convenience and C speed☆126Updated 3 years ago
- ☆70Updated 3 years ago
- Weighted Levenshtein library☆113Updated last month