henu / bigjson
Python library that reads JSON files of any size.
☆198 Updated 2 years ago
Alternatives and similar repositories for bigjson
Users who are interested in bigjson are comparing it to the libraries listed below.
- python library to simplify working with jsonlines and ndjson data ☆295 Updated 11 months ago
- Fast Autocomplete: When Elasticsearch suggestions are not fast and flexible enough ☆285 Updated 2 years ago
- Accurately find/replace/remove emojis in text strings ☆163 Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆152 Updated 2 years ago
- Language detection using Spacy and Fasttext ☆55 Updated last year
- ☆170 Updated 3 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆66 Updated 2 years ago
- Pythonic search engine based on PyLucene. ☆128 Updated 7 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆73 Updated last week
- Find parts of long text or data, allowing for some changes/typos. ☆325 Updated last month
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle) ☆471 Updated 5 months ago
- Abydos NLP/IR library for Python ☆186 Updated 2 years ago
- Convert number words (eg. twenty one) to numeric digits (21) ☆176 Updated last year
- A Python implementation of Lunr.js 🌖 ☆197 Updated 4 months ago
- An open-source package for python to clean raw text data ☆70 Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆68 Updated 2 years ago
- Parse numbers written in natural language ☆119 Updated 8 months ago
- Library for unit extraction - fork of quantulum for python3 ☆141 Updated last year
- Python library for fast approximate string matching using Jaro and Jaro-Winkler similarity ☆72 Updated last year
- Python port of Boilerpipe library ☆88 Updated 10 months ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database… ☆57 Updated 2 years ago
- Postal code geocoding and distance calculation ☆251 Updated this week
- Python API for PDF documents ☆123 Updated 10 months ago
- A Python module to convert natural language numerics into ints and floats. ☆228 Updated 9 months ago
- A Python library for working with and comparing language codes. ☆345 Updated 2 months ago
- Python package for lexicon; Trie and DAWG implementation. ☆55 Updated 7 months ago
- A purely-functional HTML builder for Python. Think JSX rather than templates. ☆100 Updated 6 months ago
- Find strings/words in text; convenience and C speed ☆126 Updated 2 years ago
- A simple fuzzy matching set for python strings ☆228 Updated 10 months ago
- Python 3 library for reading and writing warc files ☆20 Updated 7 years ago