henu / bigjsonLinks
Python library that reads JSON files of any size.
β196Updated 2 years ago
Alternatives and similar repositories for bigjson
Users that are interested in bigjson are comparing it to the libraries listed below
Sorting:
- python library to simplify working with jsonlines and ndjson dataβ303Updated last year
- π¦ Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)β474Updated 9 months ago
- Fast Autocomplete: When Elastcsearch suggestions are not fast and flexible enoughβ286Updated last month
- Pythonic search engine based on PyLucene.β130Updated 2 weeks ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)β154Updated 2 years ago
- β174Updated 7 months ago
- Accurately find/replace/remove emojis in text stringsβ162Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacingβ76Updated last month
- A Python implementation of Lunr.js πβ200Updated 7 months ago
- Language detection using Spacy and Fasttextβ57Updated last year
- Python API for PDF documentsβ124Updated last year
- Find strings/words in text; convenience and C speedβ127Updated 3 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified databaseβ¦β58Updated 2 years ago
- Parse natural language time expressions in pythonβ131Updated 2 years ago
- Library for unit extraction - fork of quantulum for python3β143Updated last year
- Find parts of long text or data, allowing for some changes/typos.β333Updated 5 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)β67Updated 2 years ago
- Abydos NLP/IR library for Pythonβ191Updated 2 years ago
- rstr is a helper module for easily generating random strings of various types. It could be useful for fuzz testing, generating dummy dataβ¦β94Updated 8 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any otheβ¦β68Updated 3 years ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 3 years ago
- Simple, Pythonic extraction of text, shapes and images from PDFsβ80Updated 5 years ago
- Python package for lexicon; Trie and DAWG implementation.β55Updated 11 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.β97Updated 2 years ago
- Python binding to Poppler-cpp pdf libraryβ113Updated last year
- Bounded Process&Thread Pool Executorβ63Updated last year
- Guess gender from first name in Python 2 and 3β137Updated 5 months ago
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.β62Updated this week
- π£ A lightweight console printing and formatting toolkitβ465Updated last year
- Pure-Python full-text search libraryβ643Updated last year