originell / smaz-py3Links
Small string compression using smaz compression algorithm. Fast, because it's in C. Supports Python 3+
☆13Updated 2 months ago
Alternatives and similar repositories for smaz-py3
Users that are interested in smaz-py3 are comparing it to the libraries listed below
Sorting:
- Python bindings for the fast integer compression library FastPFor.☆61Updated last year
- Benchmark scripts for comparing different tokenizers and sentence segmenters of German☆12Updated 2 years ago
- 🔤 Measure edit distance based on keyboard layout☆61Updated last year
- Fast Text Classification with Compressors dictionary☆150Updated 2 years ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals …☆15Updated last year
- Backport of the pickle 5 protocol (PEP 574)☆33Updated 3 years ago
- Python JSON benchmarking and "correctness".☆34Updated 2 years ago
- ☆18Updated last year
- Libzim binding for Python: read/write ZIM files in Python☆92Updated 2 weeks ago
- A robust web archive analytics toolkit☆116Updated 5 months ago
- A high-performance library for compressed ndarrays, with a flexible computational engine☆169Updated this week
- Python bindings for xorfilter(faster and smaller than bloom and cuckoo filters)☆117Updated 3 weeks ago
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors`☆46Updated last year
- ☆21Updated 3 months ago
- High-performance Python runtime extensions☆34Updated this week
- Fast and vectorizable algorithms for searching in a vector of sorted floating point numbers☆150Updated 9 months ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.☆19Updated 2 weeks ago
- Get control over your imports -- no matter how you run your code☆48Updated 2 months ago
- A polite and user-friendly downloader for Common Crawl data☆57Updated last month
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- Write your code as tree-like expressions, then transform it☆21Updated last year
- Fast random access of gzip files in Python☆110Updated 2 weeks ago
- A python dictionary that uses Redis as in-memory storage backend to facilitate distributed computing applications development.☆23Updated 2 years ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- Bloom filter for Python☆43Updated 4 years ago
- Python bindings for simdjson using libpy☆69Updated 2 years ago
- Python bindings for RocksDB☆35Updated 3 years ago
- Simple implementation of a GPT (training and inference) in PyTorch.☆12Updated last year
- Efficiently computing & storing token n-grams from large corpora☆26Updated 11 months ago