benhoyt / pybktreeLinks
Python BK-tree data structure to allow fast querying of "close" matches
☆186Updated 4 years ago
Alternatives and similar repositories for pybktree
Users that are interested in pybktree are comparing it to the libraries listed below
Sorting:
- A simple fuzzy matching set for python strings☆230Updated last year
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆303Updated last year
- Roaring Bitmap in Cython☆82Updated last year
- A module for getting data into python from large data sources☆176Updated last year
- Levenshtein and Hamming distance computation☆116Updated 6 years ago
- A fast and memory-optimized string library for heavy-text manipulation in Python☆252Updated 5 years ago
- Python package to compress numerical series & numpy arrays into strings☆99Updated 3 years ago
- 🎡 Automated build repo for Python wheels and source packages☆172Updated last year
- All-pair set similarity search on millions of sets in Python and on a laptop☆603Updated 3 years ago
- Fast, efficiently stored Trie for Python. Uses libdatrie.☆544Updated last week
- Python library for handling efficiently sorted integer sets.☆225Updated last month
- Fast multi-keyword search engine for text strings☆257Updated last year
- 💥 Cython hash tables that assume keys are pre-hashed☆87Updated this week
- A Generalized Suffix Tree for any Python iterable using Ukkonen's algorithm, with Lowest Common Ancestor retrieval.☆54Updated 2 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Updated last month
- 145+ extra higher-level functional tools beyond standard library's `itertools`, `functools`, etc. and popular third-party libraries like …☆159Updated 6 months ago
- 🐍 Python library implementing sorted containers with state-of-the-art query performance and compressed memory usage☆215Updated last year
- Python library to calculate the difference hash (perceptual hash) for a given image, useful for detecting duplicates☆373Updated last year
- Python stream processing for humans☆185Updated last month
- Fast and customizable tokenization☆66Updated 6 years ago
- Python bindings for the SQLite4 LSM database.☆132Updated 4 months ago
- Python Data Processing library☆102Updated last year
- persistent caching to memory, disk, or database☆277Updated 3 weeks ago
- Parse natural language time expressions in python☆131Updated 2 years ago
- A high performance python hash table library that is generally faster and consumes significantly less memory than Python Dictionaries☆214Updated 2 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Weighted Levenshtein library☆113Updated 2 years ago
- Regular Expression based parsers for extracting data from natural languages☆71Updated 8 years ago
- Python library to infer date format from examples☆45Updated 4 years ago
- A python library to manipulate and transform indexable data (lists, arrays, ...)☆46Updated last year