ahupp / bktree
Implementation of Burkhard-Keller trees in various languages
☆52 Updated 14 years ago
Related projects
Alternatives and complementary repositories for bktree
- C++ Ternary Search Tree implementation with Python bindings ☆43 Updated 6 years ago
- Python search module for fast approximate string matching ☆53 Updated last year
- HAT-Trie for Python ☆86 Updated 8 years ago
- Roaring Bitmap in Cython ☆79 Updated 5 months ago
- A disk-based key/value store in Python with no dependencies. ☆21 Updated 9 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob. ☆103 Updated 9 years ago
- Python bindings for Google's FarmHash ☆37 Updated 2 months ago
- Statistical Dependency Parser using SVM, as proposed by Yamada et al. ☆29 Updated 8 years ago
- Unofficial Git mirror of the http://svn.whoosh.ca SVN repo ☆49 Updated 14 years ago
- A pure Python implementation of the Aho-Corasick algorithm. ☆23 Updated 6 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages. ☆63 Updated 4 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3 ☆50 Updated 9 years ago
- Simhashing in C++ ☆134 Updated last year
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD] ☆109 Updated 11 years ago
- A Cython implementation of the affine gap string distance ☆58 Updated last year
- Python Set subclass that supports searching by ngram similarity ☆120 Updated 3 years ago
- Efficiently search the most similar strings against the query in Python. ☆18 Updated 6 years ago
- A memory-based, optional-persistence naïve Bayesian text classifier. ☆35 Updated 9 years ago
- Memory-efficient Count-Min Sketch Counter (based on Madoka C++ library) ☆25 Updated 5 years ago
- A partially and lazily sorted list data structure for Python ☆124 Updated 4 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig… ☆100 Updated 9 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial ☆30 Updated 9 years ago
- An implementation of Word2Vec (Skip-gram and CBOW) models using Theano and NumPy ☆27 Updated 8 years ago
- Google all-pairs similarity search package, with SWIG bindings ☆23 Updated 9 years ago
- A GBDT (MART) and LambdaMART training and prediction package ☆15 Updated 9 years ago
- Python BK-tree data structure to allow fast querying of "close" matches ☆172 Updated 3 years ago
- Python API for Various DB-Backed Simhash Clusters ☆64 Updated 7 years ago
- Interesting (non-cryptographic) hashes implemented in pure Python. ☆240 Updated 3 years ago