julesjacobs / levenshtein
A simple proof of concept levenshtein automaton in Python
☆109Updated 9 years ago
Alternatives and similar repositories for levenshtein:
Users that are interested in levenshtein are comparing it to the libraries listed below
- Finite state dictionaries in Java☆130Updated 3 years ago
- Various utilities regarding Levenshtein transducers.☆68Updated 4 years ago
- Fast directed acyclic word graph generator☆91Updated 6 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆301Updated 10 months ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 6 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆233Updated 5 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- Golomb Coded Sets☆91Updated 7 years ago
- Trinity IR Infrastructure☆238Updated 5 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆253Updated last year
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- Roaring Bitmap in Cython☆81Updated 11 months ago
- Search for similar short strings☆52Updated 4 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆74Updated 3 years ago
- A library of inverted index data structures☆148Updated 2 years ago
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago
- A collection of succinct data structures☆201Updated last year
- ☆50Updated 4 years ago
- A Utility Library for Wikipedia dumps☆33Updated 8 years ago
- Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and look…☆177Updated 6 years ago
- Implementation of Burkhard-Keller trees in various languages☆52Updated 15 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Succinct Data Structure Library☆106Updated 11 years ago
- Compilation and rule-based optimization framework for relational algebra. Raco is the language, optimization, and query translation layer…☆72Updated 7 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 7 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 9 years ago
- SymSpellCompound: compound aware automatic spelling correction☆66Updated 7 years ago
- Suite of universal indexes for Highly Repetitive Document Collections☆20Updated 4 years ago
- A partially and lazily sorted list data structure for Python☆124Updated 4 years ago