julesjacobs / levenshtein
A simple proof of concept levenshtein automaton in Python
☆109Updated 9 years ago
Alternatives and similar repositories for levenshtein:
Users that are interested in levenshtein are comparing it to the libraries listed below
- Finite state dictionaries in Java☆130Updated 3 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆300Updated 9 months ago
- Various utilities regarding Levenshtein transducers.☆68Updated 4 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- Roaring Bitmap in Cython☆81Updated 10 months ago
- A General-Purpose Counting Filter: Counting Quotient Filter☆127Updated last year
- Software and resources for natural language processing.☆131Updated 8 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Succinct Data Structure Library☆106Updated 11 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 7 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- Solr Dictionary Annotator (Microservice for Spark)☆71Updated 5 years ago
- Search for similar short strings☆52Updated 4 years ago
- Trinity IR Infrastructure☆237Updated 5 years ago
- Forever incomplete suite of tools for an orthographic/grammatical checker☆28Updated 5 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆82Updated 8 years ago
- Simhashing in C++☆132Updated 2 years ago
- A library of inverted index data structures☆148Updated 2 years ago
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 13 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Updated 3 years ago
- SimString☆113Updated 3 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆233Updated 4 years ago
- EliasFanoCompression: quasi-succinct compression of sorted integers in C#☆45Updated 3 years ago
- Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and look…☆177Updated 6 years ago
- HyperMinHash: Bringing intersections to HyperLogLog☆303Updated 7 years ago
- Implementation of Burkhard-Keller trees in various languages☆52Updated 14 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆74Updated 3 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆102Updated 9 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆253Updated last year
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago