julesjacobs / levenshteinLinks
A simple proof of concept levenshtein automaton in Python
☆108Updated 10 years ago
Alternatives and similar repositories for levenshtein
Users that are interested in levenshtein are comparing it to the libraries listed below
Sorting:
- Finite state dictionaries in Java☆131Updated 3 years ago
- HAT-Trie for Python☆86Updated 9 years ago
- Various utilities regarding Levenshtein transducers.☆68Updated 4 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆303Updated last year
- Roaring Bitmap in Cython☆81Updated last year
- Fast directed acyclic word graph generator☆91Updated 7 years ago
- *Deprecated* A fast and accurate part-of-speech tagger for TextBlob.☆101Updated 9 years ago
- Lightning fast spell correction / fuzzy search library based on SymSpell by Commerce-Experts☆81Updated 7 years ago
- Json Wikipedia, contains code to convert the Wikipedia xml dump into a json/avro dump☆254Updated last year
- Implementation of Bayesian Sets for fast similarity searches.☆14Updated 14 years ago
- Forever incomplete suite of tools for an orthographic/grammatical checker☆29Updated 5 years ago
- Learning M-Way Tree - Web Scale Clustering - EM-tree, K-tree, k-means, TSVQ, repeated k-means, bitwise clustering☆77Updated 3 years ago
- The BLOG programming language☆100Updated 2 years ago
- Open-Source Information Retrieval Reproducibility Challenge☆50Updated 9 years ago
- Language Lego☆141Updated 5 years ago
- Locality-sensitive hashing algorithm for text similarity comparisons☆58Updated 6 months ago
- Keyvi - a key value index that powers Cliqz search engine. It is an in-memory FST-based data structure highly optimized for size and look…☆177Updated 6 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 7 years ago
- An efficient approximation for tree edit-distance.☆45Updated 14 years ago
- A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.☆52Updated 8 years ago
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived']☆81Updated 9 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Compute association strength over semantic networks in a dimensionality-reduced form.☆32Updated 10 years ago
- A bunch of fancy soft string matching routines, with some accompanying datasets☆56Updated 8 years ago
- Read natural language interactive queries. Great for bots.☆18Updated 8 years ago
- Code for "Performance shootout between nearest-neighbour libraries": http://radimrehurek.com/2013/11/performance-shootout-of-nearest-neig…☆99Updated 10 years ago
- A fast Python implementation of locality sensitive hashing.☆70Updated 10 years ago
- Golomb Coded Sets☆94Updated 8 years ago
- Automatic keyword extraction - no alchemy required!☆169Updated 9 years ago
- FlashX is a collection of big data analytics tools that perform data analytics in the form of graphs and matrices.☆235Updated 5 years ago