roy-ht / editdistance

Fast implementation of the edit distance (Levenshtein distance)

☆689

Alternatives and similar repositories for editdistance

Users who are interested in editdistance compare it with the libraries listed below.

ztane / python-Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
☆1,276 Updated 3 years ago
WojciechMula / pyahocorasick
Python module (C extension and plain Python) implementing the Aho-Corasick algorithm
☆1,015 Updated last month
grantjenks / python-wordsegment
English word segmentation, written in pure-Python, and based on a trillion-word corpus.
☆376 Updated 2 years ago
pytries / marisa-trie
Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.
☆1,098 Updated last month
pyenchant / pyenchant
Spellchecking library for Python
☆610 Updated last year
taleinat / fuzzysearch
Find parts of long text or data, allowing for some changes/typos.
☆325 Updated 2 months ago
hplt-project / sacremoses
Python port of Moses tokenizer, truecaser and normalizer
☆495 Updated last year
lanl / pyxDamerauLevenshtein
pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.
☆246 Updated last week
google / pygtrie
Python library implementing a trie data structure.
☆824 Updated 4 years ago
pytries / datrie
Fast, efficiently stored Trie for Python. Uses libdatrie.
☆537 Updated last year
nipunsadvilkar / pySBD
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
☆864 Updated 11 months ago
explosion / srsly
🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)
☆472 Updated 6 months ago
scrapinghub / python-crfsuite
A Python binding for CRFsuite
☆775 Updated 10 months ago
miohtama / python-Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
☆387 Updated 2 years ago
mammothb / symspellpy
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…
☆835 Updated 3 months ago
fujimotos / polyleven
A Fast Levenshtein Distance Library for Python
☆84 Updated 5 months ago
chokkan / crfsuite
CRFsuite: a fast implementation of Conditional Random Fields (CRFs)
☆659 Updated last year
ptrus / suffix-trees
Python implementation of Suffix Trees and Generalized Suffix Trees.
☆127 Updated 5 years ago
gpoulter / python-ngram
Python Set subclass that supports searching by ngram similarity
☆119 Updated 3 years ago
pytries / DAWG
DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.
☆305 Updated last year
taki0112 / Vector_Similarity
Python, Java implementation of TS-SS called from "A Hybrid Geometric Approach for Measuring Similarity Level Among Documents and Document…
☆300 Updated 5 years ago
belambert / edit-distance
Python library for computing edit distance between arbitrary Python sequences.
☆102 Updated 5 months ago
abusix / ahocorapy
Pure Python Aho-Corasick library.
☆216 Updated 2 years ago
Mimino666 / langdetect
Port of Google's language-detection library to Python.
☆1,826 Updated 5 months ago
glample / fastBPE
Fast BPE
☆670 Updated last year
aboSamoor / pycld2
☆171 Updated 4 months ago
bheinzerling / bpemb
Pre-trained subword embeddings in 275 languages, based on Byte-Pair Encoding (BPE)
☆1,216 Updated 10 months ago
jaraco / inflect
Correctly generate plurals, ordinals, indefinite articles; convert numbers to words
☆1,033 Updated 2 months ago
TeamHG-Memex / sklearn-crfsuite
scikit-learn inspired API for CRFsuite
☆431 Updated last year
infoscout / weighted-levenshtein
Weighted Levenshtein library
☆110 Updated 2 years ago