luozhouyang / python-string-similarityLinks

A library implementing different string similarity and distance measures using Python.

☆1,014

Alternatives and similar repositories for python-string-similarity

Users that are interested in python-string-similarity are comparing it to the libraries listed below

Sorting:

WojciechMula / pyahocorasick
Python module (C extension and plain python) implementing Aho-Corasick algorithm
☆1,015Updated last month
mammothb / symspellpy
Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm…
☆835Updated 3 months ago
barrust / pyspellchecker
Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/
☆751Updated 2 weeks ago
ztane / python-Levenshtein
The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity
☆1,276Updated 3 years ago
abusix / ahocorapy
Pure python Aho-Corasick library.
☆216Updated 2 years ago
boudinfl / pke
Python Keyphrase Extraction module
☆1,580Updated 2 years ago
chakki-works / seqeval
A Python framework for sequence labeling evaluation(named-entity recognition, pos tagging, etc...)
☆1,147Updated 11 months ago
jamesturk / jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
☆2,148Updated last month
oborchers / Fast_Sentence_Embeddings
Compute Sentence Embeddings Fast!
☆623Updated 2 years ago
chartbeat-labs / textacy
NLP, before and after spaCy
☆2,230Updated last year
vgrabovets / multi_rake
Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python
☆272Updated 2 years ago
csurfer / rake-nltk
Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.
☆1,072Updated 2 years ago
keredson / wordninja
Probabilistically split concatenated words using NLP based on English Wikipedia unigram frequencies.
☆851Updated 2 years ago
MaartenGr / PolyFuzz
Fuzzy string matching, grouping, and evaluation.
☆774Updated 3 weeks ago
LIAAD / yake
Single-document unsupervised keyword extraction
☆1,757Updated 2 weeks ago
summanlp / textrank
TextRank implementation for Python 3.
☆1,263Updated 2 years ago
life4 / textdistance
📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.
☆3,480Updated 3 months ago
nipunsadvilkar / pySBD
🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
☆864Updated 11 months ago
jfilter / clean-text
🧹 Python package for text cleaning
☆982Updated 2 years ago
explosion / spacy-stanza
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
☆739Updated 11 months ago
DerwenAI / pytextrank
Python implementation of TextRank algorithms ("textgraphs") for phrase extraction
☆2,188Updated 3 weeks ago
Mimino666 / langdetect
Port of Google's language-detection library to Python.
☆1,826Updated 5 months ago
TeamHG-Memex / sklearn-crfsuite
scikit-learn inspired API for CRFsuite
☆431Updated last year
explosion / spacy-models
💫 Models for the spaCy Natural Language Processing (NLP) library
☆1,768Updated 2 months ago
scrapinghub / python-crfsuite
A python binding for crfsuite
☆775Updated 10 months ago
akoumjian / datefinder
Find dates inside text using Python and get back datetime objects
☆661Updated last year
aboSamoor / polyglot
Multilingual text (NLP) processing toolkit
☆2,351Updated last year
pyenchant / pyenchant
spellchecking library for python
☆610Updated last year
stephenhky / PyShortTextCategorization
Various Algorithms for Short Text Mining
☆472Updated this week
rwalk / gsdmm
GSDMM: Short text clustering
☆356Updated 2 years ago