jamesturk / jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
★2,166 · Updated 2 weeks ago
Alternatives and similar repositories for jellyfish
Users who are interested in jellyfish are comparing it to the libraries listed below.
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ★1,275 · Updated 4 years ago
- Fixes mojibake and other glitches in Unicode text, after the fact. ★3,983 · Updated last year
- Find dates inside text using Python and get back datetime objects ★664 · Updated last year
- Port of Google's language-detection library to Python. ★1,853 · Updated 8 months ago
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ★1,629 · Updated 7 months ago
- Rapid fuzzy string matching in Python using various string metrics ★3,508 · Updated last week
- python parser for human readable dates ★2,744 · Updated 2 weeks ago
- a python library for parsing unstructured western names into name components. ★609 · Updated 5 months ago
- A simple Python module for parsing human names into their individual components ★691 · Updated last year
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words ★1,045 · Updated 6 months ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ★3,501 · Updated 6 months ago
- Fuzzy String Matching in Python ★9,265 · Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ★1,027 · Updated last year
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ★4,394 · Updated 3 months ago
- A toolkit for making domain-specific probabilistic parsers ★806 · Updated last year
- extract text from any document. no muss. no fuss. ★4,357 · Updated 11 months ago
- Multilingual text (NLP) processing toolkit ★2,353 · Updated 2 years ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ★2,203 · Updated last month
- Python bindings to libpostal for fast international address parsing/normalization ★850 · Updated 2 weeks ago
- spellchecking library for python ★614 · Updated 2 months ago
- NLP, before and after spaCy ★2,235 · Updated 2 years ago
- A collection of common regular expressions bundled with an easy to use interface. ★1,581 · Updated 2 years ago
- a python library for parsing unstructured United States address strings into address components ★1,604 · Updated 3 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ★848 · Updated 2 months ago
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/ ★764 · Updated 2 months ago
- Parse strings using a specification based on the Python format() syntax. ★1,771 · Updated this week
- Python character encoding detector ★2,293 · Updated last week
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ★1,113 · Updated 3 weeks ago
- Simple yet flexible natural sorting in Python. ★994 · Updated 3 months ago
- Iterative JSON parser with Pythonic interfaces ★1,027 · Updated 2 weeks ago