jamesturk / jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
☆2,097Updated last month
Alternatives and similar repositories for jellyfish:
Users that are interested in jellyfish are comparing it to the libraries listed below
- Fixes mojibake and other glitches in Unicode text, after the fact.☆3,856Updated 3 months ago
- Rapid fuzzy string matching in Python using various string metrics☆2,907Updated 3 weeks ago
- python parser for human readable dates☆2,616Updated 2 weeks ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,442Updated 5 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,268Updated 3 years ago
- NLP, before and after spaCy☆2,215Updated last year
- A simple Python module for parsing human names into their individual components☆668Updated 8 months ago
- Useful extensions to the standard Python datetime features☆2,423Updated 6 months ago
- Fuzzy String Matching in Python☆3,054Updated 11 months ago
- Find dates inside text using Python and get back datetime objects☆641Updated 9 months ago
- Computing with Python functions.☆3,984Updated 3 weeks ago
- Python datetimes made easy☆6,359Updated 3 weeks ago
- Fuzzy String Matching in Python☆9,245Updated last year
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words☆1,002Updated this week
- Port of Google's language-detection library to Python.☆1,752Updated last year
- Multilingual text (NLP) processing toolkit☆2,322Updated last year
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.☆1,181Updated 2 weeks ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,166Updated 7 months ago
- 🦆 Contextually-keyed word vectors☆1,638Updated 11 months ago
- serialize all of Python☆2,318Updated this week
- Retrying library for Python☆7,068Updated this week
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆982Updated 11 months ago
- Python bindings to libpostal for fast international address parsing/normalization☆784Updated last week
- extract text from any document. no muss. no fuss.☆3,972Updated 2 months ago
- A toolkit for making domain-specific probabilistic parsers☆799Updated 4 months ago
- spellchecking library for python☆606Updated 8 months ago
- a python library for parsing unstructured western names into name components.☆599Updated 3 months ago
- Stand-alone language identification system☆2,354Updated 5 years ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)☆3,266Updated 2 months ago
- Extract Keywords from sentence or Replace keywords in sentences.☆5,630Updated 7 months ago