jamesturk / jellyfishLinks
πͺΌ a python library for doing approximate and phonetic matching of strings.
β2,172Updated last week
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,276Updated 4 years ago
- Fixes mojibake and other glitches in Unicode text, after the fact.β3,989Updated last year
- Find dates inside text using Python and get back datetime objectsβ665Updated last year
- NLP, before and after spaCyβ2,234Updated 2 years ago
- python parser for human readable datesβ2,750Updated last month
- Utils for streaming large files (S3, HDFS, gzip, bz2...)β3,413Updated 3 weeks ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,051Updated 6 months ago
- A simple Python module for parsing human names into their individual componentsβ697Updated last year
- A collection of common regular expressions bundled with an easy to use interface.β1,582Updated 2 years ago
- Multilingual text (NLP) processing toolkitβ2,361Updated 2 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ1,032Updated last year
- π Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.β3,508Updated 7 months ago
- Fuzzy String Matching in Pythonβ3,500Updated 9 months ago
- spellchecking library for pythonβ614Updated 2 months ago
- βοΈ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! βοΈβ2,097Updated 10 months ago
- Fuzzy String Matching in Pythonβ9,268Updated 2 years ago
- Parse strings using a specification based on the Python format() syntax.β1,776Updated this week
- Port of Google's language-detection library to Python.β1,858Updated 9 months ago
- a python library for parsing unstructured western names into name components.β614Updated 6 months ago
- Computing with Python functions.β4,280Updated this week
- π¦ Contextually-keyed word vectorsβ1,666Updated 7 months ago
- extract text from any document. no muss. no fuss.β4,382Updated last year
- Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.β1,633Updated 7 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extractionβ2,204Updated this week
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/β766Updated last week
- Python library providing function decorators for configurable backoff and retryβ2,703Updated last year
- A toolkit for making domain-specific probabilistic parsersβ805Updated last year
- Python library for serializing any arbitrary object graph into JSON. It can take almost any Python object and turn the object into JSON. β¦β1,312Updated last week
- A library implementing different string similarity and distance measures using Python.β1,021Updated 3 years ago
- serialize all of Pythonβ2,408Updated 3 weeks ago