jamesturk / jellyfishLinks
πͺΌ a python library for doing approximate and phonetic matching of strings.
β2,148Updated last month
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,276Updated 3 years ago
- Fixes mojibake and other glitches in Unicode text, after the fact.β3,934Updated 9 months ago
- python parser for human readable datesβ2,705Updated last week
- Find dates inside text using Python and get back datetime objectsβ661Updated last year
- π Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.β3,480Updated 3 months ago
- A simple Python module for parsing human names into their individual componentsβ680Updated last year
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,031Updated 2 months ago
- Python bindings to libpostal for fast international address parsing/normalizationβ831Updated 5 months ago
- NLP, before and after spaCyβ2,230Updated last year
- a python library for parsing unstructured United States address strings into address componentsβ1,589Updated 3 weeks ago
- Port of Google's language-detection library to Python.β1,826Updated 5 months ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.β4,352Updated this week
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ1,018Updated last year
- Python implementation of TextRank algorithms ("textgraphs") for phrase extractionβ2,188Updated 3 weeks ago
- A toolkit for making domain-specific probabilistic parsersβ805Updated 10 months ago
- A collection of common regular expressions bundled with an easy to use interface.β1,579Updated 2 years ago
- serialize all of Pythonβ2,364Updated last month
- a python library for parsing unstructured western names into name components.β608Updated 2 months ago
- extract text from any document. no muss. no fuss.β4,229Updated 8 months ago
- Computing with Python functions.β4,130Updated 2 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β835Updated 3 months ago
- Multilingual text (NLP) processing toolkitβ2,349Updated last year
- Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.β1,608Updated 3 months ago
- Parse strings using a specification based on the Python format() syntax.β1,760Updated 3 weeks ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)β3,340Updated 3 weeks ago
- Fuzzy String Matching in Pythonβ9,257Updated 2 years ago
- spellchecking library for pythonβ610Updated last year
- βοΈ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! βοΈβ2,044Updated 6 months ago
- Rapid fuzzy string matching in Python using various string metricsβ3,264Updated this week
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.β246Updated this week