jamesturk / jellyfishLinks
πͺΌ a python library for doing approximate and phonetic matching of strings.
β2,151Updated this week
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- python parser for human readable datesβ2,711Updated 2 weeks ago
- NLP, before and after spaCyβ2,229Updated last year
- Fixes mojibake and other glitches in Unicode text, after the fact.β3,940Updated 9 months ago
- Port of Google's language-detection library to Python.β1,830Updated 5 months ago
- Find dates inside text using Python and get back datetime objectsβ662Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,276Updated 4 years ago
- A simple Python module for parsing human names into their individual componentsβ685Updated last year
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.β4,357Updated 3 weeks ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extractionβ2,188Updated last month
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,035Updated 3 months ago
- π¦ Contextually-keyed word vectorsβ1,657Updated 4 months ago
- A collection of common regular expressions bundled with an easy to use interface.β1,579Updated 2 years ago
- Python bindings to libpostal for fast international address parsing/normalizationβ838Updated 6 months ago
- a python library for parsing unstructured western names into name components.β609Updated 3 months ago
- Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.β1,611Updated 4 months ago
- Multilingual text (NLP) processing toolkitβ2,353Updated last year
- Rapid fuzzy string matching in Python using various string metricsβ3,340Updated last week
- π Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.β3,489Updated 4 months ago
- A toolkit for making domain-specific probabilistic parsersβ805Updated 10 months ago
- spellchecking library for pythonβ611Updated last year
- a python library for parsing unstructured United States address strings into address componentsβ1,592Updated 2 weeks ago
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ1,023Updated last year
- python humanize functionsβ1,688Updated 3 years ago
- Fuzzy String Matching in Pythonβ9,259Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β839Updated last week
- Computing with Python functions.β4,182Updated last week
- Iterative JSON parser with Pythonic interfacesβ993Updated last week
- Parse human-readable date/time stringsβ703Updated 6 months ago
- serialize all of Pythonβ2,368Updated 2 months ago
- extract text from any document. no muss. no fuss.β4,263Updated 8 months ago