jamesturk / jellyfishLinks
πͺΌ a python library for doing approximate and phonetic matching of strings.
β2,153Updated this week
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,278Updated 4 years ago
- Fixes mojibake and other glitches in Unicode text, after the fact.β3,957Updated 10 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,037Updated 4 months ago
- Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.β1,613Updated 5 months ago
- A simple Python module for parsing human names into their individual componentsβ686Updated last year
- Port of Google's language-detection library to Python.β1,841Updated 6 months ago
- Fuzzy String Matching in Pythonβ9,259Updated 2 years ago
- Rapid fuzzy string matching in Python using various string metricsβ3,388Updated this week
- Find dates inside text using Python and get back datetime objectsβ662Updated last year
- python parser for human readable datesβ2,717Updated 3 weeks ago
- A collection of common regular expressions bundled with an easy to use interface.β1,578Updated 2 years ago
- A toolkit for making domain-specific probabilistic parsersβ805Updated 11 months ago
- serialize all of Pythonβ2,375Updated 2 months ago
- Computing with Python functions.β4,211Updated 2 weeks ago
- NLP, before and after spaCyβ2,228Updated last year
- π Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.β3,491Updated 4 months ago
- Multilingual text (NLP) processing toolkitβ2,353Updated last year
- Utils for streaming large files (S3, HDFS, gzip, bz2...)β3,356Updated last week
- python humanize functionsβ1,693Updated 3 years ago
- Parse strings using a specification based on the Python format() syntax.β1,769Updated 2 months ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.β4,369Updated last month
- extract text from any document. no muss. no fuss.β4,281Updated 9 months ago
- Python bindings to libpostal for fast international address parsing/normalizationβ842Updated 7 months ago
- a python library for parsing unstructured western names into name components.β609Updated 3 months ago
- spellchecking library for pythonβ613Updated last year
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ1,023Updated last year
- a python library for parsing unstructured United States address strings into address componentsβ1,593Updated last month
- π¦ Contextually-keyed word vectorsβ1,657Updated 4 months ago
- Parse human-readable date/time stringsβ704Updated 7 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extractionβ2,196Updated 2 months ago