jamesturk / jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
★2,157 · Updated last week
Alternatives and similar repositories for jellyfish
Users who are interested in jellyfish are comparing it to the libraries listed below.
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ★1,276 · Updated 4 years ago
- Find dates inside text using Python and get back datetime objects ★663 · Updated last year
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ★1,627 · Updated 6 months ago
- python parser for human readable dates ★2,734 · Updated 2 months ago
- Fixes mojibake and other glitches in Unicode text, after the fact. ★3,980 · Updated 11 months ago
- Python bindings to libpostal for fast international address parsing/normalization ★846 · Updated 8 months ago
- A simple Python module for parsing human names into their individual components ★689 · Updated last year
- Port of Google's language-detection library to Python. ★1,851 · Updated 7 months ago
- 📏 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ★3,497 · Updated 6 months ago
- NLP, before and after spaCy ★2,230 · Updated 2 years ago
- Multilingual text (NLP) processing toolkit ★2,348 · Updated last year
- extract text from any document. no muss. no fuss. ★4,338 · Updated 10 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words ★1,043 · Updated 5 months ago
- Fuzzy String Matching in Python ★9,263 · Updated 2 years ago
- a python library for parsing unstructured western names into name components. ★609 · Updated 5 months ago
- A toolkit for making domain-specific probabilistic parsers ★807 · Updated last year
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ★4,382 · Updated 2 months ago
- A collection of common regular expressions bundled with an easy to use interface. ★1,580 · Updated 2 years ago
- a python library for parsing unstructured United States address strings into address components ★1,598 · Updated 2 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ★2,203 · Updated 2 weeks ago
- Useful extensions to the standard Python datetime features ★2,550 · Updated last month
- Python character encoding detector ★2,289 · Updated last week
- Parse human-readable date/time strings ★705 · Updated last week
- Fuzzy String Matching in Python ★3,456 · Updated 7 months ago
- ASCII transliterations of Unicode text - GitHub mirror ★589 · Updated last month
- python humanize functions ★1,692 · Updated 3 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ★847 · Updated last month
- Rapid fuzzy string matching in Python using various string metrics ★3,475 · Updated this week
- ☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️ ★2,085 · Updated 9 months ago
- Persistent HTTP cache for python requests ★1,459 · Updated 2 weeks ago