jamesturk / jellyfishLinks
πͺΌ a python library for doing approximate and phonetic matching of strings.
β2,159Updated 3 weeks ago
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,277Updated 4 years ago
- Find dates inside text using Python and get back datetime objectsβ663Updated last year
- Fixes mojibake and other glitches in Unicode text, after the fact.β3,970Updated 11 months ago
- A simple Python module for parsing human names into their individual componentsβ686Updated last year
- NLP, before and after spaCyβ2,231Updated 2 years ago
- A toolkit for making domain-specific probabilistic parsersβ805Updated last year
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,039Updated 4 months ago
- Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.β1,623Updated 5 months ago
- Rapid fuzzy string matching in Python using various string metricsβ3,424Updated this week
- Port of Google's language-detection library to Python.β1,848Updated 7 months ago
- A collection of common regular expressions bundled with an easy to use interface.β1,581Updated 2 years ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extractionβ2,199Updated 2 months ago
- Python bindings to libpostal for fast international address parsing/normalizationβ844Updated 7 months ago
- a python library for parsing unstructured western names into name components.β608Updated 4 months ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β845Updated 3 weeks ago
- spellchecking library for pythonβ614Updated 2 weeks ago
- Heuristic based boilerplate removal toolβ798Updated 7 months ago
- extract text from any document. no muss. no fuss.β4,315Updated 10 months ago
- python parser for human readable datesβ2,727Updated last month
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.β4,381Updated 2 months ago
- a python library for parsing unstructured United States address strings into address componentsβ1,595Updated last month
- π¦ Contextually-keyed word vectorsβ1,660Updated 5 months ago
- Fuzzy String Matching in Pythonβ9,260Updated 2 years ago
- Multilingual text (NLP) processing toolkitβ2,351Updated last year
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/β760Updated 2 weeks ago
- βοΈ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! βοΈβ2,073Updated 8 months ago
- Python character encoding detectorβ2,287Updated 8 months ago
- A library implementing different string similarity and distance measures using Python.β1,019Updated 2 years ago
- Python implementation of the Rapid Automatic Keyword Extraction algorithm using NLTK.β1,073Updated 2 years ago
- python package to calculate readability statistics of a text object - paragraphs, sentences, articles.β1,318Updated last month