jamesturk / jellyfish
🪼 a python library for doing approximate and phonetic matching of strings.
⭐ 2,144 · Updated 3 weeks ago
Alternatives and similar repositories for jellyfish
Users who are interested in jellyfish are comparing it to the libraries listed below.
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ⭐ 1,273 · Updated 3 years ago
- Find dates inside text using Python and get back datetime objects ⭐ 661 · Updated last year
- Fixes mojibake and other glitches in Unicode text, after the fact. ⭐ 3,923 · Updated 8 months ago
- A simple Python module for parsing human names into their individual components ⭐ 676 · Updated last year
- python parser for human readable dates ⭐ 2,689 · Updated 2 weeks ago
- Port of Google's language-detection library to Python. ⭐ 1,820 · Updated 4 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to words ⭐ 1,026 · Updated last month
- A collection of common regular expressions bundled with an easy to use interface. ⭐ 1,576 · Updated 2 years ago
- NLP, before and after spaCy ⭐ 2,228 · Updated last year
- extract text from any document. no muss. no fuss. ⭐ 4,184 · Updated 7 months ago
- Python bindings to libpostal for fast international address parsing/normalization ⭐ 824 · Updated 5 months ago
- python humanize functions ⭐ 1,684 · Updated 2 years ago
- serialize all of Python ⭐ 2,364 · Updated 2 weeks ago
- spellchecking library for python ⭐ 611 · Updated last year
- Rapid fuzzy string matching in Python using various string metrics ⭐ 3,220 · Updated 2 weeks ago
- Multilingual text (NLP) processing toolkit ⭐ 2,346 · Updated last year
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ⭐ 3,476 · Updated 2 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ⭐ 1,014 · Updated last year
- a python library for parsing unstructured western names into name components. ⭐ 606 · Updated last month
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ⭐ 1,596 · Updated 2 months ago
- Useful extensions to the standard Python datetime features ⭐ 2,481 · Updated 3 months ago
- Fuzzy String Matching in Python ⭐ 3,291 · Updated 4 months ago
- A toolkit for making domain-specific probabilistic parsers ⭐ 803 · Updated 9 months ago
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction ⭐ 2,184 · Updated 3 weeks ago
- Fuzzy String Matching in Python ⭐ 9,255 · Updated 2 years ago
- CONTRIBUTIONS ONLY: Voluptuous, despite the name, is a Python data validation library. ⭐ 1,831 · Updated 2 months ago
- Parse strings using a specification based on the Python format() syntax. ⭐ 1,755 · Updated this week
- ☄️ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! ☄️ ⭐ 2,025 · Updated 6 months ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...) ⭐ 3,330 · Updated last week
- The bidirectional mapping library for Python. ⭐ 1,536 · Updated this week