jamesturk / jellyfish
πͺΌ a python library for doing approximate and phonetic matching of strings.
β2,040Updated last week
Related projects: β
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,261Updated 3 years ago
- Fixes mojibake and other glitches in Unicode text, after the fact.β3,761Updated 2 weeks ago
- extract text from any document. no muss. no fuss.β3,865Updated 2 weeks ago
- python parser for human readable datesβ2,525Updated 3 weeks ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ957Updated last week
- Fuzzy String Matching in Pythonβ9,212Updated last year
- π Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.β3,357Updated last week
- A simple Python module for parsing human names into their individual componentsβ650Updated 3 months ago
- Rapid fuzzy string matching in Python using various string metricsβ2,610Updated last week
- Computing with Python functions.β3,815Updated 3 weeks ago
- serialize all of Pythonβ2,241Updated this week
- Useful extensions to the standard Python datetime featuresβ2,324Updated last month
- python humanize functionsβ1,677Updated 2 years ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)β3,174Updated this week
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.β4,095Updated this week
- Port of Google's language-detection library to Python.β1,707Updated 7 months ago
- More routines for operating on iterables, beyond itertoolsβ3,694Updated this week
- NLP, before and after spaCyβ2,205Updated 11 months ago
- Multilingual text (NLP) processing toolkitβ2,308Updated 10 months ago
- A toolkit for making domain-specific probabilistic parsersβ792Updated last year
- Fuzzy String Matching in Pythonβ2,763Updated 6 months ago
- Parse strings using a specification based on the Python format() syntax.β1,693Updated 2 months ago
- Find dates inside text using Python and get back datetime objectsβ634Updated 4 months ago
- A functional standard library for Python.β4,640Updated 3 months ago
- Python datetimes made easyβ6,191Updated 3 months ago
- Python library providing function decorators for configurable backoff and retryβ2,574Updated 4 months ago
- Tika-Python is a Python binding to the Apache Tikaβ’ REST services allowing Tika to be called natively in the Python community.β1,482Updated 5 months ago
- NumPy and Pandas interface to Big Dataβ3,180Updated 11 months ago
- An in-browser Python profile viewerβ2,319Updated 9 months ago
- a python library for parsing unstructured western names into name components.β586Updated 5 months ago