jamesturk / jellyfish
๐ชผ a python library for doing approximate and phonetic matching of strings.
โ2,083Updated 2 weeks ago
Alternatives and similar repositories for jellyfish:
Users that are interested in jellyfish are comparing it to the libraries listed below
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityโ1,267Updated 3 years ago
- ๐ Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.โ3,432Updated 4 months ago
- python parser for human readable datesโ2,584Updated this week
- Rapid fuzzy string matching in Python using various string metricsโ2,830Updated this week
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.โ4,200Updated last month
- Fixes mojibake and other glitches in Unicode text, after the fact.โ3,841Updated 2 months ago
- Tika-Python is a Python binding to the Apache Tikaโข REST services allowing Tika to be called natively in the Python community.โ1,530Updated 9 months ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsโ989Updated 2 weeks ago
- Useful extensions to the standard Python datetime featuresโ2,404Updated 5 months ago
- NLP, before and after spaCyโ2,214Updated last year
- Fuzzy String Matching in Pythonโ9,237Updated last year
- extract text from any document. no muss. no fuss.โ3,956Updated last month
- a python library for parsing unstructured western names into name components.โ599Updated 2 months ago
- Python library providing function decorators for configurable backoff and retryโ2,622Updated 8 months ago
- A simple Python module for parsing human names into their individual componentsโ663Updated 7 months ago
- Computing with Python functions.โ3,938Updated this week
- More routines for operating on iterables, beyond itertoolsโ3,781Updated this week
- Python port of Google's libphonenumberโ3,537Updated this week
- โ๏ธ Python's nested data operator (and CLI), for all your declarative restructuring needs. Got data? Glom it! โ๏ธโ1,928Updated this week
- Multilingual text (NLP) processing toolkitโ2,317Updated last year
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library.โ1,050Updated 2 months ago
- Datetimes for Humansโขโ3,411Updated 5 months ago
- Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.โ4,788Updated last year
- ๐ฎ A refreshing functional take on deep learning, compatible with your favorite librariesโ2,825Updated this week
- Fuzzy String Matching in Pythonโ3,010Updated 10 months ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)โ3,249Updated last month
- Python module (C extension and plain python) implementing Aho-Corasick algorithmโ967Updated 9 months ago
- Port of Google's language-detection library to Python.โ1,744Updated 11 months ago
- Python bindings to libpostal for fast international address parsing/normalizationโ775Updated 6 months ago
- Python library for creating data pipelines with chain functional programmingโ2,411Updated 6 months ago