🪼 a python library for doing approximate and phonetic matching of strings.
☆2,193Mar 3, 2026Updated this week
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- Fuzzy String Matching in Python☆9,270Feb 24, 2023Updated 3 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,440Jul 29, 2025Updated 7 months ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,519Apr 18, 2025Updated 10 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,046Feb 21, 2024Updated 2 years ago
- NLP, before and after spaCy☆2,236Sep 22, 2023Updated 2 years ago
- a python library for parsing unstructured western names into name components.☆616May 15, 2025Updated 9 months ago
- Rapid fuzzy string matching in Python using various string metrics☆3,751Updated this week
- A toolkit for making domain-specific probabilistic parsers☆806Sep 26, 2024Updated last year
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,209Feb 15, 2026Updated 2 weeks ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,283Updated this week
- Python datetimes made easy☆6,620Feb 17, 2026Updated 2 weeks ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆251Oct 1, 2025Updated 5 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,354Oct 27, 2025Updated 4 months ago
- Fixes mojibake and other glitches in Unicode text, after the fact.☆4,013Oct 30, 2024Updated last year
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- Extract Keywords from sentence or Replace keywords in sentences.☆5,708Apr 13, 2025Updated 10 months ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)☆3,433Feb 23, 2026Updated last week
- Parallel computing with task scheduling☆13,754Updated this week
- Topic Modelling for Humans☆16,371Nov 1, 2025Updated 4 months ago
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,175Oct 29, 2025Updated 4 months ago
- Clean personally identifiable information from dirty dirty text.☆418Sep 1, 2023Updated 2 years ago
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated 3 weeks ago
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆10,046Sep 11, 2025Updated 5 months ago
- Declarative visualization library for Python☆10,286Updated this week
- 🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.…☆6,854Jan 28, 2026Updated last month
- The property-based testing library for Python☆8,476Mar 1, 2026Updated last week
- python parser for human readable dates☆2,788Feb 27, 2026Updated last week
- 🦆 Contextually-keyed word vectors☆1,673Apr 23, 2025Updated 10 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,278Aug 11, 2021Updated 4 years ago
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,078May 22, 2019Updated 6 years ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆14,997Dec 6, 2025Updated 3 months ago
- a python library for parsing unstructured United States address strings into address components☆1,616Aug 7, 2025Updated 7 months ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,772Feb 10, 2026Updated 3 weeks ago
- A library for defensive data analysis.☆502Jan 6, 2020Updated 6 years ago
- Python Classes Without Boilerplate☆5,738Updated this week
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,517Updated this week
- Full text geoparsing as a Python library☆758Sep 17, 2021Updated 4 years ago
- python humanize functions☆1,700Jul 17, 2022Updated 3 years ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,892Feb 9, 2026Updated 3 weeks ago