jamesturk / jellyfishView external linksLinks
🪼 a python library for doing approximate and phonetic matching of strings.
☆2,189Dec 15, 2025Updated last month
Alternatives and similar repositories for jellyfish
Users that are interested in jellyfish are comparing it to the libraries listed below
Sorting:
- Fuzzy String Matching in Python☆9,271Feb 24, 2023Updated 2 years ago
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,436Jul 29, 2025Updated 6 months ago
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.☆3,515Apr 18, 2025Updated 9 months ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python☆1,045Feb 21, 2024Updated last year
- NLP, before and after spaCy☆2,232Sep 22, 2023Updated 2 years ago
- a python library for parsing unstructured western names into name components.☆616May 15, 2025Updated 8 months ago
- Rapid fuzzy string matching in Python using various string metrics☆3,716Jan 26, 2026Updated 2 weeks ago
- A toolkit for making domain-specific probabilistic parsers☆805Sep 26, 2024Updated last year
- Python implementation of TextRank algorithms ("textgraphs") for phrase extraction☆2,209Feb 1, 2026Updated 2 weeks ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,201Nov 27, 2025Updated 2 months ago
- Python datetimes made easy☆6,618Feb 6, 2026Updated last week
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆250Oct 1, 2025Updated 4 months ago
- A very simple framework for state-of-the-art Natural Language Processing (NLP)☆14,355Oct 27, 2025Updated 3 months ago
- Fixes mojibake and other glitches in Unicode text, after the fact.☆4,012Oct 30, 2024Updated last year
- Extract Keywords from sentence or Replace keywords in sentences.☆5,709Apr 13, 2025Updated 10 months ago
- Utils for streaming large files (S3, HDFS, gzip, bz2...)☆3,433Jan 28, 2026Updated 2 weeks ago
- Fuzzy string matching, grouping, and evaluation.☆788Jul 10, 2025Updated 7 months ago
- Parallel computing with task scheduling☆13,738Feb 5, 2026Updated last week
- Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk☆14,151Oct 29, 2025Updated 3 months ago
- Topic Modelling for Humans☆16,355Nov 1, 2025Updated 3 months ago
- Clean personally identifiable information from dirty dirty text.☆417Sep 1, 2023Updated 2 years ago
- Modin: Scale your Pandas workflows by changing a single line of code☆10,357Updated this week
- A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.☆10,048Sep 11, 2025Updated 5 months ago
- 🔩 Like builtins, but boltons. 250+ constructs, recipes, and snippets which extend (and rely on nothing but) the Python standard library.…☆6,851Jan 28, 2026Updated 2 weeks ago
- Declarative visualization library for Python☆10,246Feb 6, 2026Updated last week
- python parser for human readable dates☆2,780Updated this week
- The property-based testing library for Python☆8,446Updated this week
- 🦆 Contextually-keyed word vectors☆1,672Apr 23, 2025Updated 9 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,278Aug 11, 2021Updated 4 years ago
- A Python tool that automatically cleans data sets and readies them for analysis.☆1,077May 22, 2019Updated 6 years ago
- a python library for parsing unstructured United States address strings into address components☆1,614Aug 7, 2025Updated 6 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆14,977Dec 6, 2025Updated 2 months ago
- A library for debugging/inspecting machine learning classifiers and explaining their predictions☆2,771Updated this week
- A library for defensive data analysis.☆502Jan 6, 2020Updated 6 years ago
- Python Classes Without Boilerplate☆5,726Feb 4, 2026Updated last week
- Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.☆9,506Updated this week
- Full text geoparsing as a Python library☆758Sep 17, 2021Updated 4 years ago
- python humanize functions☆1,697Jul 17, 2022Updated 3 years ago
- 🔮 A refreshing functional take on deep learning, compatible with your favorite libraries☆2,891Updated this week