Fuzzy String Matching in Python
☆3,578Mar 3, 2025Updated last year
Alternatives and similar repositories for thefuzz
Users that are interested in thefuzz are comparing it to the libraries listed below
Sorting:
- Fuzzy String Matching in Python☆9,270Feb 24, 2023Updated 3 years ago
- Rapid fuzzy string matching in Python using various string metrics☆3,740Jan 26, 2026Updated last month
- Python logging made (stupidly) simple☆23,653Feb 22, 2026Updated last week
- Extremely fast Query Engine for DataFrames, written in Rust☆37,582Updated this week
- Data validation using Python type hints☆27,055Updated this week
- Retrying library for Python☆8,406Updated this week
- Fuzzy string matching, grouping, and evaluation.☆791Jul 10, 2025Updated 7 months ago
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy☆7,933Feb 2, 2026Updated last month
- A light-weight, flexible, and expressive statistical data testing library☆4,212Feb 19, 2026Updated 2 weeks ago
- An extremely fast Python linter and code formatter, written in Rust.☆46,107Updated this week
- State-of-the-Art Text Embeddings☆18,323Feb 27, 2026Updated last week
- Typer, build great CLIs. Easy to code. Based on Python type hints.☆18,951Updated this week
- Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-fa…☆8,673Updated this week
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.☆4,440Jul 29, 2025Updated 7 months ago
- Streamlit — A faster way to build and share data apps.☆43,742Updated this week
- An extremely fast Python package and project manager, written in Rust.☆80,084Updated this week
- Prefect is a workflow orchestration framework for building resilient data pipelines in Python.☆21,697Updated this week
- A Fast, Extensible Progress Bar for Python and CLI☆30,985Feb 14, 2026Updated 2 weeks ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,254Nov 27, 2025Updated 3 months ago
- Modin: Scale your Pandas workflows by changing a single line of code☆10,363Feb 10, 2026Updated 3 weeks ago
- A next generation HTTP client for Python. 🦋☆15,122Updated this week
- 🪼 a python library for doing approximate and phonetic matching of strings.☆2,193Dec 15, 2025Updated 2 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆1,278Aug 11, 2021Updated 4 years ago
- Rich is a Python library for rich text and beautiful formatting in the terminal.☆55,654Feb 26, 2026Updated last week
- Build and share delightful machine learning apps, all in Python. 🌟 Star to support our work!☆41,921Updated this week
- Python packaging and dependency management made easy☆34,286Updated this week
- A library for efficient similarity search and clustering of dense vectors.☆39,255Updated this week
- SQL databases in Python, designed for simplicity, compatibility, and robustness.☆17,689Updated this week
- Memray is a memory profiler for Python☆14,904Updated this week
- 1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.☆13,399Feb 27, 2026Updated last week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,074Feb 27, 2026Updated last week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆9,146Feb 27, 2026Updated last week
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆9,847Updated this week
- Ray is an AI compute engine. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.☆41,516Updated this week
- FastAPI framework, high performance, easy to learn, fast to code, ready for production☆95,805Updated this week
- structured outputs for llms☆12,468Feb 25, 2026Updated last week
- More routines for operating on iterables, beyond itertools☆4,040Feb 10, 2026Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models☆32,519Updated this week
- Faker is a Python package that generates fake data for you.☆19,209Feb 23, 2026Updated last week