rapidfuzz / RapidFuzz
Rapid fuzzy string matching in Python using various string metrics
☆3,702 Updated last week
Alternatives and similar repositories for RapidFuzz
Users who are interested in RapidFuzz are comparing it to the libraries listed below.
- Fuzzy String Matching in Python ☆3,564 Updated 11 months ago
- 🪼 a python library for doing approximate and phonetic matching of strings. ☆2,187 Updated last month
- A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution. ☆4,433 Updated 6 months ago
- Fixes mojibake and other glitches in Unicode text, after the fact. ☆4,011 Updated last year
- Port of Google's language-detection library to Python. ☆1,870 Updated 11 months ago
- A light-weight, flexible, and expressive statistical data testing library ☆4,186 Updated last week
- python parser for human readable dates ☆2,778 Updated this week
- Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python. ☆1,536 Updated 2 weeks ago
- A Python package for easy multiprocessing, but faster than multiprocessing ☆2,076 Updated last year
- The most accurate natural language detection library for Python, suitable for short text and mixed-language text ☆1,627 Updated 2 months ago
- Fuzzy string matching, grouping, and evaluation. ☆788 Updated 6 months ago
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy ☆7,863 Updated this week
- 📐 Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage. ☆3,513 Updated 9 months ago
- Iterative JSON parser with Pythonic interfaces ☆1,052 Updated 3 weeks ago
- Computing with Python functions. ☆4,321 Updated 3 weeks ago
- A powerful and modular toolkit for record linkage and duplicate detection in Python ☆1,045 Updated last year
- Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate… ☆2,511 Updated 6 months ago
- Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python. ☆2,817 Updated last year
- 🚴 Call stack profiler for Python. Shows you why your code is slow! ☆7,616 Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆381 Updated 2 weeks ago
- A Python library to access ISO country, subdivision, language, currency and script definitions and their translations. ☆929 Updated last week
- 🧹 Python package for text cleaning ☆1,001 Updated last week
- More routines for operating on iterables, beyond itertools ☆4,037 Updated this week
- dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xl… ☆1,604 Updated last week
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆856 Updated 2 weeks ago
- Python library providing function decorators for configurable backoff and retry ☆2,702 Updated last year
- A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner ☆2,642 Updated last year
- Retrying library for Python ☆8,330 Updated this week
- Unbearably fast near-real-time pure-Python runtime-static type-checker. ☆3,329 Updated this week
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community. ☆1,641 Updated 9 months ago