rapidfuzz / RapidFuzzLinks
Rapid fuzzy string matching in Python using various string metrics
β3,599Updated last week
Alternatives and similar repositories for RapidFuzz
Users that are interested in RapidFuzz are comparing it to the libraries listed below
Sorting:
- Fuzzy String Matching in Pythonβ3,531Updated 9 months ago
- πͺΌ a python library for doing approximate and phonetic matching of strings.β2,177Updated last week
- The most accurate natural language detection library for Python, suitable for short text and mixed-language textβ1,591Updated last month
- Python binding to Modest and Lexbor engines. Fast HTML5 parser with CSS selectors for Python.β1,493Updated this week
- Fuzzy string matching, grouping, and evaluation.β786Updated 5 months ago
- A light-weight, flexible, and expressive statistical data testing libraryβ4,121Updated last week
- Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulateβ¦β2,492Updated 4 months ago
- π§Ή Python package for text cleaningβ999Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithmβ¦β853Updated 3 weeks ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ1,275Updated 4 years ago
- A powerful and modular toolkit for record linkage and duplicate detection in Pythonβ1,037Updated last year
- Pure Python Spell Checking http://pyspellchecker.readthedocs.io/en/latest/β767Updated 3 weeks ago
- A library implementing different string similarity and distance measures using Python.β1,021Updated 3 years ago
- Iterative JSON parser with Pythonic interfacesβ1,035Updated last week
- Port of Google's language-detection library to Python.β1,862Updated 9 months ago
- Community maintained fork of pdfminer - we fathom PDFβ6,830Updated last week
- π Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.β3,507Updated 8 months ago
- A Python library to access ISO country, subdivision, language, currency and script definitions and their translations.β912Updated 2 weeks ago
- Correctly generate plurals, ordinals, indefinite articles; convert numbers to wordsβ1,053Updated 7 months ago
- CleverCSV is a Python package for handling messy CSV files. It provides a drop-in replacement for the builtin CSV module with improved diβ¦β1,315Updated 2 weeks ago
- A Python package for easy multiprocessing, but faster than multiprocessingβ2,076Updated last year
- python parser for human readable datesβ2,755Updated last month
- Fuzzy String Matching in Pythonβ9,271Updated 2 years ago
- A Python library for reading and writing PDF, powered by QPDFβ2,553Updated this week
- extract text from any document. no muss. no fuss.β4,395Updated last year
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XMβ¦β5,060Updated 3 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarityβ369Updated last week
- Convert HTML to Markdownβ1,991Updated last month
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpyβ7,700Updated this week
- Python tree data libraryβ1,065Updated 8 months ago