axiak / fuzzyset
A simple fuzzy matching set for Python strings
☆230 Updated last year
Alternatives and similar repositories for fuzzyset
Users who are interested in fuzzyset are comparing it to the libraries listed below.
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆249 Updated this week
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆155 Updated 3 weeks ago
- Get list of common stop words in various languages in Python ☆156 Updated last year
- Levenshtein and Hamming distance computation ☆116 Updated 5 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ☆17 Updated 10 years ago
- Fast multi-keyword search engine for text strings ☆257 Updated last year
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 5 years ago
- Simple library to cleanup and prettify url patterns and emails ☆138 Updated 3 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms ☆81 Updated last year
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 3 years ago
- Python wrapper for aspell (C extension and python version) ☆82 Updated 2 years ago
- persistent caching to memory, disk, or database ☆275 Updated 3 months ago
- Weighted Levenshtein library ☆111 Updated 2 years ago
- Snowball stemming library collection for Python ☆121 Updated 6 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆67 Updated 2 years ago
- Import tables from any Wikipedia article as a dataset in Python ☆292 Updated 3 years ago
- Time everything in IPython ☆125 Updated last year
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆98 Updated 2 years ago
- Plac: Parsing the Command Line the Easy Way ☆298 Updated 6 months ago
- Language detection extension for spaCy 2.0+ ☆113 Updated 6 years ago
- Textpipe: clean and extract metadata from text ☆302 Updated 4 years ago
- 🎡 Automated build repo for Python wheels and source packages ☆174 Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆389 Updated 3 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆376 Updated 2 years ago
- Library for unit extraction - fork of quantulum for python3 ☆142 Updated last year
- Simple, Pythonic extraction of text, shapes and images from PDFs ☆80 Updated 5 years ago
- Automatic Item List Extraction ☆87 Updated 9 years ago
- A fast streaming JSON parser for Python that generates SAX-like events using yajl ☆224 Updated last year
- Just is a wrapper to automagically read/write a file based on extension ☆51 Updated 2 months ago