axiak / fuzzyset
A simple fuzzy matching set for python strings
☆223Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for fuzzyset
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆243Updated 6 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 10 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated last year
- ☆50Updated last year
- URL normalization for Python☆94Updated 2 years ago
- Super-fast and clean conversions to numbers for Python.☆106Updated last week
- A Cython implementation of the affine gap string distance☆58Updated last year
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Levenshtein and Hamming distance computation☆117Updated 5 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆435Updated 4 months ago
- Python wrapper for RE2☆99Updated 2 months ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 8 months ago
- A fast and memory-optimized string library for heavy-text manipulation in Python☆250Updated 4 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆82Updated last year
- Fast multi-keyword search engine for text strings☆247Updated 2 months ago
- Textpipe: clean and extract metadata from text☆299Updated 3 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want)☆64Updated last year
- Guess gender from first name in Python 2 and 3☆130Updated 2 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 2 years ago
- Plac: Parsing the Command Line the Easy Way☆296Updated 3 months ago
- Dump (freeze) SQL query results from a database into a selection of file formats☆91Updated 5 years ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Updated 5 years ago
- Parse natural language time expressions in python☆131Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- Python BK-tree data structure to allow fast querying of "close" matches☆172Updated 3 years ago
- 🔗 Immutable, Pythonic, correct URLs.☆286Updated last year
- A pipeline abstraction for Python☆168Updated 3 years ago
- 145+ extra higher-level functional tools beyond standard library's `itertools`, `functools`, etc. and popular third-party libraries like …☆161Updated 3 years ago
- A list-like type with better asymptotic performance and similar performance on small lists☆313Updated 2 years ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year