axiak / fuzzyset
A simple fuzzy matching set for python strings
☆222Updated last month
Related projects: ⓘ
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 8 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated last year
- Python bindings to the Compact Language Detector☆32Updated 4 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 6 months ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆99Updated 5 years ago
- Super-fast and clean conversions to numbers for Python.☆104Updated 8 months ago
- ☆50Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆380Updated 2 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆242Updated 4 months ago
- Fast multi-keyword search engine for text strings☆248Updated last week
- Levenshtein and Hamming distance computation☆117Updated 4 years ago
- URL normalization for Python☆94Updated 2 years ago
- 🎡 Automated build repo for Python wheels and source packages☆174Updated 2 months ago
- Parse, normalize and render postal addresses.☆183Updated 11 months ago
- A Python library for extracting semantic information from text, such as dates and numbers.☆74Updated 2 years ago
- Textpipe: clean and extract metadata from text☆300Updated 3 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆425Updated 2 months ago
- Snowball stemming library collection for Python☆123Updated 5 years ago
- A Cython implementation of the affine gap string distance☆58Updated last year
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- Plac: Parsing the Command Line the Easy Way☆296Updated last month
- Just is a wrapper to automagically read/write a file based on extension☆50Updated 4 months ago
- Spans is a pure Python implementation of PostgreSQL's range types.☆112Updated last year
- Python BK-tree data structure to allow fast querying of "close" matches☆170Updated 2 years ago
- An efficient simhash implementation for python☆124Updated 4 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 2 years ago
- A lucene query parser generating ElasticSearch queries and more !☆188Updated 2 weeks ago
- A Python implementation of Lunr.js 🌖☆188Updated last week
- Python Set subclass that supports searching by ngram similarity☆119Updated 3 years ago
- Extracts the top level domain (TLD) from the URL given.☆177Updated last year