axiak / fuzzyset
A simple fuzzy matching set for python strings
☆226Updated 8 months ago
Alternatives and similar repositories for fuzzyset:
Users that are interested in fuzzyset are comparing it to the libraries listed below
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆150Updated 3 months ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Super-fast and clean conversions to numbers for Python.☆108Updated last month
- Levenshtein and Hamming distance computation☆116Updated 5 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd.☆97Updated 2 years ago
- ☆51Updated last year
- Simple library to cleanup and prettify url patterns and emails☆139Updated 2 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆246Updated 11 months ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆87Updated 3 months ago
- Parse natural language time expressions in python☆130Updated 2 years ago
- Fast multi-keyword search engine for text strings☆252Updated 7 months ago
- URL normalization for Python☆94Updated 2 weeks ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Updated 5 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- A fast and memory-optimized string library for heavy-text manipulation in Python☆250Updated 4 years ago
- A lightweight wrapper to operate on nested dictionaries seamlessly. 👌☆199Updated 2 years ago
- A Python parser for data that only looks like JSON☆65Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆81Updated last year
- Unicode transliteration in Python (clone of Tomaž Šolc repository at zemanta.com)☆114Updated 9 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- Extract text from HTML☆135Updated 4 years ago
- This is a pytest plugin that enables you to test your code that relies on a running Elasticsearch search engine. It allows you to specify…☆68Updated this week
- 145+ extra higher-level functional tools beyond standard library's `itertools`, `functools`, etc. and popular third-party libraries like …☆160Updated 2 weeks ago
- A pipeline abstraction for Python☆169Updated 4 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆386Updated 2 years ago
- persistent caching to memory, disk, or database☆266Updated this week
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- A module for querying the DOM tree and writing XPath expressions using native Python syntax.☆127Updated 6 years ago