yougov / fuzzy
☆50Updated last year
Related projects ⓘ
Alternatives and complementary repositories for fuzzy
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 8 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 10 months ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- A maximum-strength name parser for record linkage.☆32Updated 3 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆65Updated 2 years ago
- Automatically exported from code.google.com/p/guess-language☆53Updated 8 months ago
- Scalable String Similarity Joins in Python☆39Updated 3 months ago
- Utility library to turn country names into ISO two-letter codes☆66Updated 3 weeks ago
- Python wrapper for a C++ Double Metaphone☆15Updated last year
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 7 years ago
- Guess gender from first name in Python 2 and 3☆129Updated 2 years ago
- A simple fuzzy matching set for python strings☆223Updated 2 months ago
- Hidden alignment conditional random field for classifying string pairs.☆25Updated last month
- A Python 3 phonetics library.☆124Updated 4 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆82Updated last year
- Aho-Corasick string replacement utility☆23Updated 4 years ago
- Auto-generate Python APIs from JSON schema specifications☆80Updated 5 years ago
- A Cython implementation of the affine gap string distance☆58Updated last year
- Python bindings for the Google's FarmHash☆37Updated 2 months ago
- Search 'from' and 'to' strings to learn a text cleaning mapping☆17Updated 9 years ago
- A Python library for extracting semantic information from text, such as dates and numbers.☆74Updated 2 years ago
- A comprehensive and scalable set of string tokenizers and similarity measures in Python☆137Updated 3 months ago
- Pure Python wrapper to the Yajl C Library☆83Updated 10 months ago
- Extract, parse and populate templates from strings☆27Updated 5 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 3 months ago