yougov / fuzzy
☆51 Updated last year
Alternatives and similar repositories for fuzzy:
Users who are interested in fuzzy are comparing it to the libraries listed below.
- A Python implementation of the Metaphone and Double Metaphone algorithms ☆81 Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ☆67 Updated 2 years ago
- Abydos NLP/IR library for Python ☆185 Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆149 Updated 3 months ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 Updated 8 years ago
- Python bindings to the Compact Language Detector ☆33 Updated 4 years ago
- 💥 Cython hash tables that assume keys are pre-hashed ☆87 Updated 3 months ago
- A simple command line interface to the datamade/dedupe library. ☆42 Updated 2 years ago
- Python wrapper for aspell (C extension and Python version) ☆81 Updated last year
- 💙 Emoji handling and meta data for spaCy with custom extension attributes ☆181 Updated last year
- Implementation of phonetic algorithm in Python ☆41 Updated 7 years ago
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 3 years ago
- Scalable String Similarity Joins in Python ☆39 Updated 9 months ago
- Python port for IWNLP.Lemmatizer ☆17 Updated last year
- Python bindings for Google's FarmHash ☆37 Updated 7 months ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆97 Updated 2 years ago
- A maximum-strength name parser for record linkage. ☆36 Updated 2 weeks ago
- A Python library for extracting semantic information from text, such as dates and numbers. ☆75 Updated 2 years ago
- A Cython implementation of the affine gap string distance ☆57 Updated 2 years ago
- Automatically exported from code.google.com/p/guess-language ☆53 Updated last year
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart… ☆100 Updated 5 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆169 Updated 3 years ago
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all Indian languages and English. Provides intra-indic string compari… ☆57 Updated 6 years ago
- Language detection extension for spaCy 2.0+ ☆112 Updated 6 years ago
- Levenshtein and Hamming distance computation ☆116 Updated 5 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 8 months ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆66 Updated 2 years ago
- Implement SQLite table-valued functions with Python ☆59 Updated last year
- Geotext extracts country and city mentions from text ☆139 Updated 2 years ago
- Multi-Language Identification ☆28 Updated 8 months ago