yougov / fuzzy
☆50Updated last year
Related projects ⓘ
Alternatives and complementary repositories for fuzzy
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 8 months ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.☆144Updated 10 months ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆65Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 7 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- Scalable String Similarity Joins in Python☆39Updated 4 months ago
- Pure Python wrapper to the Yajl C Library☆83Updated 11 months ago
- A simple fuzzy matching set for python strings☆223Updated 3 months ago
- A simple command line interface to the datamade/dedupe library.☆42Updated last year
- Python port for IWNLP.Lemmatizer☆17Updated last year
- A Python library for extracting semantic information from text, such as dates and numbers.☆74Updated 2 years ago
- Automatically exported from code.google.com/p/guess-language☆53Updated 9 months ago
- A maximum-strength name parser for record linkage.☆34Updated 3 months ago
- A simple and fast approach to selecting the best string in a list of strings despite errors or mispelling.☆10Updated 9 years ago
- pyaddress is an address parsing library, taking the guesswork out of using addresses in your applications. We use it as part of our apart…☆100Updated 5 years ago
- Abydos NLP/IR library for Python☆183Updated 2 years ago
- Implementation of phonetic algorithm in python☆40Updated 6 years ago
- Language detection extension for spaCy 2.0+☆111Updated 5 years ago
- 💫 Scripts, tools and resources for developing spaCy☆125Updated 5 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆52Updated 3 years ago
- Python 3 AsyncIO powered scraping framework with batteries included☆20Updated 8 years ago
- 💥 Cython hash tables that assume keys are pre-hashed☆82Updated last year
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆180Updated last year
- Geotext extracts country and city mentions from text☆135Updated last year
- Find which links on a web page are pagination links☆29Updated 7 years ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆149Updated last year
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- A disk-based key/value store in Python with no dependencies.☆21Updated 9 years ago
- A Python 3 phonetics library.☆124Updated 4 years ago
- Soundex Phonetic Code Algorithm Demo for Indian Languages. Supports all indian languages and English. Provides intra-indic string compari…☆55Updated 5 years ago