yougov / fuzzy
★52 · Updated last year
Alternatives and similar repositories for fuzzy
Users who are interested in fuzzy are comparing it to the libraries listed below.
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ★152 · Updated 6 months ago
- 💥 Cython hash tables that assume keys are pre-hashed ★86 · Updated last month
- A Python implementation of the Metaphone and Double Metaphone algorithms ★81 · Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ★68 · Updated 2 years ago
- A simple fuzzy matching set for Python strings ★228 · Updated 11 months ago
- Pure Python wrapper to the Yajl C Library ★83 · Updated 7 months ago
- Abydos NLP/IR library for Python ★186 · Updated 2 years ago
- Levenshtein and Hamming distance computation ★116 · Updated 5 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, Hamming distance weighting, and more. ★98 · Updated 4 years ago
- A Python library for extracting semantic information from text, such as dates and numbers. ★76 · Updated 3 years ago
- Automatically exported from code.google.com/p/guess-language ★53 · Updated last year
- Regular Expression based parsers for extracting data from natural languages ★70 · Updated 8 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ★98 · Updated 2 years ago
- Python library to infer date format from examples ★43 · Updated 3 years ago
- Lightning Fast Language Prediction ★167 · Updated 6 years ago
- Python BK-tree data structure to allow fast querying of "close" matches ★185 · Updated 3 years ago
- Handle many API calls from a single HTTP request ★55 · Updated 6 years ago
- An automated ingestion service for blogs to construct a corpus for NLP research. ★86 · Updated 7 years ago
- Ordered Multivalue Dictionary. Powers furl. ★68 · Updated 3 years ago
- 🎡 Automated build repo for Python wheels and source packages ★173 · Updated last year
- Thin wrapper for the Microsoft Cognitive Services ★60 · Updated 7 years ago
- geonamescache - a Python library for quick access to a subset of GeoNames data. ★109 · Updated 11 months ago
- A pipeline abstraction for Python ★168 · Updated 4 years ago
- Pyed Piper tool by Toby Rosen at Sony Imageworks converted to Python 3 ★35 · Updated 3 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings. ★52 · Updated 4 years ago
- Python bindings to the Compact Language Detector ★33 · Updated 5 years ago
- Get list of common stop words in various languages in Python ★156 · Updated last year
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ★170 · Updated 3 years ago
- Digs into Dicts (lists and tuples) ★15 · Updated 10 years ago
- Python bindings for Google's FarmHash ★39 · Updated 10 months ago