oubiwann / metaphone
A Python implementation of the Metaphone and Double Metaphone algorithms
★81 · Updated last year
Alternatives and similar repositories for metaphone
Users who are interested in metaphone are comparing it to the libraries listed below.
- 💥 Cython hash tables that assume keys are pre-hashed ★87 · Updated last week
- Language detection extension for spaCy 2.0+ ★112 · Updated 6 years ago
- A Python 3 phonetics library. ★132 · Updated 5 years ago
- ★52 · Updated last year
- Python bindings to the Compact Language Detector ★33 · Updated 5 years ago
- A disk-based key/value store in Python with no dependencies. ★21 · Updated 10 years ago
- Hunspell extension for spaCy 2.0. ★94 · Updated 10 months ago
- Text normalization library for Python ★204 · Updated 7 years ago
- Python search module for fast approximate string matching ★54 · Updated 2 years ago
- A Cython implementation of the affine gap string distance ★57 · Updated 2 years ago
- Python wrapper for aspell (C extension and Python version) ★82 · Updated last year
- Original, standard and customisable versions of the Jaro-Winkler functions. ★31 · Updated 2 years ago
- Fast Word Clustering Software ★78 · Updated 3 months ago
- Server/Client around spaCy to load spaCy only once ★46 · Updated 7 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ★151 · Updated 4 months ago
- Levenshtein and Hamming distance computation ★116 · Updated 5 years ago
- Python Set subclass that supports searching by ngram similarity ★119 · Updated 3 years ago
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe… ★68 · Updated 2 years ago
- WebAnnotator is a tool for annotating Web pages. WebAnnotator is implemented as a Firefox extension (https://addons.mozilla.org/en-US/fi… ★48 · Updated 3 years ago
- Python wrapper for Apache OpenNLP tools ★34 · Updated 8 years ago
- Python bindings for libwapiti ★67 · Updated 5 years ago
- A fully customisable language detection pipeline for spaCy ★92 · Updated 6 years ago
- Python bindings for cld3 ★27 · Updated last year
- A sentiment classifier tool and library trained on Twitter data ★22 · Updated last year
- A tool to segment text based on frequencies and the Viterbi algorithm "#TheBoyWhoLived" => ['#', 'The', 'Boy', 'Who', 'Lived'] ★81 · Updated 9 years ago
- Fast supervised sentence boundary detection using the averaged perceptron ★90 · Updated 6 years ago
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3 ★51 · Updated 9 years ago
- SPARK-n-SPELL [WARNING: inactive project, not being updated] ★7 · Updated 8 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ★170 · Updated 3 years ago
- Search 'from' and 'to' strings to learn a text cleaning mapping ★17 · Updated 9 years ago