gpoulter / python-ngram
Python Set subclass that supports searching by ngram similarity
☆120Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for python-ngram
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance.☆243Updated 6 months ago
- ☆165Updated 5 months ago
- Fast multi-keyword search engine for text strings☆247Updated 2 months ago
- Python search module for fast approximate string matching☆53Updated last year
- Text normalization library for Python☆203Updated 6 years ago
- A Python implementation of the Metaphone and Double Metaphone algorithms☆80Updated 8 months ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus.☆365Updated last year
- Python bindings for libwapiti☆66Updated 4 years ago
- Python wrapper for aspell (C extension and python version)☆81Updated last year
- Python extension module for accelerating regular expressions using libesm☆132Updated last year
- An efficient simhash implementation for python☆125Updated 5 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆170Updated 2 years ago
- Python bindings for cld3☆27Updated last year
- Fast, efficiently stored Trie for Python. Uses libdatrie.☆531Updated 9 months ago
- CogComp's light-weight Python NLP annotators☆116Updated 5 years ago
- simple text preprocessing tool☆18Updated 7 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library.☆300Updated 5 months ago
- ☆130Updated 3 years ago
- Python bindings to the Compact Language Detector☆33Updated 4 years ago
- C++ Ternary Search Tree implementation with Python bindings☆43Updated 6 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 3 months ago
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation☆37Updated 7 years ago
- [NO LONGER MAINTAINED AS OPEN SOURCE - USE SCALETEXT.COM INSTEAD]☆109Updated 11 years ago
- Python package for lexicon; Trie and DAWG implementation.☆55Updated 5 months ago
- Pure python Aho-Corasick library.☆212Updated last year
- Python wrapper around SVDLIBC, a fast library for sparse Singular Value Decomposition☆55Updated 11 years ago
- Python tool for normilizing text and text canonicalization (DISCONTINUED)☆41Updated 11 years ago
- A Cython implementation of the affine gap string distance☆58Updated last year
- Query-Document Relevance☆42Updated 9 years ago