abusix / ahocorapy
Pure python Aho-Corasick library.
☆212 · Updated last year
Related projects
Alternatives and complementary repositories for ahocorapy
- Python module (C extension and plain python) implementing Aho-Corasick algorithm ☆951 · Updated 8 months ago
- Python extension module for accelerating regular expressions using libesm ☆132 · Updated last year
- Fast multi-keyword search engine for text strings ☆247 · Updated 2 months ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python ☆163 · Updated this week
- Fast, efficiently stored Trie for Python. Uses libdatrie. ☆531 · Updated 9 months ago
- An efficient simhash implementation for python ☆125 · Updated 5 years ago
- Simhash and near-duplicate detection ☆410 · Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆149 · Updated last year
- ☆165 · Updated 5 months ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library. ☆300 · Updated 5 months ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ☆1,047 · Updated last month
- A python binding for crfsuite ☆771 · Updated last month
- 🐍 A CPython extension for the Hyperscan regular expression matching library. ☆168 · Updated this week
- Constants used in Chinese text processing ☆359 · Updated last year
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆365 · Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆382 · Updated 2 years ago
- Python bindings for cld3 ☆27 · Updated last year
- Whoosh is a fast, featureful full-text indexing and searching library implemented in pure Python. ☆252 · Updated 9 months ago
- A Python Implementation of Simhash Algorithm ☆982 · Updated 2 years ago
- ☆23 · Updated 3 years ago
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆243 · Updated 6 months ago
- Pythonic search engine based on PyLucene. ☆120 · Updated this week
- Python wrapper for RE2 ☆99 · Updated 2 months ago
- Python Set subclass that supports searching by ngram similarity ☆120 · Updated 3 years ago
- Scalable Bloom Filter implemented in Python ☆164 · Updated 2 years ago
- A library implementing different string similarity and distance measures using Python. ☆992 · Updated 2 years ago
- A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features". ☆305 · Updated 2 years ago
- scikit-learn inspired API for CRFsuite ☆426 · Updated last year
- Find parts of long text or data, allowing for some changes/typos. ☆312 · Updated 3 months ago