wharris / esmre
Python extension module for accelerating regular expressions using libesm
☆132 · Updated 2 years ago
Alternatives and similar repositories for esmre
Users who are interested in esmre are comparing it to the libraries listed below.
- Fast, efficiently stored Trie for Python. Uses libdatrie. ☆547 · Updated 3 weeks ago
- Pure python Aho-Corasick library. ☆220 · Updated 2 weeks ago
- Fast multi-keyword search engine for text strings ☆258 · Updated last year
- An efficient simhash implementation for python ☆127 · Updated 6 years ago
- Simhash and near-duplicate detection ☆423 · Updated 2 years ago
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for python. Correct case insensitive implementa… ☆94 · Updated last year
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆389 · Updated 3 years ago
- DAFSA-based dictionary-like read-only objects for Python. Based on `dawgdic` C++ library. ☆305 · Updated last year
- Constants used in Chinese text processing ☆386 · Updated last year
- Non-Overlapping Aho-Corasick Python extension, for Python 2 (str and unicode) and Python 3 ☆51 · Updated 10 years ago
- Python wrapper for RE2 ☆297 · Updated 2 years ago
- A Python Implementation of Simhash Algorithm ☆1,033 · Updated 3 years ago
- A fast Python RPC library ☆336 · Updated 3 years ago
- Python API for Various DB-Backed Simhash Clusters ☆64 · Updated 8 years ago
- Python Set subclass that supports searching by ngram similarity ☆119 · Updated 4 years ago
- An easy-install script for LibShortText ☆27 · Updated 11 years ago
- Static memory-efficient Trie-like structures for Python based on marisa-trie C++ library. ☆1,123 · Updated last month
- A Chinese word segmenter based on CRF ☆234 · Updated 7 years ago
- Python Non-cryptographic Hash Library ☆287 · Updated 2 years ago
- Chinese word segmentation library based on an HMM model ☆166 · Updated 11 years ago
- Fast Redis Bloom Filters in Python ☆290 · Updated 7 years ago
- Collection of persistent (disk-based) and non-persistent (memory-based) queues for Python ☆291 · Updated this week
- Yet another Chinese word segmentation package based on character-based tagging heuristics and CRF algorithm ☆245 · Updated 13 years ago
- Roaring Bitmap in Cython ☆82 · Updated last year
- unofficial git mirror of http://svn.whoosh.ca svn repo ☆49 · Updated 15 years ago
- English word segmentation, written in pure-Python, and based on a trillion-word corpus. ☆378 · Updated 3 years ago
- A toolbox for working with the Chinese language in Python ☆149 · Updated 5 years ago
- Fast Python Bloom Filter using Mmap ☆747 · Updated 6 years ago
- Python search module for fast approximate string matching ☆54 · Updated 3 years ago
- ZPar statistical parser. Universal language support (depending on the availability of training data), with language-specific features for… ☆135 · Updated 9 years ago