WojciechMula / pyahocorasick
Python module (C extension and plain Python) implementing the Aho-Corasick algorithm
☆949 · Updated 7 months ago
Related projects
Alternatives and complementary repositories for pyahocorasick
- Pure Python Aho-Corasick library. ☆212 · Updated last year
- Static memory-efficient Trie-like structures for Python based on the marisa-trie C++ library. ☆1,046 · Updated 3 weeks ago
- Fast, efficiently stored Trie for Python. Uses libdatrie. ☆530 · Updated 9 months ago
- A Python binding for crfsuite ☆770 · Updated last month
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆1,263 · Updated 3 years ago
- Python library implementing a trie data structure. ☆815 · Updated 3 years ago
- Python extension module for accelerating regular expressions using libesm ☆132 · Updated last year
- High performance Trie and Ahocorasick automata (AC automata) Keyword Match & Replace Tool for Python. Correct case insensitive implementa… ☆94 · Updated 3 weeks ago
- A Python implementation of the Simhash algorithm ☆980 · Updated 2 years ago
- Python port of SymSpell: 1 million times faster spelling correction & fuzzy search through Symmetric Delete spelling correction algorithm… ☆800 · Updated this week
- DAFSA-based dictionary-like read-only objects for Python. Based on the `dawgdic` C++ library. ☆300 · Updated 4 months ago
- Simhash and near-duplicate detection ☆409 · Updated last year
- Fast multi-keyword search engine for text strings ☆247 · Updated last month
- ☆165 · Updated 4 months ago
- scikit-learn inspired API for CRFsuite ☆426 · Updated last year
- Fast implementation of the edit distance (Levenshtein distance) ☆661 · Updated 8 months ago
- Spellchecking library for Python ☆601 · Updated 4 months ago
- English word segmentation, written in pure Python, and based on a trillion-word corpus. ☆365 · Updated last year
- pyxDamerauLevenshtein implements the Damerau-Levenshtein (DL) edit distance algorithm for Python in Cython for high performance. ☆243 · Updated 6 months ago
- Python Set subclass that supports searching by ngram similarity ☆120 · Updated 3 years ago
- CRFsuite: a fast implementation of Conditional Random Fields (CRFs) ☆647 · Updated 4 months ago
- Constants used in Chinese text processing ☆359 · Updated last year
- Sentence embedding by the Smooth Inverse Frequency weighting scheme ☆1,084 · Updated 5 years ago
- Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results. ☆1,696 · Updated last year
- 🪼 A Python library for doing approximate and phonetic matching of strings. ☆2,067 · Updated last week
- Named Entity Recognition Tool ☆1,156 · Updated 5 years ago
- A library implementing different string similarity and distance measures using Python. ☆993 · Updated last year
- Python3 bindings for the Compact Language Detector v3 (CLD3) ☆149 · Updated last year
- NCRF++, a Neural Sequence Labeling Toolkit. Easy to use for any sequence labeling task (e.g. NER, POS, Segmentation). It includes character … ☆1,888 · Updated 2 years ago