ermanh / trieregex
Build efficient trie-based regular expressions from large word lists
☆62Updated 2 years ago
Alternatives and similar repositories for trieregex:
Users that are interested in trieregex are comparing it to the libraries listed below
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆71Updated 2 months ago
- Custom Python functions for working with SQLite FTS4☆22Updated 2 years ago
- A fast, simple and lightweight Bloom filter library for Python, implemented in Rust.☆256Updated 6 months ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.☆81Updated 4 years ago
- Abydos NLP/IR library for Python☆185Updated 2 years ago
- Find parts of long text or data, allowing for some changes/typos.☆318Updated 8 months ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆171Updated 2 months ago
- ☆15Updated this week
- Find strings/words in text; convenience and C speed☆126Updated 2 years ago
- Minimal, super readable string pattern matching for python.☆183Updated 10 months ago
- Easy Python tree data structures☆16Updated 3 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆463Updated 3 months ago
- Loadable spellfix1 extension for sqlite as python package☆26Updated last year
- A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any othe…☆67Updated 2 years ago
- Efficient string matching with regular expressions☆143Updated this week
- Pure Python Go parser, AST and unparser library☆56Updated 4 years ago
- srt3 is a simple library for parsing, modifying, and composing SRT files.☆18Updated 3 years ago
- Parse natural language time expressions in python☆130Updated 2 years ago
- Python bindings for Milli, the embeddable Rust-based search engine powering Meilisearch☆131Updated 10 months ago
- Python bindings for Tantivy☆326Updated this week
- Faster, modernized fork of the language identification tool langid.py☆55Updated 5 months ago
- Python wrapper for RE2☆103Updated last week
- Open, extensible Python injector/profiler/analyzer☆65Updated 3 years ago
- A fast and memory-optimized string library for heavy-text manipulation in Python☆250Updated 5 years ago
- Super lightweight function registries for your library☆179Updated 10 months ago
- Python package that offers text scrubbing functionality, providing building blocks for string cleaning as well as normalizing geographica…☆22Updated 8 months ago
- Build a trie-structured regular expression from a list of words☆21Updated 5 years ago
- Confection: the sweetest config system for Python☆186Updated 2 weeks ago
- A Python library for simple evaluation of natural language predicates☆66Updated 4 years ago
- PyPruningRadixTrie - Python version of super fast Radix trie for prefix search & auto-complete☆39Updated 4 months ago