Intsights / PySubstringSearchLinks
Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm
☆42Updated 5 months ago
Alternatives and similar repositories for PySubstringSearch
Users that are interested in PySubstringSearch are comparing it to the libraries listed below
Sorting:
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 5 months ago
- Python library for a duplicate lines removal written in C++☆33Updated 5 months ago
- A blazingly fast domain extraction library written in Rust☆67Updated 5 months ago
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated 2 years ago
- Concatenated-word segmentation Python library written in Rust☆17Updated 5 months ago
- A Git Repository Secrets Scanner written in Rust☆40Updated 5 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 2 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 11 months ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated 2 years ago
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 5 months ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆18Updated 5 months ago
- Train a model, and detect gibberish strings with it.☆68Updated 3 years ago
- Fast Python Bloom Filter using Mmap☆133Updated 4 months ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆42Updated 5 years ago
- A file utility for accessing both local and remote files through a unified interface.☆46Updated this week
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆217Updated last month
- A Vectorized Python Dict/Set☆116Updated 2 years ago
- Run all the tests at the same time with modal.com☆11Updated last year
- Encode and decode pairs of surrogate characters in Python 3☆10Updated 3 years ago
- CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement.☆32Updated 9 months ago
- A Fast Levenshtein Distance Library for Python☆86Updated 11 months ago
- Checkpoint the state of Python programs using Pythonic setjmp and longjmp☆68Updated 5 years ago
- Pythonic search engine based on PyLucene.☆132Updated last month
- Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.html☆122Updated last week
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated 2 weeks ago
- decontamination☆24Updated 2 months ago
- ☆17Updated 3 years ago
- A library that automatically infers dependencies for Python files☆177Updated last year
- A python package for running directed acyclic graphs of asynchronous I/O operations☆17Updated 4 years ago
- Python package for deduplication/entity resolution using active learning☆83Updated last year