Intsights / PySubstringSearch
Python library for fast substring/pattern search, written in C++ and leveraging a suffix array algorithm
☆41 · Updated 3 weeks ago
Alternatives and similar repositories for PySubstringSearch:
Users who are interested in PySubstringSearch compare it to the libraries listed below
- Python library for duplicate line removal written in C++ ☆33 · Updated 3 weeks ago
- Intsights open-source wrapper library for some AWS resources and high-level management objects for distributed backend systems ☆17 · Updated last year
- Python library for fast fuzzy search over a big file written in Rust ☆45 · Updated last month
- ☆25 · Updated last month
- A blazingly fast domain extraction library written in Rust ☆65 · Updated 3 weeks ago
- A Git Repository Secrets Scanner written in Rust ☆39 · Updated 3 weeks ago
- Concatenated-word segmentation Python library written in Rust ☆17 · Updated 3 weeks ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices ☆14 · Updated last month
- Queue server based on RocksDB as a KV-Store backend and gRPC as an interface ☆10 · Updated last year
- Word frequency checker based on Wikipedia corpus written in Rust ☆10 · Updated 3 weeks ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package ☆27 · Updated last month
- A Python-based alternative to the Elasticsearch Reindex API with multiprocessing support. ☆17 · Updated last month
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python ☆144 · Updated last month
- Simplest and fastest image and text annotation tool. ☆233 · Updated this week
- Experiments to assess SPADE on different LLM pipelines. ☆16 · Updated 11 months ago
- 🧬 A VS Code extension for annotating data with Prodigy ☆30 · Updated 3 years ago
- A framework for multi-task learning, where you may precondition tasks and compose them into bigger tasks. Conditional objectives and per-… ☆49 · Updated 3 years ago
- Fast Python Bloom Filter using Mmap ☆129 · Updated 10 months ago
- Fast event-sourcing library using Redis and Mongo. ☆81 · Updated 4 years ago
- Python 3 library to store memory mappable objects into pickle-compatible files ☆38 · Updated 6 years ago
- A Python client for the Sypht API ☆162 · Updated 8 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆69 · Updated 2 months ago
- A better PyTorch data loader capable of custom image operations and image subsets ☆35 · Updated 3 years ago
- A Fast Levenshtein Distance Library for Python ☆82 · Updated last month
- Official source code repository for QueryBlazer: Efficient Query Autocompletion Framework ☆19 · Updated 3 years ago
- GitHub Actions pipeline to build the grpcio wheel on Apple Silicon ☆66 · Updated 2 years ago
- Extending Python's process pool to support asyncio functions ☆12 · Updated 3 years ago
- Parse numbers written in natural language ☆110 · Updated 5 months ago
- Official repository of kANNolo. ☆26 · Updated 4 months ago
- Fast and versatile tokenizer for language models, compatible with SentencePiece, Tokenizers, Tiktoken and more. Supports BPE, Unigram and… ☆19 · Updated 2 weeks ago