Intsights / PySubstringSearchLinks
Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm
☆41Updated 2 months ago
Alternatives and similar repositories for PySubstringSearch
Users that are interested in PySubstringSearch are comparing it to the libraries listed below
Sorting:
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 2 months ago
- Python library for a duplicate lines removal written in C++☆33Updated 2 months ago
- A blazingly fast domain extraction library written in Rust☆67Updated 2 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 2 months ago
- Concatenated-word segmentation Python library written in Rust☆17Updated 2 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 3 weeks ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 8 months ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug…☆43Updated 4 years ago
- A Vectorized Python Dict/Set☆117Updated 2 years ago
- A Fast Levenshtein Distance Library for Python☆85Updated 8 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated last week
- An efficient algorithm for k-bounded (Damerau-)Levenshtein distance☆16Updated 7 years ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆215Updated last week
- Train a model, and detect gibberish strings with it.☆67Updated 3 years ago
- 🎯 aimrocks 🎸 — python & cython bindings for RocksDB. Batteries included! 🔋☆32Updated 2 months ago
- Python 3 library to store memory mappable objects into pickle-compatible files☆38Updated 7 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- A library to instantiate any Python object from configuration files.☆24Updated 3 years ago
- A file utility for accessing both local and remote files through a unified interface.☆44Updated last month
- Rust python bindings for symspell☆21Updated last year
- super fast cpp implementation of longest common subsequence/substring☆72Updated 2 years ago
- Bounded Process&Thread Pool Executor☆63Updated last year
- CyDifflib is a fast implementation of difflib's algorithms, which can be used as a drop-in replacement.☆29Updated 6 months ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity☆121Updated last week
- Tool for disambiguating acronyms and abbreviations in text for NLP applications☆22Updated this week
- Python package for deduplication/entity resolution using active learning☆82Updated last year
- Source code and data for Like a Good Nearest Neighbor☆30Updated 9 months ago
- Fast Python Bloom Filter using Mmap☆129Updated last month
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 4 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle)☆474Updated 9 months ago