Intsights / PySubstringSearchLinks
Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm
☆41Updated 3 months ago
Alternatives and similar repositories for PySubstringSearch
Users that are interested in PySubstringSearch are comparing it to the libraries listed below
Sorting:
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 3 months ago
- Python library for a duplicate lines removal written in C++☆33Updated 3 months ago
- A blazingly fast domain extraction library written in Rust☆67Updated 3 months ago
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated 2 years ago
- Concatenated-word segmentation Python library written in Rust☆17Updated 3 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 9 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 3 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 2 weeks ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated 2 years ago
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 3 months ago
- Python package for deduplication/entity resolution using active learning☆82Updated last year
- A Fast Levenshtein Distance Library for Python☆85Updated 9 months ago
- Efficient string matching with regular expressions☆146Updated last week
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆77Updated 3 weeks ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆217Updated this week
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆18Updated 3 months ago
- A Vectorized Python Dict/Set☆117Updated 2 years ago
- Python 3 library to store memory mappable objects into pickle-compatible files☆38Updated 7 years ago
- Super lightweight function registries for your library☆180Updated last year
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆17Updated 3 years ago
- Probabilistic data structures in python http://pyprobables.readthedocs.io/en/latest/index.html☆122Updated 2 months ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 4 years ago
- Synchronicity lets you interoperate with asynchronous Python APIs.☆128Updated last week
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆79Updated last year
- A python package for running directed acyclic graphs of asynchronous I/O operations☆17Updated 4 years ago
- Confection: the sweetest config system for Python☆192Updated 3 weeks ago
- ☆43Updated 6 months ago
- 🎯 aimrocks 🎸 — python & cython bindings for RocksDB. Batteries included! 🔋☆32Updated 3 months ago
- Train a model, and detect gibberish strings with it.☆67Updated 3 years ago
- Toolkit for graph-relational data across space and time☆116Updated last year