Intsights / PySubstringSearchLinks
Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm
☆41Updated last month
Alternatives and similar repositories for PySubstringSearch
Users that are interested in PySubstringSearch are comparing it to the libraries listed below
Sorting:
- Python library for a duplicate lines removal written in C++☆33Updated last month
- Python library for fast fuzzy search over a big file written in Rust☆45Updated last month
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- A blazingly fast domain extraction library written in Rust☆66Updated last month
- Concatenated-word segmentation Python library written in Rust☆17Updated last month
- ☆25Updated last month
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated last month
- A Git Repository Secrets Scanner written in Rust☆39Updated last month
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 7 months ago
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated last month
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆18Updated last month
- Python package for deduplication/entity resolution using active learning☆81Updated last year
- A Vectorized Python Dict/Set☆118Updated 2 years ago
- A module for lazy loading of Python modules☆88Updated 2 years ago
- Confection: the sweetest config system for Python☆190Updated 5 months ago
- Run all the tests at the same time with modal.com☆11Updated last year
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆75Updated 3 weeks ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆209Updated 2 weeks ago
- Label data at scale. Fun and precision included.☆330Updated this week
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python☆144Updated last month
- 🤗 Push your spaCy pipelines to the Hugging Face Hub☆44Updated last year
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark.☆13Updated 2 years ago
- Synchronicity lets you interoperate with asynchronous Python APIs.☆123Updated last month
- Train a model, and detect gibberish strings with it.☆66Updated 3 years ago
- Checkpoint the state of Python programs using Pythonic setjmp and longjmp☆67Updated 4 years ago
- Source code and data for Like a Good Nearest Neighbor☆30Updated 8 months ago
- A file utility for accessing both local and remote files through a unified interface.☆44Updated 4 months ago
- A Fast Levenshtein Distance Library for Python☆84Updated 7 months ago
- A library to instantiate any Python object from configuration files.☆24Updated 2 years ago