Intsights / PySubstringSearch
Python library for fast substring/pattern search, written in C++ and leveraging a suffix array algorithm
☆42 · Updated 5 months ago
Alternatives and similar repositories for PySubstringSearch
Users interested in PySubstringSearch are comparing it to the libraries listed below.
- Python library for duplicate line removal written in C++ ☆33 · Updated 5 months ago
- Python library for fast fuzzy search over a big file written in Rust ☆45 · Updated 5 months ago
- A blazingly fast domain extraction library written in Rust ☆67 · Updated 5 months ago
- Concatenated-word segmentation Python library written in Rust ☆17 · Updated 5 months ago
- A Git Repository Secrets Scanner written in Rust ☆40 · Updated 5 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package ☆28 · Updated 11 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices ☆14 · Updated 2 months ago
- Queue server based on RocksDB as a KV-Store backend and gRPC as an interface ☆10 · Updated 2 years ago
- Word frequency checker based on Wikipedia corpus written in Rust ☆10 · Updated 5 months ago
- Train a model, and detect gibberish strings with it. ☆68 · Updated 3 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆76 · Updated last week
- A Python-based alternative to Elasticsearch Reindex API with multiprocessing support. ☆18 · Updated 5 months ago
- Using queues, tqdm-multiprocess supports multiple worker processes, each with multiple tqdm progress bars, displaying them cleanly throug… ☆42 · Updated 5 years ago
- Fast Python Bloom Filter using Mmap ☆133 · Updated 4 months ago
- Gather module dependencies of source code ☆13 · Updated 2 years ago
- Fast Autocomplete: When Elasticsearch suggestions are not fast and flexible enough ☆287 · Updated 4 months ago
- Python package for deduplication/entity resolution using active learning ☆83 · Updated last year
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark. ☆14 · Updated 2 years ago
- The Levenshtein Python C extension module contains functions for fast computation of Levenshtein distance and string similarity ☆123 · Updated 3 months ago
- Confection: the sweetest config system for Python ☆193 · Updated last month
- Run all the tests at the same time with modal.com ☆11 · Updated last year
- Fast fuzzy text search ☆11 · Updated 2 years ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python ☆218 · Updated last month
- Rust python bindings for symspell ☆21 · Updated 2 years ago
- 🦉 Modern high-performance serialization utilities for Python (JSON, MessagePack, Pickle) ☆482 · Updated 2 months ago
- A file utility for accessing both local and remote files through a unified interface. ☆46 · Updated last month
- Multi-Language Identification ☆28 · Updated last year
- A Vectorized Python Dict/Set ☆117 · Updated 2 years ago
- ☆22 · Updated 2 years ago
- Synchronicity lets you interoperate with asynchronous Python APIs. ☆134 · Updated last month