Intsights / PySubstringSearch
Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm
☆41Updated 9 months ago
Alternatives and similar repositories for PySubstringSearch:
Users that are interested in PySubstringSearch are comparing it to the libraries listed below
- Python library for a duplicate lines removal written in C++☆33Updated 9 months ago
- ☆25Updated last month
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 9 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 9 months ago
- A blazingly fast domain extraction library written in Rust☆65Updated 5 months ago
- Concatenated-word segmentation Python library written in Rust☆16Updated 9 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 5 months ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 9 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated 9 months ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆17Updated 9 months ago
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python☆144Updated 2 months ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or f…☆24Updated 3 years ago
- A python package for running directed acyclic graphs of asynchronous I/O operations☆16Updated 3 years ago
- Fast fuzzy text search☆11Updated last year
- Python bindings for MetroHash☆19Updated 4 months ago
- Fast Levenshtein Distance Library for Python 3☆82Updated 2 years ago
- An email segmentation system (reference implementation of ECIR 2018 paper)☆10Updated 5 years ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆167Updated last month
- Extract city and country mentions from Text like GeoText without regex, but FlashText, a Aho-Corasick implementation.☆60Updated this week
- Library for fast text representation and classification.☆28Updated last year
- 🤝 Trade any tensors over the network☆30Updated last year
- Simplest and fastest image and text annotation tool.☆230Updated this week
- code and data used to build a training dataset for dragnet models☆10Updated 4 years ago
- Pre-train Static Word Embeddings☆34Updated this week
- An Elasticsearch Python ORM based on Pydantic.☆124Updated last year
- Implementation of the paper "Deep Indexed Active Learning for Matching Heterogeneous Entity Representations"☆16Updated 3 years ago
- auto fix invalid json / 自动修复补全残缺无效的 JSON☆48Updated last year
- 🐍 Python bidding for the Hora Approximate Nearest Neighbor Search Algorithm library☆68Updated 3 years ago
- ☆40Updated 7 months ago