Intsights / PySubstringSearch
Python library for fast substring/pattern search, written in C++ and leveraging the suffix array algorithm
☆41 Updated 3 weeks ago
Alternatives and similar repositories for PySubstringSearch
Users who are interested in PySubstringSearch are comparing it to the libraries listed below.
- Python library for duplicate line removal written in C++ ☆33 Updated 3 weeks ago
- Python library for fast fuzzy search over a big file written in Rust ☆45 Updated 3 weeks ago
- A blazingly fast domain extraction library written in Rust ☆66 Updated 3 weeks ago
- Concatenated-word segmentation Python library written in Rust ☆17 Updated 3 weeks ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices ☆14 Updated 3 weeks ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package ☆27 Updated 6 months ago
- A Git Repository Secrets Scanner written in Rust ☆39 Updated 3 weeks ago
- Word frequency checker based on Wikipedia corpus written in Rust ☆10 Updated 3 weeks ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support. ☆18 Updated 3 weeks ago
- Train a model, and detect gibberish strings with it. ☆64 Updated 3 years ago
- Run all the tests at the same time with modal.com ☆11 Updated last year
- A file utility for accessing both local and remote files through a unified interface. ☆44 Updated 3 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆75 Updated this week
- Python package for deduplication/entity resolution using active learning ☆81 Updated last year
- A Vectorized Python Dict/Set ☆118 Updated 2 years ago
- Source code and data for Like a Good Nearest Neighbor ☆30 Updated 7 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub ☆44 Updated last year
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python ☆208 Updated this week
- Efficient string matching with regular expressions ☆145 Updated 2 weeks ago
- Hebrew oriented NER spaCy pipeline ☆18 Updated last year
- ☆15 Updated last year
- 🤝 Trade any tensors over the network ☆30 Updated last year
- Fast fuzzy text search ☆11 Updated 2 years ago
- Multi-Language Identification ☆28 Updated last year
- A Fast Levenshtein Distance Library for Python ☆84 Updated 6 months ago
- Streamlit component for Jina neural search ☆42 Updated 3 years ago
- This repository contains the code and data download links to reproduce building the WDC Products Benchmark. ☆13 Updated 2 years ago
- Python library to perform Locality-Sensitive Hashing for faster nearest neighbors search in high dimensional data ☆19 Updated last year
- Confection: the sweetest config system for Python ☆190 Updated 4 months ago
- Python Powerful Timeout Decorator that can be used safely on classes, methods, class methods ☆160 Updated 2 months ago