Intsights / PyDeduplines
Python library for duplicate line removal, written in C++
☆33 Updated last month
Alternatives and similar repositories for PyDeduplines:
Users who are interested in PyDeduplines are comparing it to the libraries listed below.
- Intsights open-source wrapper library for some AWS resources and high-level management objects for distributed backend systems ☆17 Updated last year
- ☆25 Updated 2 months ago
- A Git repository secrets scanner written in Rust ☆39 Updated last month
- Python library, written in Rust, for fast fuzzy search over a big file ☆45 Updated 2 months ago
- A blazingly fast domain extraction library written in Rust ☆65 Updated last month
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices ☆14 Updated 2 months ago
- Queue server using RocksDB as a KV-store backend and gRPC as an interface ☆10 Updated last year
- Concatenated-word segmentation Python library written in Rust ☆17 Updated last month
- Word frequency checker, based on the Wikipedia corpus, written in Rust ☆10 Updated last month
- A fast and easy adblockplus parser and matcher based on the adblock-rust package ☆27 Updated 2 months ago
- A Python-based alternative to the Elasticsearch Reindex API with multiprocessing support. ☆17 Updated 2 months ago
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python ☆144 Updated 2 months ago
- Simplest and fastest image and text annotation tool. ☆233 Updated this week
- A declarative and intuitive way to describe data filtering and sorting in your application. ☆15 Updated 2 months ago
- Fast event-sourcing library using Redis and Mongo. ☆81 Updated 4 years ago
- Binder repository for the hover package ☆44 Updated 2 years ago
- URLExtract is a Python class for collecting (extracting) URLs from given text by locating TLDs. ☆258 Updated last year
- A Python client for the Sypht API ☆162 Updated 9 months ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library. ☆171 Updated 2 months ago
- Some CF templates ☆114 Updated 4 years ago
- URL normalization for Python ☆94 Updated 3 weeks ago
- A Java client for the Sypht API ☆87 Updated 3 years ago
- MutiCloud_Overlay demonstrates a use case of overlay over one or more clouds such as AWS, Azure, GCP, OCI, Alibaba and a vSphere private … ☆136 Updated 2 years ago
- A complete reproducible example of training a word2vec model for Hebrew ☆12 Updated 2 years ago
- Python Simple Object Storage - provides a list and dictionary interface that seamlessly stores data in a file, like a simplified database… ☆58 Updated 2 years ago
- A tiny Go utility to quickly generate a large amount of realistic-looking Nginx logs. ☆67 Updated last year
- Next-gen Public Key Infrastructure protocol ☆130 Updated 4 years ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python ☆181 Updated this week
- Multi-Language Identification ☆28 Updated 9 months ago
- Open-source and npm-published React component library. ☆119 Updated 2 years ago