Intsights / PyDeduplines
Python library for a duplicate lines removal written in C++
☆33Updated 9 months ago
Alternatives and similar repositories for PyDeduplines:
Users that are interested in PyDeduplines are comparing it to the libraries listed below
- ☆25Updated last month
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 9 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 9 months ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 9 months ago
- A blazingly fast domain extraction library written in Rust☆65Updated 5 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 5 months ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 9 months ago
- Concatenated-word segmentation Python library written in Rust☆16Updated 9 months ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆17Updated 9 months ago
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated 9 months ago
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python☆144Updated 2 months ago
- This collection of general purpose python magic was too good to keep for ourselves!☆15Updated 5 months ago
- Simplest and fastest image and text annotation tool.☆230Updated this week
- A python client for the Sypht API☆162Updated 6 months ago
- Extending Python's process pool to support asyncio functions☆12Updated 3 years ago
- Fast, correct Python JSON library supporting dataclasses and datetimes☆14Updated 2 years ago
- Python bindings for RocksDB☆34Updated 2 years ago
- Collections of pydantic models☆47Updated 6 months ago
- HeBERT: Pre-training BERT for modern Hebrew☆75Updated last year
- Fast event-sourcing library using Redis and Mongo.☆82Updated 4 years ago
- MutiCloud_Overlay demonstrates a use case of overlay over one or more clouds such as AWS, Azure, GCP, OCI, Alibaba and a vSphere private …☆135Updated 2 years ago
- A Python library for converting HTML files into PDF with Chrome's engine.☆19Updated 5 months ago
- Safe and fast evaluation of untrusted user-supplied python expressions☆29Updated last month
- Binder repository for the hover package☆44Updated last year
- A Serverless skeleton project using Python☆41Updated 3 years ago
- A tool that allows users to choose a Ministry of the Interior Office to make a booking without a long waiting time☆14Updated 2 years ago
- An email segmentation system (reference implementation of ECIR 2018 paper)☆10Updated 5 years ago
- Fast graph database in pure Python☆14Updated 3 years ago