Intsights / PyDomainExtractor
A blazingly fast domain extraction library written in Rust
☆65Updated 3 weeks ago
Alternatives and similar repositories for PyDomainExtractor:
Users that are interested in PyDomainExtractor are comparing it to the libraries listed below
- Python library for a duplicate lines removal written in C++☆33Updated 3 weeks ago
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 3 weeks ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated last month
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- A Git Repository Secrets Scanner written in Rust☆39Updated 3 weeks ago
- ☆25Updated last month
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated last month
- Concatenated-word segmentation Python library written in Rust☆17Updated 3 weeks ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated last month
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 3 weeks ago
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- Fast, Safe & Simple Asynchronous Task Queues Written In Pure Python☆144Updated last month
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆17Updated last month
- Extract text from HTML☆135Updated 4 years ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆255Updated last year
- Parse numbers written in natural language☆110Updated 5 months ago
- universal character encoding detector☆58Updated 6 months ago
- A small Python library to deal with publicsuffix data (includes a bundled PSL as "package data") in a wheel friendly format. Fork and con…☆31Updated 3 years ago
- URL normalization for Python☆94Updated this week
- Fast Python Bloom Filter using Mmap☆129Updated 10 months ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆171Updated last month
- Fetches security vulnerabilities and creates pip-constraints based on them.☆12Updated 2 months ago
- Caching for HTTPX☆70Updated 8 months ago
- A regular dump of the most-downloaded packages from PyPI☆229Updated this week
- A Requests-compatible interface for PycURL.☆66Updated 8 months ago
- python module for ripgrep☆60Updated 2 months ago
- Make every function async and await-able.☆98Updated 2 years ago
- Cython based high performance alternative to Python (re) module for doing basic pattern matching on large data-set..☆12Updated 2 years ago
- Common interface for data container classes☆67Updated last week
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆69Updated 2 months ago