Intsights / PyDomainExtractor
A blazingly fast domain extraction library written in Rust
☆67 · Updated 4 months ago
Alternatives and similar repositories for PyDomainExtractor
Users who are interested in PyDomainExtractor compare it to the libraries listed below.
- A fast and easy adblockplus parser and matcher based on the adblock-rust package ☆28 · Updated 9 months ago
- Python library for fast fuzzy search over a big file written in Rust ☆45 · Updated 4 months ago
- Python library for duplicate line removal written in C++ ☆33 · Updated 4 months ago
- A Git Repository Secrets Scanner written in Rust ☆38 · Updated 4 months ago
- Python library for fast substring/pattern search written in C++ leveraging the Suffix Array Algorithm ☆42 · Updated 4 months ago
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems ☆17 · Updated 2 years ago
- Concatenated-word segmentation Python library written in Rust ☆17 · Updated 4 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices ☆14 · Updated last month
- URL normalization for Python ☆99 · Updated 7 months ago
- URLExtract is a Python class for collecting (extracting) URLs from given text based on locating the TLD. ☆272 · Updated last year
- Common interface for data container classes ☆68 · Updated this week
- 🐍 A CPython extension for the Hyperscan regular expression matching library. ☆188 · Updated 2 weeks ago
- Parse numbers written in natural language ☆124 · Updated last year
- Word frequency checker based on Wikipedia corpus written in Rust ☆10 · Updated 4 months ago
- publicsuffixlist for python ☆72 · Updated this week
- Extracts the top level domain (TLD) from the URL given. ☆181 · Updated 6 months ago
- Python wrapper for RE2 ☆106 · Updated 2 months ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python ☆219 · Updated this week
- universal character encoding detector ☆63 · Updated last year
- Queue server based on RocksDB as a KV-Store backend and gRPC as an interface ☆10 · Updated 2 years ago
- A pure-Python robots.txt parser with support for modern conventions. ☆74 · Updated last week
- Caching for HTTPX ☆72 · Updated 2 months ago
- A small Python library to deal with publicsuffix data (includes a bundled PSL as "package data") in a wheel friendly format. Fork and con… ☆31 · Updated 4 years ago
- Web scraping Page Objects core library ☆103 · Updated last week
- ⏲️ Easy rate limiting for Python using a token bucket algorithm, with async and thread-safe decorators and context managers ☆52 · Updated last year
- A Python implementation of Lunr.js 🌖 ☆202 · Updated 9 months ago
- Bounded Process&Thread Pool Executor ☆63 · Updated last year
- Make every function async and await-able. ☆98 · Updated 3 years ago
- Pool for asyncio with multiprocessing, threading and gevent-like interface ☆120 · Updated 2 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing ☆77 · Updated 2 weeks ago