Intsights / PyDomainExtractorLinks
A blazingly fast domain extraction library written in Rust
☆66Updated last month
Alternatives and similar repositories for PyDomainExtractor
Users that are interested in PyDomainExtractor are comparing it to the libraries listed below
Sorting:
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 7 months ago
- Python library for a duplicate lines removal written in C++☆33Updated last month
- Python library for fast fuzzy search over a big file written in Rust☆45Updated last month
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated last month
- A Git Repository Secrets Scanner written in Rust☆39Updated last month
- Concatenated-word segmentation Python library written in Rust☆17Updated last month
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated last month
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated last month
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆183Updated last month
- URL normalization for Python☆98Updated 5 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆268Updated last year
- Python WHOIS and RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆82Updated 6 months ago
- Python wrapper for RE2☆104Updated last month
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- publicsuffixlist for python☆70Updated this week
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆209Updated 2 weeks ago
- Caching for HTTPX☆72Updated last year
- Common interface for data container classes☆68Updated last week
- Python RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆27Updated last month
- Parse numbers written in natural language☆123Updated 11 months ago
- A pure-Python robots.txt parser with support for modern conventions.☆70Updated 2 months ago
- Easy rate-limiting for python requests☆110Updated 2 weeks ago
- Complete lxml external type annotation☆68Updated 3 weeks ago
- A Python implementation of Lunr.js 🌖☆200Updated 6 months ago
- Extracts the top level domain (TLD) from the URL given.☆181Updated 4 months ago
- Bounded Process&Thread Pool Executor☆63Updated last year
- Fast Python Bloom Filter using Mmap☆127Updated last week
- Dlint is a tool for encouraging best coding practices and helping ensure Python code is secure.☆169Updated 10 months ago
- The async transformation code.☆96Updated last week
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated last month