Intsights / PyDomainExtractorLinks
A blazingly fast domain extraction library written in Rust
☆66Updated this week
Alternatives and similar repositories for PyDomainExtractor
Users that are interested in PyDomainExtractor are comparing it to the libraries listed below
Sorting:
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆27Updated 5 months ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 5 months ago
- Python library for a duplicate lines removal written in C++☆33Updated this week
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 5 months ago
- Intsights open-source wrappers library for some AWS resources and high level management objects for distributed backend systems☆17Updated last year
- A Git Repository Secrets Scanner written in Rust☆39Updated 5 months ago
- Concatenated-word segmentation Python library written in Rust☆17Updated 5 months ago
- ☆25Updated this week
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 5 months ago
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 5 months ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆183Updated this week
- Queue server base on RocksDB as a KV-Store backend and gRPC as an interface☆10Updated last year
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆265Updated last year
- Extracts the top level domain (TLD) from the URL given.☆181Updated 2 months ago
- Fast Python Bloom Filter using Mmap☆127Updated last year
- Python WHOIS and RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆82Updated 5 months ago
- URL normalization for Python☆96Updated 3 months ago
- Parse numbers written in natural language☆122Updated 9 months ago
- Extract text from HTML☆134Updated 5 years ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆18Updated 5 months ago
- Efficient string matching with regular expressions☆145Updated this week
- A pure-Python robots.txt parser with support for modern conventions.☆70Updated 2 weeks ago
- Python tool to support lazy imports.☆30Updated 2 months ago
- A small Python library to deal with publicsuffix data (includes a bundled PSL as "package data") in a wheel friendly format. Fork and con…☆31Updated 3 years ago
- Common interface for data container classes☆68Updated this week
- universal character encoding detector☆59Updated 11 months ago
- Python RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆27Updated 2 weeks ago
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆205Updated last week
- Library to populate items using XPath and CSS with a convenient API☆47Updated 2 weeks ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆136Updated last week