Intsights / PyDomainExtractorLinks
A blazingly fast domain extraction library written in Rust
☆66Updated 2 months ago
Alternatives and similar repositories for PyDomainExtractor
Users that are interested in PyDomainExtractor are comparing it to the libraries listed below
Sorting:
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 7 months ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 2 months ago
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 2 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 2 months ago
- Concatenated-word segmentation Python library written in Rust☆17Updated 2 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 2 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆269Updated last year
- URL normalization for Python☆99Updated 5 months ago
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆185Updated 2 months ago
- Parse numbers written in natural language☆123Updated 11 months ago
- Python wrapper for RE2☆105Updated this week
- universal character encoding detector☆60Updated last year
- Common interface for data container classes☆68Updated last month
- A pure-Python robots.txt parser with support for modern conventions.☆70Updated 2 months ago
- Extracts the top level domain (TLD) from the URL given.☆181Updated 4 months ago
- Library to populate items using XPath and CSS with a convenient API☆47Updated last month
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆75Updated 2 weeks ago
- Extract text from HTML☆134Updated 5 years ago
- Your Tool For Python Performance Tracking☆24Updated last year
- Python RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆28Updated 2 months ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group.☆84Updated 4 years ago
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆18Updated 2 months ago
- Fast Python Bloom Filter using Mmap☆128Updated last month
- A library to help automate the creation of universal python libraries☆41Updated this week
- Python WHOIS and RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆83Updated 7 months ago
- Efficient string matching with regular expressions☆144Updated last week
- Complete lxml external type annotation☆69Updated 2 weeks ago
- A Requests-compatible interface for PycURL.☆70Updated 3 weeks ago
- Turn Pydantic defined Data Models into CLI Tools☆155Updated last week
- Pool for asyncio with multiprocessing, threading and gevent -like interface☆120Updated 2 years ago