Intsights / PyDomainExtractorLinks
A blazingly fast domain extraction library written in Rust
☆67Updated 2 months ago
Alternatives and similar repositories for PyDomainExtractor
Users that are interested in PyDomainExtractor are comparing it to the libraries listed below
Sorting:
- A fast and easy adblockplus parser and matcher based on adblock-rust package☆28Updated 8 months ago
- Python library for fast fuzzy search over a big file written in Rust☆45Updated 2 months ago
- Python library for a duplicate lines removal written in C++☆33Updated 2 months ago
- Python library for fast substring/pattern search written in C++ leveraging Suffix Array Algorithm☆41Updated 2 months ago
- A Git Repository Secrets Scanner written in Rust☆39Updated 2 months ago
- Concatenated-word segmentation Python library written in Rust☆17Updated 2 months ago
- Uncompromising and opinionated flake8 plugin which follows Intsights' practices☆14Updated 3 weeks ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆271Updated last year
- 🐍 A CPython extension for the Hyperscan regular expression matching library.☆186Updated 2 weeks ago
- Word frequency checker based on Wikipedia corpus written in Rust☆10Updated 2 months ago
- Train a model, and detect gibberish strings with it.☆67Updated 3 years ago
- URL normalization for Python☆99Updated 6 months ago
- Extracts the top level domain (TLD) from the URL given.☆181Updated 5 months ago
- universal character encoding detector☆61Updated last year
- Bounded Process&Thread Pool Executor☆63Updated last year
- Parse numbers written in natural language☆123Updated last year
- Check for multiple patterns in a single string at the same time: a fast Aho-Corasick algorithm for Python☆215Updated last week
- Extract text from HTML☆134Updated 5 years ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆76Updated last week
- A Requests-compatible interface for PycURL.☆71Updated last month
- Python WHOIS and RDAP utility for querying and parsing information about Domains, IPv4s, IPv6s, and AS numbers☆85Updated last week
- Python wrapper for RE2☆105Updated 3 weeks ago
- Fast and robust date extraction from web pages, with Python or on the command-line☆141Updated last week
- A Python based alternative to Elasticsearch Reindex API with multiprocessing support.☆18Updated 2 months ago
- publicsuffixlist for python☆72Updated this week
- 🔍 PyPI package information at a glance for Python dependencies – a VS Code extension☆36Updated last month
- A small Python library to deal with publicsuffix data (includes a bundled PSL as "package data") in a wheel friendly format. Fork and con…☆31Updated 4 years ago
- A Python implementation of Lunr.js 🌖☆200Updated 8 months ago
- Parse natural language time expressions in python☆131Updated 2 years ago
- Stackable cache classes for sharing, encryption, statistics and more on top of cachetools, redis and memcached☆34Updated 7 months ago