john-kurkowski / tldextract
Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).
☆1,908 · Updated 2 months ago
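To illustrate what separating a hostname into subdomain, domain, and public suffix involves, here is a minimal standard-library sketch of longest-suffix matching. It is not tldextract's implementation: `TOY_SUFFIXES` and `split_host` are hypothetical names, and the hardcoded set stands in for the real Public Suffix List, which is far larger and also contains wildcard and exception rules that this sketch ignores.

```python
from urllib.parse import urlsplit

# Hypothetical, tiny stand-in for the Public Suffix List (assumption for illustration only).
TOY_SUFFIXES = {"com", "org", "uk", "co.uk"}


def split_host(url):
    """Return (subdomain, domain, suffix) using longest-suffix matching."""
    # urlsplit only fills in hostname when a "//" authority marker is present.
    host = urlsplit(url if "//" in url else "//" + url).hostname or ""
    labels = host.split(".")
    # Walk from the longest candidate suffix to the shortest, so "co.uk" wins over "uk".
    for i in range(len(labels)):
        candidate = ".".join(labels[i:])
        if candidate in TOY_SUFFIXES:
            domain = labels[i - 1] if i > 0 else ""
            subdomain = ".".join(labels[: i - 1]) if i > 1 else ""
            return subdomain, domain, candidate
    # No known suffix: treat the last label as the domain and leave the suffix empty.
    return ".".join(labels[:-1]), labels[-1], ""


print(split_host("http://forums.bbc.co.uk/path"))
print(split_host("www.example.com"))
```

A naive split on the last dot would report `co.uk` hosts as having the domain `co`, which is why a suffix list is needed at all.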
Alternatives and similar repositories for tldextract
Users who are interested in tldextract are comparing it to the libraries listed below.
- A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) us… ☆1,484 · Updated 2 years ago
- A python module for retrieving and parsing WHOIS data ☆407 · Updated 3 years ago
- Retrieve and parse whois data for IPv4 and IPv6 addresses ☆584 · Updated 9 months ago
- A collection of common regular expressions bundled with an easy to use interface. ☆1,576 · Updated 2 years ago
- Extract embedded metadata from HTML markup ☆923 · Updated 3 months ago
- Asynchronous Python HTTP Requests for Humans using Futures ☆2,121 · Updated 3 weeks ago
- Python email address and Mime parsing library ☆1,643 · Updated last year
- Python module/library for retrieving domain WHOIS information (only domain) ☆289 · Updated last year
- Python code for GeoIP2 webservice client and database reader ☆1,155 · Updated this week
- python parser for human readable dates ☆2,695 · Updated 2 weeks ago
- Python implementation of ua-parser ☆618 · Updated last month
- Extracts the top level domain (TLD) from the URL given. ☆182 · Updated last month
- a powerful DNS toolkit for python ☆2,575 · Updated this week
- ☆419 · Updated last month
- 🌐 URL parsing and manipulation made easy. ☆2,697 · Updated 3 months ago
- Persistent HTTP cache for python requests ☆1,436 · Updated this week
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,247 · Updated 2 months ago
- Returns unicode slugs ☆1,537 · Updated 3 weeks ago
- A simple Python module for parsing human names into their individual components ☆676 · Updated last year
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes