john-kurkowski / tldextract
Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).
☆1,929 Updated 2 months ago
Alternatives and similar repositories for tldextract
Users who are interested in tldextract are comparing it to the libraries listed below.
- Python email address and MIME parsing library ☆1,641 Updated last year
- A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) us… ☆1,494 Updated 2 years ago
- Returns Unicode slugs ☆1,551 Updated last week
- Extracts the top level domain (TLD) from the URL given. ☆181 Updated 4 months ago
- Python implementation of ua-parser ☆627 Updated 3 months ago
- A Python module for retrieving and parsing WHOIS data ☆409 Updated 4 years ago
- Python parser for human-readable dates ☆2,727 Updated last month
- Python code for GeoIP2 webservice client and database reader ☆1,169 Updated last week
- A collection of common regular expressions bundled with an easy-to-use interface. ☆1,581 Updated 2 years ago
- Retrieve and parse whois data for IPv4 and IPv6 addresses ☆587 Updated 11 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,274 Updated 2 weeks ago
- Persistent HTTP cache for Python requests ☆1,453 Updated 3 weeks ago
- Python Data Validation for Humans™. ☆1,073 Updated 3 months ago
- Python module/library for retrieving domain WHOIS information (only domain) ☆287 Updated last year
- Asynchronous Python HTTP Requests for Humans using Futures ☆2,217 Updated 3 months ago
- Extract embedded metadata from HTML markup ☆933 Updated last month
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes ☆2,718 Updated 4 months ago
- 🌐 The easiest way to parse and modify URLs in Python. ☆2,767 Updated 3 weeks ago
- A powerful DNS toolkit for Python ☆2,604 Updated last week
- Useful extensions to the standard Python datetime features ☆2,527 Updated 2 weeks ago
- URLExtract is a Python class for collecting (extracting) URLs from given text based on locating TLDs. ☆268 Updated last year
- A toolbelt of useful classes and functions to be used with python-requests ☆1,027 Updated 8 months ago
- Convert HTML to Markdown-formatted text. ☆2,062 Updated 5 months ago
- ☆426 Updated this week
- A simple Python module for parsing human names into their individual components ☆686 Updated last year
- Requests + Gevent = <3 ☆4,574 Updated last year
- Parse feeds in Python ☆2,205 Updated 3 weeks ago
- Python library of web-related functions ☆412 Updated last month
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html ☆887 Updated 2 weeks ago
- Fast Python port of arc90's readability tool, updated to match the latest readability.js! ☆2,847 Updated 5 months ago