john-kurkowski / tldextractLinks
Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).
☆1,943Updated 3 weeks ago
Alternatives and similar repositories for tldextract
Users that are interested in tldextract are comparing it to the libraries listed below
Sorting:
- Python implementation of ua-parser☆633Updated 4 months ago
- A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) us…☆1,497Updated 2 years ago
- python parser for human readable dates☆2,739Updated 2 weeks ago
- Extracts the top level domain (TLD) from the URL given.☆181Updated 5 months ago
- 🌐 The easiest way to parse and modify URLs in Python.☆2,778Updated last week
- A collection of common regular expressions bundled with an easy to use interface.☆1,581Updated 2 years ago
- A python module for retrieving and parsing WHOIS data☆410Updated 4 years ago
- Python code for GeoIP2 webservice client and database reader☆1,178Updated this week
- A simple Python module for parsing human names into their individual components☆691Updated last year
- Extract embedded metadata from HTML markup☆935Updated last month
- Asynchronous Python HTTP Requests for Humans using Futures☆2,221Updated 4 months ago
- Returns unicode slugs☆1,558Updated last month
- Python module/library for retrieving domain WHOIS information (only domain)☆287Updated last year
- Persistent HTTP cache for python requests☆1,458Updated last month
- Port of Google's language-detection library to Python.☆1,853Updated 8 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,285Updated last week
- Python Data Validation for Humans™.☆1,091Updated last month
- ☆436Updated 3 weeks ago
- Python email address and Mime parsing library☆1,645Updated last year
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes☆2,730Updated 2 weeks ago
- A tool that parses emails by enhancing the Python standard library, extracting all details into a comprehensive object.☆417Updated last week
- a powerful DNS toolkit for python☆2,615Updated 2 weeks ago
- Requests + Gevent = <3☆4,579Updated last year
- A robust email syntax and deliverability validation library for Python.☆1,327Updated 2 weeks ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆271Updated last year
- Useful extensions to the standard Python datetime features☆2,555Updated last month
- Find dates inside text using Python and get back datetime objects☆664Updated last year
- A python wrapper for libmagic☆2,839Updated last month
- Retrying is an Apache 2.0 licensed general-purpose retrying library, written in Python, to simplify the task of adding retry behavior to …☆1,928Updated 4 years ago
- Python library of web-related functions☆411Updated this week