john-kurkowski / tldextractLinks
Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).
☆1,905Updated last month
Alternatives and similar repositories for tldextract
Users that are interested in tldextract are comparing it to the libraries listed below
Sorting:
- Retrieve and parse whois data for IPv4 and IPv6 addresses☆582Updated 8 months ago
- python parser for human readable dates☆2,682Updated 3 weeks ago
- 🌐 URL parsing and manipulation made easy.☆2,689Updated 2 months ago
- Simple DNS resolver for asyncio☆561Updated this week
- Extracts the top level domain (TLD) from the URL given.☆182Updated 3 weeks ago
- Asynchronous Python HTTP Requests for Humans using Futures☆2,122Updated this week
- A toolbelt of useful classes and functions to be used with python-requests☆1,019Updated 5 months ago
- Persistent HTTP cache for python requests☆1,429Updated last week
- Useful extensions to the standard Python datetime features☆2,475Updated 2 months ago
- Lightweight Python utilities for working with Redis☆1,174Updated last week
- 🎭 Twisted Deferred Thread backend for Requests.☆418Updated 6 years ago
- Yet another URL library☆1,416Updated this week
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes☆2,703Updated 3 weeks ago
- Python binding to Modest and Lexbor engines (fast HTML5 parser with CSS selectors).☆1,287Updated last week
- Tokenizer for raw mails☆389Updated 3 weeks ago
- a powerful DNS toolkit for python☆2,563Updated last week
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,233Updated last month
- A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) us…☆1,482Updated 2 years ago
- A python module for retrieving and parsing WHOIS data☆407Updated 3 years ago
- Fast HTTP parser☆1,255Updated 8 months ago
- Extract embedded metadata from HTML markup☆919Updated 3 months ago
- URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.☆264Updated last year
- JMESPath is a query language for JSON.☆2,324Updated last year
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,803Updated last month
- Safely pass trusted data to untrusted environments and back.☆3,022Updated last week
- A simple Python module for parsing human names into their individual components☆674Updated last year
- Python implementation of ua-parser☆615Updated last week
- PostgreSQL database adapter for the Python programming language☆3,498Updated last month
- Python library of web-related functions☆405Updated last month
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆870Updated 6 months ago