niksite / url-normalize
URL normalization for Python
☆94 Updated 2 years ago
Alternatives and similar repositories for url-normalize:
Users who are interested in url-normalize are comparing it to the libraries listed below.
- A pure-Python robots.txt parser with support for modern conventions. ☆60 Updated 2 weeks ago
- URL Transformation, Sanitization ☆103 Updated last year
- Library to populate items using XPath and CSS with a convenient API ☆47 Updated 2 weeks ago
- CSS Selectors for Python ☆293 Updated last week
- Python wrapper for RE2 ☆102 Updated 6 months ago
- Extracts the top level domain (TLD) from the URL given. ☆181 Updated last year
- aioelasticsearch-py wrapper for asyncio ☆137 Updated 2 years ago
- URLExtract is a Python class for collecting (extracting) URLs from a given text based on locating TLDs. ☆254 Updated last year
- xmljson converts XML into Python dictionary structures (trees, like in JSON) and vice versa. ☆122 Updated 2 years ago
- A pure Python Levenshtein implementation that's not freaking GPL'd. ☆97 Updated last year
- Internationalized Domain Names for Python (IDNA 2008 and UTS #46) ☆256 Updated 6 months ago
- 🔗 Immutable, Pythonic, correct URLs. ☆285 Updated 2 years ago
- Schema.org classes in pydantic ☆66 Updated 2 years ago
- A Python Implementation of RFC3986 including validations ☆184 Updated 2 weeks ago
- Python interface for c-ares ☆166 Updated 3 months ago
- Atom, RSS and JSON feed parser for Python 3 ☆116 Updated 2 years ago
- universal character encoding detector ☆396 Updated 3 months ago
- Standalone ISO 3166-1 country definitions ☆143 Updated 9 months ago
- URI parsing, classification and composition ☆64 Updated 3 months ago
- Python library of 60+ commonly-used validator functions ☆129 Updated 2 years ago
- A configurable HTML Minifier with safety features ☆132 Updated 3 years ago
- Modern robots.txt Parser for Python ☆192 Updated last year
- LRU cache for Python. Use Redis as backend. Provides a dictionary-like object as well as a method decorator. pip install redis-lru ☆43 Updated 2 years ago
- Extract text from HTML ☆134 Updated 4 years ago
- The most basic Text::Unidecode port (licensed under Artistic License or GPL or GPLv2+ - choose whatever you want) ☆65 Updated 2 years ago
- A python module to parse the Open Graph Protocol ☆232 Updated 3 years ago
- Backport of Python 3's csv module for Python 2 ☆64 Updated 4 years ago
- publicsuffixlist for python ☆66 Updated this week
- yet easy url ☆22 Updated 3 years ago
- aiohttp_debugtoolbar is a library that provides debugtoolbar support for aiohttp ☆197 Updated this week