Alir3z4 / html2text
Convert HTML to Markdown-formatted text.
☆1,972Updated last week
Alternatives and similar repositories for html2text:
Users that are interested in html2text are comparing it to the libraries listed below
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,762Updated 3 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,219Updated 3 weeks ago
- Convert HTML to Markdown-formatted text.☆2,713Updated last year
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆863Updated 4 months ago
- extract text from any document. no muss. no fuss.☆4,094Updated 4 months ago
- Thin wrapper for "pandoc" (MIT)☆970Updated 2 weeks ago
- python parser for human readable dates☆2,642Updated last month
- Convert HTML to Markdown☆1,564Updated last week
- Parse feeds in Python☆2,098Updated 2 weeks ago
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,754Updated 3 weeks ago
- The lxml XML toolkit for Python☆2,810Updated this week
- Useful extensions to the standard Python datetime features☆2,448Updated 3 weeks ago
- A python wrapper for libmagic☆2,745Updated last month
- A python based HTML to text conversion library, command line client and Web service.☆302Updated last month
- Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).☆1,888Updated this week
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,576Updated last week
- Pure-Python full-text search library☆620Updated last year
- A generator library for concise, unambiguous and URL-safe UUIDs.☆2,121Updated 4 months ago
- ASCII transliterations of Unicode text - GitHub mirror☆559Updated last week
- markdown2: A fast and complete implementation of Markdown in Python☆2,741Updated 2 weeks ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,027Updated 3 years ago
- 🌐 URL parsing and manipulation made easy.☆2,678Updated last month
- emoji terminal output for Python☆1,957Updated last week
- Python library providing function decorators for configurable backoff and retry☆2,654Updated 11 months ago
- A service daemon to run Scrapy spiders☆3,027Updated 2 weeks ago
- Lightweight, scriptable browser as a service with an HTTP API☆4,136Updated 8 months ago
- Returns unicode slugs☆1,526Updated last year
- 🏹 Better dates & times for Python☆8,834Updated 5 months ago
- A Python implementation of John Gruber’s Markdown with Extension support.☆3,961Updated this week
- Asynchronous Python HTTP Requests for Humans using Futures☆2,116Updated 3 months ago