aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,746 Updated last year
Alternatives and similar repositories for html2text
Users who are interested in html2text are comparing it to the libraries listed below.
- Convert HTML to Markdown-formatted text. ☆2,065 Updated 6 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,849 Updated 5 months ago
- markdown2: A fast and complete implementation of Markdown in Python ☆2,794 Updated last week
- A fast yet powerful Python Markdown parser with renderers and plugins. ☆2,878 Updated last month
- A Python implementation of John Gruber’s Markdown with Extension support. ☆4,097 Updated 2 weeks ago
- Parse feeds in Python ☆2,212 Updated last week
- A jquery-like library for python ☆2,361 Updated last year
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html ☆889 Updated 3 weeks ago
- A library for converting HTML into PDFs using ReportLab ☆2,344 Updated last month
- Thin wrapper for "pandoc" (MIT) ☆1,053 Updated last week
- [abandoned] python port of arc90's readability bookmarklet ☆542 Updated 14 years ago
- Html Content / Article Extractor, web scraping lib in Python ☆4,047 Updated 3 years ago
- Convert HTML to Markdown ☆1,820 Updated 2 months ago
- A versatile Python library for EPUB2/EPUB3 manipulation and processing. ☆1,696 Updated this week
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,277 Updated last month
- extract text from any document. no muss. no fuss. ☆4,327 Updated 10 months ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,298 Updated 2 years ago
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes ☆2,721 Updated last week
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python ☆1,209 Updated last year
- Python script that takes screenshots (browsershots) using webkit ☆941 Updated 5 years ago
- Wkhtmltopdf python wrapper to convert html to pdf ☆2,030 Updated last year
- The lxml XML toolkit for Python ☆2,931 Updated 2 weeks ago
- The ctypes-based simple ImageMagick binding for Python ☆1,463 Updated last week
- Webkit based scriptable web browser for python. ☆2,761 Updated last year
- A pure-python HTML screen-scraping library ☆1,887 Updated 3 years ago
- Python character encoding detector ☆2,288 Updated 9 months ago
- Lightweight, scriptable browser as a service with an HTTP API ☆4,178 Updated last year
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize. ☆615 Updated 8 years ago
- A Python library that provides an easy way to identify devices like mobile phones, tablets and their capabilities by parsing (browser) us… ☆1,495 Updated 2 years ago
- splinter - python test framework for web applications ☆2,754 Updated last month