aaronsw / html2textLinks
Convert HTML to Markdown-formatted text.
☆2,881Updated last year
Alternatives and similar repositories for html2text
Users that are interested in html2text are comparing it to the libraries listed below
Sorting:
- Convert HTML to Markdown-formatted text.☆2,130Updated 3 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,886Updated 2 weeks ago
- markdown2: A fast and complete implementation of Markdown in Python☆2,806Updated 3 weeks ago
- A Python implementation of John Gruber’s Markdown with Extension support.☆4,159Updated last week
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,971Updated last month
- Thin wrapper for "pandoc" (MIT)☆1,100Updated last month
- A Python Static Website Generator (See https://duct-ui.org from the author).☆1,739Updated last year
- A jquery-like library for python☆2,379Updated last year
- Html Content / Article Extractor, web scrapping lib in Python☆4,061Updated 4 years ago
- [abandoned] python port of arc90's readability bookmarklet☆542Updated 14 years ago
- A fully tested, abstract interface to creating OAuth clients and servers.☆3,014Updated last year
- Webkit based scriptable web browser for python.☆2,763Updated last year
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python☆1,217Updated last year
- The lxml XML toolkit for Python☆2,993Updated 2 weeks ago
- Mustache in Python☆1,317Updated 4 years ago
- Python character encoding detector☆2,321Updated last month
- The granddaddy of HTML tools, with support for modern standards☆2,898Updated last year
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆902Updated last week
- Reads, queries and modifies Microsoft Word 2007/2008 docx files.☆1,073Updated 10 years ago
- Python process launching☆7,233Updated 3 months ago
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize .☆616Updated 8 years ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages☆542Updated 4 years ago
- Collection of plugins for the Pelican static site generator☆1,398Updated 7 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,313Updated 2 weeks ago
- Implementation of hashids (http://hashids.org) in Python. Compatible with Python 2 and Python 3☆1,421Updated 2 years ago
- livereload server in python☆1,047Updated last week
- A pure python based utility to extract text and images from docx files.☆582Updated 10 months ago
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes☆2,749Updated last week
- Python email address and Mime parsing library☆1,648Updated last year
- extract text from any document. no muss. no fuss.☆4,434Updated last week