aaronsw / html2textLinks
Convert HTML to Markdown-formatted text.
☆2,730Updated last year
Alternatives and similar repositories for html2text
Users that are interested in html2text are comparing it to the libraries listed below
Sorting:
- Convert HTML to Markdown-formatted text.☆2,025Updated 3 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,826Updated 3 months ago
- markdown2: A fast and complete implementation of Markdown in Python☆2,782Updated last week
- A Python implementation of John Gruber’s Markdown with Extension support.☆4,037Updated last week
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python☆1,203Updated last year
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,837Updated 2 months ago
- extract text from any document. no muss. no fuss.☆4,229Updated 8 months ago
- Reads, queries and modifies Microsoft Word 2007/2008 docx files.☆1,073Updated 9 years ago
- A jquery-like library for python☆2,359Updated 11 months ago
- A fully tested, abstract interface to creating OAuth clients and servers.☆3,003Updated last year
- Parse feeds in Python☆2,163Updated 3 weeks ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,293Updated 2 years ago
- Mustache in Python☆1,311Updated 3 years ago
- [abandoned] python port of arc90's readability bookmarklet☆542Updated 14 years ago
- Thin wrapper for "pandoc" (MIT)☆1,020Updated 3 weeks ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,254Updated last week
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆876Updated 7 months ago
- Python Command-line Application Tools☆97Updated last year
- Convert HTML to Markdown☆1,729Updated 2 weeks ago
- A Python Static Website Generator (Presently Unmaintained).☆1,634Updated 10 months ago
- A library for converting HTML into PDFs using ReportLab☆2,328Updated 2 months ago
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes☆2,708Updated 2 months ago
- The ctypes-based simple ImageMagick binding for Python☆1,458Updated 2 weeks ago
- Webkit based scriptable web browser for python.☆2,765Updated last year
- Pure-Python Git implementation☆2,136Updated last week
- Convert Word documents (.docx files) to HTML☆976Updated last month
- Python script that takes screenshots (browsershots) using webkit☆940Updated 4 years ago
- Scalable Bloom Filter implemented in Python☆1,622Updated 4 years ago
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize .☆616Updated 8 years ago
- Wkhtmltopdf python wrapper to convert html to pdf☆2,027Updated last year