aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,794 Updated last year
Alternatives and similar repositories for html2text
Users who are interested in html2text compare it to the libraries listed below.
- Convert HTML to Markdown-formatted text. ☆2,075 Updated last week
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,852 Updated 6 months ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,302 Updated 2 years ago
- markdown2: A fast and complete implementation of Markdown in Python ☆2,801 Updated 3 weeks ago
- A jquery-like library for python ☆2,361 Updated last year
- A Python implementation of John Gruber’s Markdown with Extension support. ☆4,111 Updated this week
- A fast yet powerful Python Markdown parser with renderers and plugins. ☆2,889 Updated 2 months ago
- Webkit based scriptable web browser for python. ☆2,760 Updated last year
- A versatile Python library for EPUB2/EPUB3 manipulation and processing. ☆1,706 Updated last week
- Parse feeds in Python ☆2,228 Updated this week
- [abandoned] python port of arc90's readability bookmarklet ☆542 Updated 14 years ago
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html ☆891 Updated last week
- pdfrw is a pure Python library that reads and writes PDFs ☆1,911 Updated last year
- Reads, queries and modifies Microsoft Word 2007/2008 docx files. ☆1,072 Updated 10 years ago
- Html Content / Article Extractor, web scraping lib in Python ☆4,049 Updated 3 years ago
- Thin wrapper for "pandoc" (MIT) ☆1,057 Updated 2 weeks ago
- The lxml XML toolkit for Python ☆2,943 Updated 2 weeks ago
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python ☆1,209 Updated last year
- extract text from any document. no muss. no fuss. ☆4,351 Updated 11 months ago
- Wkhtmltopdf python wrapper to convert html to pdf ☆2,031 Updated 2 years ago
- Python character encoding detector ☆2,293 Updated this week
- Lightweight, scriptable browser as a service with an HTTP API ☆4,192 Updated last year
- Convert Word documents (.docx files) to HTML ☆1,020 Updated last month
- A pure-python HTML screen-scraping library ☆1,886 Updated 3 years ago
- Python Command-line Application Tools ☆97 Updated 2 years ago
- Google search from Python (unofficial). ☆1,233 Updated last year
- Python module to generate ATOM feeds, RSS feeds and Podcasts.