aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,698 Updated last year
Alternatives and similar repositories for html2text:
Users who are interested in html2text are comparing it to the libraries listed below.
- Convert HTML to Markdown-formatted text. ☆1,917 Updated 8 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,751 Updated 2 months ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,284 Updated 2 years ago
- A fast yet powerful Python Markdown parser with renderers and plugins. ☆2,715 Updated last week
- A Python implementation of John Gruber’s Markdown with Extension support. ☆3,939 Updated last week
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,207 Updated this week
- A Python Static Website Generator (Presently Unmaintained). ☆1,629 Updated 6 months ago
- [abandoned] python port of arc90's readability bookmarklet ☆539 Updated 13 years ago
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python ☆1,182 Updated last year
- Webkit based scriptable web browser for python. ☆2,762 Updated last year
- Python Command-line Application Tools ☆97 Updated last year
- Generate HTML5 slideshows from markdown, ReST, or textile ☆2,093 Updated last year
- Parse feeds in Python ☆2,079 Updated 3 weeks ago
- Web Content Retrieval for Humans™ ☆618 Updated 2 years ago
- A jquery-like library for python ☆2,343 Updated 6 months ago
- Reads, queries and modifies Microsoft Word 2007/2008 docx files. ☆1,073 Updated 9 years ago
- Requests + Gevent = <3 ☆4,549 Updated 7 months ago
- web.py is a web framework for python that is as simple as it is powerful. ☆5,906 Updated last month
- Convert HTML to Markdown ☆1,490 Updated 3 weeks ago
- A Python module for controlling interactive programs in a pseudo-terminal ☆2,672 Updated 3 weeks ago
- Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c. ☆4,680 Updated last month
- urllib3 is a user-friendly HTTP client library for Python ☆3,852 Updated this week
- extract text from any document. no muss. no fuss. ☆4,013 Updated 3 months ago
- A cool logging replacement for Python. ☆1,483 Updated last week
- 🛏 An HTML to Markdown converter written in JavaScript ☆9,458 Updated 7 months ago
- Python Subprocesses for Humans™. ☆2,270 Updated 8 years ago
- Lightweight, scriptable browser as a service with an HTTP API ☆4,130 Updated 7 months ago
- 🌐 URL parsing and manipulation made easy. ☆2,671 Updated this week
- ☆3,708 Updated 4 years ago
- Asynchronous Python HTTP Requests for Humans using Futures ☆2,113 Updated 2 months ago