aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,661Updated 10 months ago
Alternatives and similar repositories for html2text:
Users that are interested in html2text are comparing it to the libraries listed below
- Convert HTML to Markdown-formatted text.☆1,870Updated 5 months ago
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,643Updated last week
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,699Updated this week
- markdown2: A fast and complete implementation of Markdown in Python☆2,689Updated 3 weeks ago
- A jquery-like library for python☆2,311Updated 4 months ago
- A Python implementation of John Gruber’s Markdown with Extension support.☆3,857Updated 3 weeks ago
- extract text from any document. no muss. no fuss.☆3,956Updated last month
- A service daemon to run Scrapy spiders☆2,984Updated 3 weeks ago
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python☆1,146Updated 10 months ago
- Parse feeds in Python☆2,021Updated last week
- Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.☆4,650Updated last week
- HTTP API for Scrapy spiders☆843Updated 6 months ago
- Ultra fast JSON decoder and encoder written in C with Python bindings☆4,362Updated last week
- 🌐 URL parsing and manipulation made easy.☆2,656Updated last month
- [abandoned] python port of arc90's readability bookmarklet☆538Updated 13 years ago
- Static site generator that supports Markdown and reST syntax. Powered by Python.☆12,683Updated this week
- 🏹 Better dates & times for Python☆8,768Updated last month
- Python Subprocesses for Humans™.☆2,270Updated 8 years ago
- Convert HTML to Markdown.☆530Updated 4 years ago
- Lightweight, scriptable browser as a service with an HTTP API☆4,112Updated 5 months ago
- Python E-book library for handling books in EPUB2/EPUB3 format -☆1,534Updated 5 months ago
- Python module that makes working with XML feel like you are working with JSON☆5,547Updated 3 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,165Updated last month
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,275Updated 2 years ago
- Html Content / Article Extractor, web scrapping lib in Python☆3,998Updated 3 years ago
- A Python Static Website Generator (Presently Unmaintained).☆1,630Updated 3 months ago
- A library for converting HTML into PDFs using ReportLab☆2,271Updated last week
- Python character encoding detector☆2,212Updated this week
- Create *beautiful* command-line interfaces with Python☆7,958Updated 7 months ago
- Pure-Python Git implementation☆2,080Updated this week