aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,729 Updated last year
Alternatives and similar repositories for html2text
Users who are interested in html2text are comparing it to the libraries listed below.
- Convert HTML to Markdown-formatted text. ☆2,022 Updated 2 months ago
- markdown2: A fast and complete implementation of Markdown in Python ☆2,782 Updated last month
- A fast yet powerful Python Markdown parser with renderers and plugins. ☆2,825 Updated last month
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,813 Updated 2 months ago
- A Python implementation of John Gruber’s Markdown with Extension support. ☆4,018 Updated 3 weeks ago
- A jquery-like library for python ☆2,358 Updated 10 months ago
- Reads, queries and modifies Microsoft Word 2007/2008 docx files. ☆1,073 Updated 9 years ago
- A fully tested, abstract interface to creating OAuth clients and servers. ☆3,003 Updated last year
- Parse feeds in Python ☆2,150 Updated 2 weeks ago
- web.py is a web framework for python that is as simple as it is powerful. ☆5,914 Updated last month
- Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c. ☆4,706 Updated this week
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python ☆1,197 Updated last year
- Wkhtmltopdf python wrapper to convert html to pdf ☆2,027 Updated last year
- A pure-python HTML screen-scraping library ☆1,875 Updated 3 years ago
- Html Content / Article Extractor, web scraping lib in Python ☆4,040 Updated 3 years ago
- simplejson is a simple, fast, extensible JSON encoder/decoder for Python ☆1,682 Updated 3 months ago
- Thin wrapper for "pandoc" (MIT) ☆1,005 Updated this week
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,241 Updated last month
- Python Subprocesses for Humans™. ☆2,267 Updated 8 years ago
- [abandoned] python port of arc90's readability bookmarklet ☆542 Updated 14 years ago
- Webkit based scriptable web browser for python. ☆2,767 Updated last year
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize. ☆616 Updated 8 years ago
- Mustache in Python ☆1,311 Updated 3 years ago
- extract text from any document. no muss. no fuss. ☆4,184 Updated 7 months ago
- A library for converting HTML into PDFs using ReportLab ☆2,320 Updated last month
- Translate Chinese hanzi to pinyin (拼音) by Python, 汉字转拼音 ☆829 Updated last month
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes ☆2,706 Updated last month
- Python script that takes screenshots (browsershots) using webkit ☆940 Updated 4 years ago
- A versatile Python library for EPUB2/EPUB3 manipulation and processing. ☆1,651 Updated 2 months ago
- Monitoring filesystems events with inotify on Linux. ☆2,304 Updated last year