aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,743 · Updated last year
Alternatives and similar repositories for html2text
Users who are interested in html2text are comparing it to the libraries listed below.
- Convert HTML to Markdown-formatted text. ☆2,047 · Updated 5 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,843 · Updated 4 months ago
- markdown2: A fast and complete implementation of Markdown in Python ☆2,789 · Updated last month
- A fast yet powerful Python Markdown parser with renderers and plugins. ☆2,860 · Updated 2 weeks ago
- A Python implementation of John Gruber’s Markdown with Extension support. ☆4,076 · Updated last week
- Parse feeds in Python ☆2,193 · Updated this week
- [abandoned] python port of arc90's readability bookmarklet ☆541 · Updated 14 years ago
- A jquery-like library for python ☆2,360 · Updated last year
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html ☆883 · Updated 8 months ago
- extract text from any document. no muss. no fuss. ☆4,281 · Updated 9 months ago
- Html Content / Article Extractor, web scrapping lib in Python ☆4,047 · Updated 3 years ago
- A pure-python HTML screen-scraping library ☆1,883 · Updated 3 years ago
- Please use openpyxl where you can... ☆2,181 · Updated 3 months ago
- A library for converting HTML into PDFs using ReportLab ☆2,337 · Updated this week
- The lxml XML toolkit for Python ☆2,913 · Updated this week
- Webkit based scriptable web browser for python. ☆2,764 · Updated last year
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,300 · Updated 2 years ago
- web.py is a web framework for python that is as simple as it is powerful. ☆5,917 · Updated last week
- Reads, queries and modifies Microsoft Word 2007/2008 docx files. ☆1,072 · Updated 10 years ago
- simplejson is a simple, fast, extensible JSON encoder/decoder for Python ☆1,690 · Updated 5 months ago
- Lightweight, scriptable browser as a service with an HTTP API ☆4,173 · Updated last year
- Python character encoding detector ☆2,285 · Updated 8 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,268 · Updated last month
- Python script that takes screenshots (browsershots) using webkit ☆942 · Updated 4 years ago
- A fully tested, abstract interface to creating OAuth clients and servers. ☆3,002 · Updated last year
- Python module to generate ATOM feeds, RSS feeds and Podcasts. ☆771 · Updated last year
- Convert Word documents (.docx files) to HTML ☆985 · Updated this week
- Thin wrapper for "pandoc" (MIT) ☆1,041 · Updated last week
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize. ☆615 · Updated 8 years ago
- A Python Static Website Generator (See https://duct-ui.org from the author). ☆1,633 · Updated 11 months ago