Alir3z4 / html2textLinks
Convert HTML to Markdown-formatted text.
☆2,130Updated 3 months ago
Alternatives and similar repositories for html2text
Users that are interested in html2text are comparing it to the libraries listed below
Sorting:
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,886Updated 2 weeks ago
- Convert HTML to Markdown-formatted text.☆2,880Updated last year
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆902Updated this week
- Convert HTML to Markdown☆2,059Updated 2 months ago
- python parser for human readable dates☆2,778Updated this week
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,309Updated last week
- Convert Word documents (.docx files) to HTML☆1,050Updated 2 months ago
- Parse feeds in Python☆2,277Updated this week
- 🌐 The easiest way to parse and modify URLs in Python.☆2,795Updated 2 months ago
- Thin wrapper for "pandoc" (MIT)☆1,100Updated last month
- Port of Google's language-detection library to Python.☆1,870Updated 11 months ago
- Accurately separates a URL’s subdomain, domain, and public suffix, using the Public Suffix List (PSL).☆1,970Updated last month
- extract text from any document. no muss. no fuss.☆4,434Updated this week
- pdfrw is a pure Python library that reads and writes PDFs☆1,911Updated last year
- Wkhtmltopdf python wrapper to convert html to pdf☆2,038Updated 2 years ago
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,971Updated 3 weeks ago
- A Python implementation of John Gruber’s Markdown with Extension support.☆4,159Updated this week
- Useful extensions to the standard Python datetime features☆2,595Updated 4 months ago
- A jquery-like library for python☆2,379Updated last year
- Extract embedded metadata from HTML markup☆945Updated 4 months ago
- A python based HTML to text conversion library, command line client and Web service.☆334Updated 2 months ago
- Pure-Python full-text search library☆653Updated 2 years ago
- Python character encoding detector☆2,320Updated last month
- Returns unicode slugs☆1,584Updated last month
- Fixes mojibake and other glitches in Unicode text, after the fact.☆4,011Updated last year
- Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.☆1,641Updated 9 months ago
- markdown2: A fast and complete implementation of Markdown in Python☆2,806Updated 2 weeks ago
- 🌁 Wkhtmltoimage python wrapper to convert HTML to image☆831Updated 2 years ago
- A python wrapper for libmagic☆2,880Updated 2 months ago
- A fast, extensible and spec-compliant Markdown parser in pure Python.☆1,017Updated 3 weeks ago