aaronsw / html2text
Convert HTML to Markdown-formatted text.
☆2,880 · Updated last year
Alternatives and similar repositories for html2text
Users who are interested in html2text are comparing it to the libraries listed below.
- Convert HTML to Markdown-formatted text. ☆2,127 · Updated 3 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js! ☆2,886 · Updated last week
- A fast yet powerful Python Markdown parser with renderers and plugins. ☆2,962 · Updated 3 weeks ago
- markdown2: A fast and complete implementation of Markdown in Python ☆2,808 · Updated 2 weeks ago
- [abandoned] python port of arc90's readability bookmarklet ☆542 · Updated 14 years ago
- Reads, queries and modifies Microsoft Word 2007/2008 docx files. ☆1,073 · Updated 10 years ago
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html ☆901 · Updated last month
- A jquery-like library for python ☆2,379 · Updated last year
- Python character encoding detector ☆2,320 · Updated last month
- A Python Static Website Generator (See https://duct-ui.org from the author). ☆1,738 · Updated last year
- Parse feeds in Python ☆2,277 · Updated this week
- Standards-compliant library for parsing and serializing HTML documents and fragments in Python ☆1,217 · Updated last year
- A Python implementation of John Gruber’s Markdown with Extension support. ☆4,156 · Updated 2 weeks ago
- Scalable Bloom Filter implemented in Python ☆1,624 · Updated 4 years ago
- Python script that takes screenshots (browsershots) using webkit ☆941 · Updated 5 years ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors ☆1,309 · Updated last week
- Stateful programmatic web browsing in Python, after Andy Lester's Perl module WWW::Mechanize. ☆616 · Updated 8 years ago
- A pure-python HTML screen-scraping library ☆1,888 · Updated 3 years ago
- 🌐 The easiest way to parse and modify URLs in Python. ☆2,795 · Updated 2 months ago
- Html Content / Article Extractor, web scraping lib in Python ☆4,061 · Updated 4 years ago
- Convert Word documents (.docx files) to HTML ☆1,050 · Updated 2 months ago
- Python interface to Boilerpipe, Boilerplate Removal and Fulltext Extraction from HTML pages ☆542 · Updated 4 years ago
- Wkhtmltopdf python wrapper to convert html to pdf ☆2,038 · Updated 2 years ago
- Python PDF Parser (Not actively maintained). Check out pdfminer.six. ☆5,303 · Updated 3 years ago
- Thin wrapper for "pandoc" (MIT) ☆1,100 · Updated last month
- Python Subprocesses for Humans™. ☆2,265 · Updated 9 years ago
- Python Command-line Application Tools ☆97 · Updated 2 years ago
- Requests + Gevent = <3 ☆4,587 · Updated last year
- Create beautiful tag clouds as images or HTML ☆396 · Updated 7 years ago
- Webkit based scriptable web browser for python. ☆2,763 · Updated last year