Alir3z4 / html2text
Convert HTML to Markdown-formatted text.
☆1,897Updated 6 months ago
Alternatives and similar repositories for html2text:
Users that are interested in html2text are comparing it to the libraries listed below
- Convert HTML to Markdown-formatted text.☆2,671Updated 11 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,724Updated last month
- extract text from any document. no muss. no fuss.☆3,970Updated 2 months ago
- pdfrw is a pure Python library that reads and writes PDFs☆1,884Updated 9 months ago
- Python character encoding detector☆2,227Updated last month
- Parse feeds in Python☆2,047Updated 2 weeks ago
- A Python 3 compatible version of goose http://goose3.readthedocs.io/en/latest/index.html☆851Updated last month
- Useful extensions to the standard Python datetime features☆2,423Updated 6 months ago
- The lxml XML toolkit for Python☆2,759Updated this week
- python parser for human readable dates☆2,616Updated 2 weeks ago
- Fixes mojibake and other glitches in Unicode text, after the fact.☆3,856Updated 3 months ago
- Convert HTML to Markdown☆1,360Updated this week
- A Python library for automating interaction with websites.☆4,711Updated this week
- Python PDF Parser (Not actively maintained). Check out pdfminer.six.☆5,281Updated 2 years ago
- Safely pass trusted data to untrusted environments and back.☆2,974Updated last month
- Python datetimes made easy☆6,359Updated 2 weeks ago
- Community maintained fork of pdfminer - we fathom PDF☆6,211Updated 6 months ago
- A Python implementation of John Gruber’s Markdown with Extension support.☆3,892Updated 2 weeks ago
- Simple PDF text extraction☆897Updated last week
- 🌐 URL parsing and manipulation made easy.☆2,658Updated 2 months ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,187Updated 2 weeks ago
- Extract embedded metadata from HTML markup☆884Updated 2 weeks ago
- Convert Word documents (.docx files) to HTML☆895Updated last month
- API Rate Limit Decorator☆780Updated 2 years ago
- A python based HTML to text conversion library, command line client and Web service.☆287Updated last month
- Web Content Retrieval for Humans™☆616Updated 2 years ago
- Returns unicode slugs☆1,510Updated 11 months ago
- 🏹 Better dates & times for Python☆8,792Updated 3 months ago
- A fast and friendly PDF scraping library.☆772Updated last year
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,675Updated 3 weeks ago