Convert HTML to Markdown-formatted text.
☆2,874Feb 27, 2024Updated 2 years ago
Alternatives and similar repositories for html2text
Users that are interested in html2text are comparing it to the libraries listed below
Sorting:
- Convert HTML to Markdown-formatted text.☆2,130Oct 28, 2025Updated 4 months ago
- fast python port of arc90's readability tool, updated to match latest readability.js!☆2,890Jan 26, 2026Updated last month
- Convert HTML to Markdown☆2,076Nov 16, 2025Updated 3 months ago
- markdown2: A fast and complete implementation of Markdown in Python☆2,807Feb 15, 2026Updated 2 weeks ago
- Convert HTML to Markdown.☆533Feb 2, 2020Updated 6 years ago
- Html Content / Article Extractor, web scrapping lib in Python☆4,068Dec 26, 2021Updated 4 years ago
- 🛏 An HTML to Markdown converter written in JavaScript☆10,842Oct 24, 2025Updated 4 months ago
- newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs:☆14,997Dec 6, 2025Updated 3 months ago
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,984Feb 22, 2026Updated last week
- bringing sanity to world of messed-up data☆33Nov 15, 2023Updated 2 years ago
- A Python implementation of John Gruber’s Markdown with Extension support.☆4,173Feb 9, 2026Updated 3 weeks ago
- SQL for Humans™☆7,225Feb 9, 2026Updated 3 weeks ago
- Parsel lets you extract data from XML/HTML documents using XPath or CSS selectors☆1,317Jan 29, 2026Updated last month
- Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.☆4,756Jan 6, 2026Updated last month
- Simple job queues for Python☆10,589Feb 26, 2026Updated last week
- Scrapy, a fast high-level web crawling & scraping framework for Python.☆60,007Feb 23, 2026Updated last week
- A jquery-like library for python☆2,381Feb 18, 2026Updated 2 weeks ago
- Pythonic HTML Parsing for Humans™☆13,877Apr 16, 2024Updated last year
- Library for building powerful interactive command line applications in Python☆10,304Nov 17, 2025Updated 3 months ago
- 💫 Industrial-strength Natural Language Processing (NLP) in Python☆33,254Nov 27, 2025Updated 3 months ago
- extract text from any document. no muss. no fuss.☆4,458Feb 4, 2026Updated last month
- A simple, yet elegant, HTTP library.☆53,831Updated this week
- Bleach is an allowed-list-based HTML sanitizing library that escapes or strips markup and attributes☆2,757Feb 2, 2026Updated last month
- Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.☆28,140Updated this week
- Coroutine-based concurrency library for Python☆6,439Updated this week
- Python process launching☆7,235Nov 1, 2025Updated 4 months ago
- Python composable command line interface toolkit☆17,328Updated this week
- Accelerate your web app development | Build fast. Run fast.☆18,640Jan 7, 2026Updated last month
- Python Development Workflow for Humans.☆25,101Feb 16, 2026Updated 2 weeks ago
- Fuzzy String Matching in Python☆9,270Feb 24, 2023Updated 3 years ago
- a little task queue for python☆5,933Updated this week
- gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.☆10,450Feb 27, 2026Updated last week
- Redis Python client☆13,491Feb 25, 2026Updated last week
- Python library and shell utilities to monitor filesystem events.☆7,267Updated this week
- Static site generator that supports Markdown and reST syntax. Powered by Python.☆13,232Feb 3, 2026Updated last month
- Distributed Task Queue (development branch)☆28,152Feb 25, 2026Updated last week
- a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb☆11,950Updated this week
- Simple, Pythonic remote execution and deployment.☆15,398Jul 20, 2025Updated 7 months ago
- A formatter for Python files☆13,988Feb 20, 2026Updated last week