Convert HTML to Markdown
☆2,128Nov 16, 2025Updated 4 months ago
Alternatives and similar repositories for python-markdownify
Users that are interested in python-markdownify are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Convert HTML to Markdown-formatted text.☆2,138Oct 28, 2025Updated 5 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,659Sep 12, 2025Updated 6 months ago
- structured outputs for llms☆12,702Updated this week
- Markdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed. Now in Python!☆1,271Mar 30, 2026Updated last week
- Convert HTML to Markdown-formatted text.☆2,870Feb 27, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Convert PDF to markdown + JSON quickly with high accuracy☆33,352Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,383Updated this week
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆9,373Updated this week
- A Python implementation of John Gruber’s Markdown with Extension support.☆4,192Feb 9, 2026Updated 2 months ago
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆41,858Apr 2, 2026Updated last week
- Python tool for converting files and office documents to Markdown.☆93,259Mar 30, 2026Updated last week
- A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.☆355Dec 2, 2024Updated last year
- 🛏 An HTML to Markdown converter written in JavaScript☆11,006Updated this week
- LlamaIndex is the leading document agent and OCR platform☆48,389Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A fast yet powerful Python Markdown parser with renderers and plugins.☆3,008Mar 16, 2026Updated 3 weeks ago
- A fast, extensible and spec-compliant Markdown parser in pure Python.☆1,029Mar 22, 2026Updated 2 weeks ago
- DSPy: The framework for programming—not prompting—language models☆33,495Updated this week
- An extremely fast Python package and project manager, written in Rust.☆82,599Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆10,455May 8, 2025Updated 11 months ago
- Data validation using Python type hints☆27,344Updated this week
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,720Updated this week
- Retrying library for Python☆8,526Mar 23, 2026Updated 2 weeks ago
- A markdown parser with high extensibility.☆452Mar 30, 2026Updated last week
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- AI Agent Framework, the Pydantic way☆16,027Apr 2, 2026Updated last week
- Thin wrapper for "pandoc" (MIT)☆1,118Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆75,637Updated this week
- Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.☆10,044Jan 28, 2026Updated 2 months ago
- Data infrastructure for AI☆27,173Updated this week
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆17,825Mar 27, 2026Updated last week
- Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-fa…☆8,713Mar 9, 2026Updated last month
- markdown2: A fast and complete implementation of Markdown in Python☆2,810Mar 28, 2026Updated last week
- Structured Outputs☆13,631Mar 26, 2026Updated 2 weeks ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Supercharge Your LLM Application Evaluations 🚀☆13,195Feb 24, 2026Updated last month
- A guidance language for controlling large language models.☆21,365Mar 18, 2026Updated 3 weeks ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,368Updated this week
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆9,907Updated this week
- Streamlit — A faster way to build and share data apps.☆44,156Updated this week
- Get your documents ready for gen AI☆57,163Updated this week
- Python version of the Playwright testing and automation library.☆14,485Mar 26, 2026Updated 2 weeks ago