Convert HTML to Markdown
☆2,067Nov 16, 2025Updated 3 months ago
Alternatives and similar repositories for python-markdownify
Users that are interested in python-markdownify are comparing it to the libraries listed below
Sorting:
- Convert HTML to Markdown-formatted text.☆2,130Oct 28, 2025Updated 3 months ago
- Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XM…☆5,337Sep 12, 2025Updated 5 months ago
- structured outputs for llms☆12,428Updated this week
- Markdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed. Now in Python!☆1,236Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆31,857Feb 9, 2026Updated 2 weeks ago
- PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.☆9,103Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,011Feb 20, 2026Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- Convert HTML to Markdown-formatted text.☆2,883Feb 27, 2024Updated 2 years ago
- Python tool for converting files and office documents to Markdown.☆87,527Feb 20, 2026Updated last week
- A simple HTML content extractor in Python. Can be run as a wrapper for Mozilla's Readability.js package or in pure-python mode.☆353Dec 2, 2024Updated last year
- LlamaIndex is the leading document agent and OCR platform☆47,210Updated this week
- A Python implementation of John Gruber’s Markdown with Extension support.☆4,168Feb 9, 2026Updated 2 weeks ago
- 🛏 An HTML to Markdown converter written in JavaScript☆10,818Oct 24, 2025Updated 4 months ago
- Open-source AI orchestration framework for building context-engineered, production-ready LLM applications. Design modular pipelines and a…☆24,295Updated this week
- DSPy: The framework for programming—not prompting—language models☆32,381Updated this week
- Data validation using Python type hints☆26,977Updated this week
- A fast yet powerful Python Markdown parser with renderers and plugins.☆2,984Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆9,928May 8, 2025Updated 9 months ago
- An extremely fast Python package and project manager, written in Rust.☆79,639Updated this week
- Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.☆9,774Jan 28, 2026Updated 3 weeks ago
- Retrying library for Python☆8,393Updated this week
- GenAI Agent Framework, the Pydantic way☆15,120Updated this week
- Structured Outputs☆13,456Feb 13, 2026Updated 2 weeks ago
- Open-source search and retrieval database for AI applications.☆26,269Updated this week
- Python version of the Playwright testing and automation library.☆14,289Feb 11, 2026Updated 2 weeks ago
- tiktoken is a fast BPE tokeniser for use with OpenAI's models.☆17,384Feb 8, 2026Updated 2 weeks ago
- A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files☆9,839Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,234Updated this week
- Streamlit — A faster way to build and share data apps.☆43,634Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,210Updated this week
- A guidance language for controlling large language models.☆21,319Feb 13, 2026Updated 2 weeks ago
- Get your documents ready for gen AI☆54,094Updated this week
- Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy☆7,923Feb 2, 2026Updated 3 weeks ago
- Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-fa…☆8,663Jan 12, 2026Updated last month
- Rich is a Python library for rich text and beautiful formatting in the terminal.☆55,569Feb 19, 2026Updated last week
- A next generation HTTP client for Python. 🦋☆15,099Updated this week
- A fast, extensible and spec-compliant Markdown parser in pure Python.☆1,019Jan 18, 2026Updated last month
- An extremely fast Python linter and code formatter, written in Rust.☆45,984Updated this week