microsoft / markitdown
Python tool for converting files and office documents to Markdown.
☆36,447 · Updated this week
Alternatives and similar repositories for markitdown:
Users who are interested in markitdown are comparing it to the libraries listed below.
- Get your documents ready for gen AI ☆20,283 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆16,122 · Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆28,156 · Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆20,466 · Updated this week
- Self-hosted AI coding assistant ☆29,597 · Updated this week
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON. ☆25,466 · Updated this week
- A Comprehensive Toolkit for High-Quality PDF Content Extraction ☆6,536 · Updated last month
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer ☆27,378 · Updated this week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...) ☆71,326 · Updated this week
- aider is AI pair programming in your terminal ☆26,757 · Updated this week
- PDF to Markdown with vision models ☆9,384 · Updated this week
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆22,235 · Updated this week
- OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched ☆17,564 · Updated this week
- An extremely fast Python package and project manager, written in Rust. ☆38,822 · Updated this week
- Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory ☆25,622 · Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆5,254 · Updated 3 weeks ago
- ✨ Innovative and open-source visualization application that transforms various data formats, such as JSON, YAML, XML, CSV and more, into … ☆36,281 · Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆22,337 · Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI ☆19,647 · Updated this week
- Code at the speed of thought – Zed is a high-performance, multiplayer code editor from the creators of Atom and Tree-sitter. ☆54,004 · Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API. ☆24,486 · Updated this week
- Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models. ☆123,122 · Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆17,194 · Updated this week
- Windows inside a Docker container. ☆32,612 · Updated this week
- SOTA Open Source TTS ☆18,968 · Updated last week
- Virtual Machine for the Web ☆11,651 · Updated last week
- D2 is a modern diagram scripting language that turns text to diagrams. ☆19,800 · Updated this week
- Virtual whiteboard for sketching hand-drawn like diagrams ☆91,897 · Updated this week
- the AI-native open-source embedding database ☆17,521 · Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/ ☆7,718 · Updated this week