microsoft / markitdown
Python tool for converting files and office documents to Markdown.
☆72,449 Updated this week
Alternatives and similar repositories for markitdown
Users who are interested in markitdown are comparing it to the libraries listed below.
- Toolkit for linearizing PDFs for LLM datasets/training ☆13,930 Updated this week
- Get your documents ready for gen AI ☆37,201 Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy ☆28,307 Updated this week
- 🪄 Create rich visualizations with AI ☆13,596 Updated this week
- The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured data 🔥 ☆53,447 Updated this week
- 🚀 The fast, Pythonic way to build MCP servers and clients ☆16,902 Updated this week
- Build Real-Time Knowledge Graphs for AI Agents ☆17,442 Updated this week
- A simple screen parsing tool towards pure vision based GUI agent ☆23,386 Updated last week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆51,866 Updated this week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆68,716 Updated this week
- 🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄. ☆20,126 Updated 4 months ago
- Collection of leaked system prompts ☆12,703 Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆14,065 Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag… ☆28,040 Updated this week
- OCR & Document Extraction using vision models ☆11,806 Updated 3 months ago
- What are the principles we can use to build LLM-powered software that is actually good enough to put in the hands of production customers… ☆13,321 Updated last month
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚ ☆30,565 Updated 5 months ago
- Free, simple, fast interactive diagrams for any GitHub repository ☆14,146 Updated 3 months ago
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less… ☆44,900 Updated this week
- A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats. ☆42,775 Updated this week
- Lightweight coding agent that runs in your terminal ☆36,501 Updated last week
- Official inference framework for 1-bit LLMs ☆21,135 Updated 2 months ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆49,894 Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,123 Updated 6 months ago
- A collection of MCP servers. ☆68,138 Updated this week
- An open protocol enabling communication and interoperability between opaque agentic applications. ☆19,380 Updated this week
- An extremely fast Python package and project manager, written in Rust. ☆66,211 Updated this week
- Anthropic's Interactive Prompt Engineering Tutorial ☆17,860 Updated last year
- ⚡️ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-p… ☆11,189 Updated this week
- A collection of notebooks/recipes showcasing some fun and effective ways of using Claude. ☆19,582 Updated 2 weeks ago