docling-project / docling
Get your documents ready for gen AI
☆28,950Updated this week
Alternatives and similar repositories for docling:
Users that are interested in docling are comparing it to the libraries listed below
- An open-source RAG-based tool for chatting with your documents.☆22,205Updated 3 weeks ago
- Toolkit for linearizing PDFs for LLM datasets/training☆12,311Updated this week
- Agno is a lightweight library for building Agents with memory, knowledge, tools and reasoning.☆26,158Updated this week
- LLM-powered multiagent persona simulation for imagination enhancement and business insights.☆6,171Updated last month
- 🚀 The fast, Pythonic way to build MCP servers and clients☆9,251Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆24,799Updated last week
- OCR & Document Extraction using vision models☆11,107Updated last week
- OCR, layout analysis, reading order, table recognition in 90+ languages☆17,346Updated this week
- Python tool for converting files and office documents to Markdown.☆56,041Updated 3 weeks ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆42,518Updated this week
- Agent Framework / shim to use Pydantic with LLMs☆9,267Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆6,393Updated 2 months ago
- The official Python SDK for Model Context Protocol servers and clients☆11,467Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.☆24,223Updated last week
- Build Real-Time Knowledge Graphs for AI Agents☆8,122Updated this week
- Python SDK, Proxy Server (LLM Gateway) to call 100+ LLM APIs in OpenAI format - [Bedrock, Azure, OpenAI, VertexAI, Cohere, Anthropic, Sag…☆22,215Updated this week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work t…☆31,277Updated this week
- A high-quality tool for convert PDF to Markdown and JSON.一站式 开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。☆32,914Updated this week
- Collection of awesome LLM apps with AI Agents and RAG using OpenAI, Anthropic, Gemini and opensource models.☆31,493Updated this week
- 🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.☆37,616Updated this week
- Memory for AI Agents; SOTA in AI Agent Memory, beating OpenAI Memory in accuracy by 26% - https://mem0.ai/research☆28,809Updated this week
- Python scraper based on AI☆19,490Updated this week
- Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI☆21,719Updated last week
- AI app store powered by 24/7 desktop history. open source | 100% local | dev friendly | 24/7 screen, mic recording☆14,388Updated last week
- Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ in…☆90,905Updated this week
- Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/☆8,684Updated this week
- Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚☆27,984Updated last month
- The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.☆43,680Updated this week
- SoTA production-ready AI retrieval system. Agentic Retrieval-Augmented Generation (RAG) with a RESTful API.☆6,757Updated this week
- Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase☆8,574Updated 2 weeks ago