docling-project / doclingLinks
Get your documents ready for gen AI
β45,259Updated this week
Alternatives and similar repositories for docling
Users that are interested in docling are comparing it to the libraries listed below
Sorting:
- An open-source RAG-based tool for chatting with your documents.β24,676Updated 4 months ago
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β7,234Updated 9 months ago
- Python tool for converting files and office documents to Markdown.β83,302Updated last week
- Multi-agent framework, runtime and control plane. Built for speed, privacy, and scale.β35,542Updated this week
- An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.β27,643Updated 2 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) systemβ29,384Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aβ¦β31,849Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ18,942Updated last month
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β116,446Updated this week
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ56,514Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into cleanβ¦β13,254Updated last week
- Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work tβ¦β40,903Updated this week
- Toolkit for linearizing PDFs for LLM datasets/trainingβ16,058Updated last week
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creatβ¦β68,321Updated last week
- πͺ’ Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Openβ¦β18,609Updated last week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive viβ¦β16,999Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.β48,786Updated this week
- GenAI Agent Framework, the Pydantic wayβ13,573Updated this week
- Build Real-Time Knowledge Graphs for AI Agentsβ20,476Updated this week
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information rβ¦β23,165Updated last month
- Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. β¦β31,727Updated last week
- Convert PDF to markdown + JSON quickly with high accuracyβ30,047Updated last week
- Run your own AI cluster at home with everyday devices π±π» π₯οΈββ32,584Updated 3 weeks ago
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"β24,741Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β52,683Updated this week
- β‘οΈ GenBI (Generative BI) queries any database in natural language, generates accurate SQL (Text-to-SQL), charts (Text-to-Chart), and AI-pβ¦β13,115Updated this week
- π₯ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataβ68,626Updated this week
- π€ Chat with your SQL database π. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval π.β21,736Updated last week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,β¦β16,045Updated last week
- OCR & Document Extraction using vision modelsβ11,968Updated 6 months ago