docling-project / doclingLinks
Get your documents ready for gen AI
β52,169Updated last week
Alternatives and similar repositories for docling
Users that are interested in docling are comparing it to the libraries listed below
Sorting:
- Python tool for converting files and office documents to Markdown.β86,605Updated last month
- ππ€ Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyNβ59,492Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing aβ¦β35,429Updated this week
- An open-source RAG-based tool for chatting with your documents.β25,019Updated 7 months ago
- πͺ Create rich visualizations with AIβ14,801Updated last week
- Convert PDF to markdown + JSON quickly with high accuracyβ31,582Updated this week
- Toolkit for linearizing PDFs for LLM datasets/trainingβ16,860Updated this week
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your pβ¦β57,756Updated this week
- File Parser optimised for LLM Ingestion with no loss π§ Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.β7,275Updated 11 months ago
- This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information rβ¦β24,625Updated last week
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)β123,582Updated this week
- Build Real-Time Knowledge Graphs for AI Agentsβ22,690Updated this week
- π€ Chat with your SQL database π. Accurate Text-to-SQL Generation via LLMs using Agentic Retrieval π.β22,632Updated last week
- A modular graph-based Retrieval-Augmented Generation (RAG) systemβ30,872Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languagesβ19,228Updated last week
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"β27,911Updated last week
- Model Context Protocol Serversβ78,372Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into cleanβ¦β13,915Updated this week
- π The fast, Pythonic way to build MCP servers and clientsβ22,675Updated this week
- OCR & Document Extraction using vision modelsβ12,136Updated 8 months ago
- A collection of MCP servers.β80,690Updated last week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, alβ¦β16,810Updated this week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive viβ¦β24,058Updated last month
- π₯ The Web Data API for AI - Turn entire websites into LLM-ready markdown or structured dataβ80,940Updated this week
- The official Python SDK for Model Context Protocol servers and clientsβ21,587Updated this week
- Free, simple, fast interactive diagrams for any GitHub repositoryβ15,164Updated last month
- GenAI Agent Framework, the Pydantic wayβ14,704Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. π¦₯ Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM.β51,625Updated last week
- Anthropic's educational coursesβ18,502Updated 2 months ago
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows.β54,207Updated this week