Running Docling as an API service
☆1,279Feb 25, 2026Updated last week
Alternatives and similar repositories for docling-serve
Users that are interested in docling-serve are comparing it to the libraries listed below
Sorting:
- Docling core data types and transformations☆230Updated this week
- Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) int…☆753Mar 4, 2025Updated last year
- Simple package to extract text with coordinates from programmatic PDFs☆245Feb 25, 2026Updated last week
- Get your documents ready for gen AI☆54,754Updated this week
- ☆187Feb 20, 2026Updated last week
- Making docling agentic through MCP☆426Jan 22, 2026Updated last month
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆58Jan 27, 2025Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,074Updated this week
- Examples using the Deep Search functionalities☆83Jan 29, 2025Updated last year
- Pipelines: Versatile, UI-Agnostic OpenAI-Compatible Plugin Framework☆2,290Aug 18, 2025Updated 6 months ago
- 🦛 CHONK docs with Chonkie ✨ — The lightweight ingestion library for fast, efficient and robust RAG pipelines☆3,781Updated this week
- Convert PDF to markdown + JSON quickly with high accuracy☆32,069Updated this week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆37,083Updated this week
- Open source project for data preparation for GenAI applications☆905Updated this week
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆220Jan 24, 2025Updated last year
- Build document-native LLM applications☆56Sep 11, 2024Updated last year
- 📚 Process PDFs, Word documents and more with spaCy☆861Mar 8, 2025Updated 11 months ago
- ☆22Feb 1, 2025Updated last year
- A set of tools to create synthetically-generated data from documents☆39Aug 15, 2025Updated 6 months ago
- A simple, secure MCP-to-OpenAPI proxy server☆4,017Updated this week
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,688Feb 5, 2026Updated last month
- ☆15Jul 21, 2025Updated 7 months ago
- Toolkit for linearizing PDFs for LLM datasets/training☆16,947Feb 19, 2026Updated 2 weeks ago
- User-friendly AI Interface (Supports Ollama, OpenAI API, ...)☆125,513Updated this week
- ☆3,002Updated this week
- Python tool for converting files and office documents to Markdown.☆88,637Feb 20, 2026Updated last week
- An open-source RAG-based tool for chatting with your documents.☆25,168Updated this week
- The most accurate document search and store for building AI apps☆3,516Feb 25, 2026Updated last week
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation"☆28,932Updated this week
- Add/remove knowledge from local files/folder.☆25May 15, 2025Updated 9 months ago
- RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to creat…☆73,900Updated this week
- A blazing fast inference solution for text embeddings models☆4,525Feb 25, 2026Updated last week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,247Feb 25, 2026Updated last week
- This repository shows how we deploy docling on aws lambda☆28Jun 10, 2025Updated 8 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,360Feb 24, 2026Updated last week
- Universal memory layer for AI Agents☆48,604Updated this week
- Build, run, manage agentic software at scale.☆38,276Updated this week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆71,883Updated this week
- Humans and AI agents, building knowledge bases together. Self-hosted document annotation, version control, semantic search, and MCP.☆1,225Updated this week