An easily deployable, scalable backend server that efficiently converts various document formats (PDF, DOCX, PPTX, HTML, images, etc.) into Markdown. It supports both CPU and GPU processing, which makes it well suited to large-scale workflows, and it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆753 · Mar 4, 2025 · Updated 11 months ago
Alternatives and similar repositories for docling-api
Users who are interested in docling-api are comparing it to the libraries listed below.
- Running Docling as an API service ☆1,279 · Updated this week
- Get your documents ready for gen AI ☆54,094 · Updated this week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,342 · Feb 21, 2025 · Updated last year
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,940 · Sep 24, 2025 · Updated 5 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ☆2,983 · Dec 8, 2025 · Updated 2 months ago
- A system for agentic LLM-powered data processing and ETL ☆3,636 · Feb 2, 2026 · Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,485 · Aug 27, 2025 · Updated 6 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆52 · Oct 4, 2024 · Updated last year
- Convert PDF to markdown + JSON quickly with high accuracy ☆32,069 · Updated this week
- Python tool for converting files and office documents to Markdown. ☆88,637 · Feb 20, 2026 · Updated last week
- An open-source RAG-based tool for chatting with your documents. ☆25,168 · Updated this week
- An asynchronous voice agent for WhatsApp built with ElevenLabs, Twilio, and Hono, running on Cloudflare 🔥 ☆28 · Jun 12, 2025 · Updated 8 months ago
- Sample applications built on the Graphlit Platform ☆76 · Oct 11, 2025 · Updated 4 months ago
- A VPN written in Rust ☆13 · Apr 17, 2025 · Updated 10 months ago
- ☆11 · Aug 26, 2024 · Updated last year
- OCR & Document Extraction using vision models ☆12,144 · May 20, 2025 · Updated 9 months ago
- Build, run, manage agentic software at scale. ☆38,276 · Updated this week
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,360 · Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,947 · Feb 19, 2026 · Updated last week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,144 · Updated this week
- Document to Markdown OCR library with Llama 3.2 vision ☆2,425 · Jan 20, 2025 · Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM ☆5,126 · Updated this week
- ContextGem: Effortless LLM extraction from documents ☆1,805 · Feb 22, 2026 · Updated last week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,… ☆17,889 · Updated this week
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆14,074 · Updated this week
- streamline the fine-tuning process for multimodal models: PaliGemma 2, Florence-2, and Qwen2.5-VL ☆2,659 · Feb 23, 2026 · Updated last week
- AI Powered Knowledge Graph Generator ☆1,908 · Dec 28, 2025 · Updated 2 months ago
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆5,380 · Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ☆11,472 · Feb 10, 2026 · Updated 3 weeks ago
- AI Automation for everybody ☆165 · May 21, 2025 · Updated 9 months ago
- Simple package to extract text with coordinates from programmatic PDFs ☆245 · Updated this week
- This repository shows how we deploy docling on aws lambda ☆28 · Jun 10, 2025 · Updated 8 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆405 · Jun 26, 2025 · Updated 8 months ago
- The platform for LLM evaluations and AI agent testing ☆2,837 · Updated this week
- Chrome Extension for YouTube. Acts as an assistant for the YouTube video you are watching ☆23 · Apr 26, 2023 · Updated 2 years ago
- 🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser sandbox that lets you automate the web wit… ☆6,494 · Updated this week
- structured outputs for llms ☆12,428 · Updated this week
- ☆2,126 · Mar 17, 2025 · Updated 11 months ago
- ☆41 · Sep 25, 2024 · Updated last year