Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆759Mar 4, 2025Updated last year
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Running Docling as an API service☆1,362Updated this week
- Get your documents ready for gen AI☆56,339Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,939Sep 24, 2025Updated 6 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,346Feb 21, 2025Updated last year
- This repository shows how we deploy docling on aws lambda☆29Jun 10, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆173Aug 29, 2025Updated 6 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,025Dec 8, 2025Updated 3 months ago
- Python tool for converting files and office documents to Markdown.☆91,227Mar 16, 2026Updated last week
- Convert PDF to markdown + JSON quickly with high accuracy☆32,910Mar 10, 2026Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,501Aug 27, 2025Updated 6 months ago
- An open-source RAG-based tool for chatting with your documents.☆25,234Mar 8, 2026Updated 2 weeks ago
- A system for agentic LLM-powered data processing and ETL☆3,695Mar 12, 2026Updated last week
- OCR & Document Extraction using vision models☆12,191May 20, 2025Updated 10 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,506Mar 1, 2026Updated 3 weeks ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Docling Haystack integration☆28Jan 13, 2025Updated last year
- Toolkit for linearizing PDFs for LLM datasets/training☆17,043Mar 17, 2026Updated last week
- Build, run, manage agentic software at scale.☆38,835Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,429Jan 20, 2025Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,142Updated this week
- A Python library to orchestrate LLMs in a neural network-inspired structure☆52Oct 4, 2024Updated last year
- Autonomous agent networks for task automation that requires multi-step reasoning☆30Sep 1, 2025Updated 6 months ago
- ☆2,265Mar 17, 2025Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,282Mar 16, 2026Updated last week
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆194Mar 9, 2026Updated 2 weeks ago
- Simple package to extract text with coordinates from programmatic PDFs☆256Mar 9, 2026Updated 2 weeks ago
- ContextGem: Effortless LLM extraction from documents☆1,815Mar 16, 2026Updated last week
- Sample applications built on the Graphlit Platform☆77Oct 11, 2025Updated 5 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.☆1,703Dec 21, 2024Updated last year
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,542Mar 18, 2026Updated last week
- The platform for LLM evaluations and AI agent testing☆3,141Updated this week
- 🪄 Create rich visualizations with AI☆15,165Updated this week
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,504Updated this week
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,147Mar 17, 2026Updated last week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,407Feb 25, 2026Updated last month
- structured outputs for llms☆12,589Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆62,480Updated this week
- ☆15Apr 10, 2024Updated last year
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆404Jun 26, 2025Updated 8 months ago
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆18,383Updated this week