Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆764Mar 4, 2025Updated last year
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Running Docling as an API service☆1,556Updated this week
- Get your documents ready for gen AI☆60,372Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,942Apr 9, 2026Updated last month
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,369Feb 21, 2025Updated last year
- API service for docling document conversion☆38Feb 20, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository shows how we deploy docling on aws lambda☆32Jun 10, 2025Updated 11 months ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆182Apr 21, 2026Updated last month
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,102Dec 8, 2025Updated 5 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆35,381May 5, 2026Updated 3 weeks ago
- Python tool for converting files and office documents to Markdown.☆124,706May 22, 2026Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,544Aug 27, 2025Updated 9 months ago
- An open-source RAG-based tool for chatting with your documents.☆25,394Apr 3, 2026Updated last month
- A system for agentic LLM-powered data processing and ETL☆3,754May 20, 2026Updated last week
- OCR & Document Extraction using vision models☆12,233May 20, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,787Updated this week
- Docling Haystack integration☆29Apr 9, 2026Updated last month
- Build, run, and manage agent platforms.☆40,307May 23, 2026Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training☆17,336Mar 25, 2026Updated 2 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,425Jan 20, 2025Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,158May 22, 2026Updated last week
- Autonomous agent networks for task automation that requires multi-step reasoning☆30Sep 1, 2025Updated 8 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆53Oct 4, 2024Updated last year
- ☆2,287Mar 17, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,749May 18, 2026Updated last week
- ContextGem: Effortless LLM extraction from documents☆1,844May 7, 2026Updated 3 weeks ago
- ☆206May 8, 2026Updated 3 weeks ago
- Simple package to extract text with coordinates from programmatic PDFs☆282Updated this week
- Sample applications built on the Graphlit Platform☆79Oct 11, 2025Updated 7 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.☆1,750Dec 21, 2024Updated last year
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,696Mar 20, 2026Updated 2 months ago
- The platform for LLM evaluations and AI agent testing☆3,265May 21, 2026Updated last week
- 🪄 Create rich visualizations with AI☆15,735Updated this week
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,160May 18, 2026Updated last week
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,585May 22, 2026Updated last week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,459Feb 25, 2026Updated 3 months ago
- structured outputs for llms☆13,023Updated this week
- ☆15Apr 10, 2024Updated 2 years ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆66,299Updated this week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆19,353May 22, 2026Updated last week