An easily deployable and scalable backend server that efficiently converts various document formats (PDF, DOCX, PPTX, HTML, images, etc.) into Markdown. It supports both CPU and GPU processing and is ideal for large-scale workflows, offering text/table extraction, OCR, and batch processing with sync/async endpoints.
☆763 · Mar 4, 2025 · Updated last year
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below.
- Running Docling as an API service ☆1,488 · Apr 29, 2026 · Updated last week
- Get your documents ready for gen AI ☆59,087 · Apr 30, 2026 · Updated last week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data ☆2,942 · Apr 9, 2026 · Updated 3 weeks ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs. ☆7,367 · Feb 21, 2025 · Updated last year
- This repository shows how we deploy docling on aws lambda ☆30 · Jun 10, 2025 · Updated 10 months ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py… ☆180 · Apr 21, 2026 · Updated 2 weeks ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents… ☆3,097 · Dec 8, 2025 · Updated 5 months ago
- Convert PDF to markdown + JSON quickly with high accuracy ☆34,606 · Apr 24, 2026 · Updated 2 weeks ago
- Python tool for converting files and office documents to Markdown. ☆121,775 · Apr 20, 2026 · Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ☆1,536 · Aug 27, 2025 · Updated 8 months ago
- An open-source RAG-based tool for chatting with your documents. ☆25,350 · Apr 3, 2026 · Updated last month
- A system for agentic LLM-powered data processing and ETL ☆3,740 · Mar 27, 2026 · Updated last month
- OCR & Document Extraction using vision models ☆12,227 · May 20, 2025 · Updated 11 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages ☆19,707 · Apr 24, 2026 · Updated 2 weeks ago
- Docling Haystack integration ☆29 · Apr 9, 2026 · Updated last month
- Run agents as production software. ☆39,835 · May 1, 2026 · Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training ☆17,283 · Mar 25, 2026 · Updated last month
- Document to Markdown OCR library with Llama 3.2 vision ☆2,428 · Jan 20, 2025 · Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM ☆5,153 · Apr 30, 2026 · Updated last week
- Autonomous agent networks for task automation that requires multi-step reasoning ☆30 · Sep 1, 2025 · Updated 8 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure ☆53 · Oct 4, 2024 · Updated last year
- ☆2,283 · Mar 17, 2025 · Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean… ☆14,646 · Updated this week
- Simple package to extract text with coordinates from programmatic PDFs ☆272 · Updated this week
- ContextGem: Effortless LLM extraction from documents ☆1,838 · Updated this week
- ☆202 · Apr 23, 2026 · Updated 2 weeks ago
- Sample applications built on the Graphlit Platform ☆78 · Oct 11, 2025 · Updated 6 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages. ☆1,747 · Dec 21, 2024 · Updated last year
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. … ☆11,632 · Mar 20, 2026 · Updated last month
- The platform for LLM evaluations and AI agent testing ☆3,240 · Updated this week
- 🪄 Create rich visualizations with AI ☆15,247 · Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ☆1,154 · Apr 16, 2026 · Updated 3 weeks ago
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows ☆6,557 · Apr 30, 2026 · Updated last week
- 📃 A better UX for chat, writing content, and coding with LLMs. ☆5,440 · Feb 25, 2026 · Updated 2 months ago
- structured outputs for llms ☆12,889 · Apr 22, 2026 · Updated 2 weeks ago
- ☆15 · Apr 10, 2024 · Updated 2 years ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN ☆64,964 · Apr 30, 2026 · Updated last week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,… ☆19,227 · Updated this week
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ☆404 · Jun 26, 2025 · Updated 10 months ago