Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆765Mar 4, 2025Updated last year
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Running Docling as an API service☆1,621Updated this week
- Get your documents ready for gen AI☆62,000Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,950Apr 9, 2026Updated 2 months ago
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,398Feb 21, 2025Updated last year
- This repository shows how we deploy docling on aws lambda☆32Jun 10, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆194Apr 21, 2026Updated 2 months ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,107Dec 8, 2025Updated 6 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆36,284Jun 6, 2026Updated 3 weeks ago
- Python tool for converting files and office documents to Markdown.☆159,614Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,577Aug 27, 2025Updated 10 months ago
- An open-source RAG-based tool for chatting with your documents.☆25,478Jun 9, 2026Updated 2 weeks ago
- A system for agentic LLM-powered data processing and ETL☆3,841Jun 17, 2026Updated last week
- OCR & Document Extraction using vision models☆12,243May 20, 2025Updated last year
- OCR, layout analysis, reading order, table recognition in 90+ languages☆20,907Jun 13, 2026Updated 2 weeks ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Docling Haystack integration☆29Apr 9, 2026Updated 2 months ago
- Build, run, and manage agent platforms.☆40,861Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training☆17,412Mar 25, 2026Updated 3 months ago
- Document to Markdown OCR library with Llama 3.2 vision☆2,425Jan 20, 2025Updated last year
- Structured data extraction, instruction calling and agentic workflows with ML, LLM and Vision LLM☆5,162Jun 18, 2026Updated last week
- Autonomous agent networks for task automation that requires multi-step reasoning☆30Sep 1, 2025Updated 9 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆53Oct 4, 2024Updated last year
- ☆2,287Mar 17, 2025Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆15,002Updated this week
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ContextGem: Effortless LLM extraction from documents☆1,851Jun 6, 2026Updated 3 weeks ago
- ☆207Jun 4, 2026Updated 3 weeks ago
- Simple package to extract text with coordinates from programmatic PDFs☆316Updated this week
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.☆1,756Dec 21, 2024Updated last year
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,725Mar 20, 2026Updated 3 months ago
- Sample applications built on the Graphlit Platform☆80Oct 11, 2025Updated 8 months ago
- The platform for LLM evaluations and AI agent testing☆3,313Updated this week
- 🪄 Create rich visualizations with AI☆15,848Jun 18, 2026Updated last week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,188May 18, 2026Updated last month
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,672Updated this week
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,472Feb 25, 2026Updated 4 months ago
- structured outputs for llms☆13,210Updated this week
- ☆14Apr 10, 2024Updated 2 years ago
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆69,339Jun 18, 2026Updated last week
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆19,871Updated this week
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆402Jun 26, 2025Updated last year