Easily deployable, scalable backend server that efficiently converts various document formats (PDF, DOCX, PPTX, HTML, images, etc.) into Markdown. It supports both CPU and GPU processing, making it well suited to large-scale workflows, and offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆763Mar 4, 2025Updated last year
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below.
- Running Docling as an API service☆1,422Updated this week
- Get your documents ready for gen AI☆57,709Updated this week
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,939Apr 9, 2026Updated last week
- File Parser optimised for LLM Ingestion with no loss 🧠 Parse PDFs, Docx, PPTx in a format that is ideal for LLMs.☆7,348Feb 21, 2025Updated last year
- This repository shows how we deploy docling on aws lambda☆29Jun 10, 2025Updated 10 months ago
- Collection of PDF parsing libraries like AI based docling, claude, openai, gemini, meta's llama-vision, unstructured-io, and pdfminer, py…☆177Mar 27, 2026Updated 2 weeks ago
- Document (PDF, Word, PPTX ...) extraction and parse API using state of the art modern OCRs + Ollama supported models. Anonymize documents…☆3,081Dec 8, 2025Updated 4 months ago
- Convert PDF to markdown + JSON quickly with high accuracy☆33,701Updated this week
- Python tool for converting files and office documents to Markdown.☆100,294Mar 30, 2026Updated 2 weeks ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,526Aug 27, 2025Updated 7 months ago
- An open-source RAG-based tool for chatting with your documents.☆25,260Apr 3, 2026Updated last week
- A system for agentic LLM-powered data processing and ETL☆3,706Mar 27, 2026Updated 2 weeks ago
- OCR & Document Extraction using vision models☆12,200May 20, 2025Updated 10 months ago
- OCR, layout analysis, reading order, table recognition in 90+ languages☆19,588Updated this week
- Docling Haystack integration☆29Apr 9, 2026Updated last week
- Toolkit for linearizing PDFs for LLM datasets/training☆17,120Mar 25, 2026Updated 3 weeks ago
- Build, run, manage agentic software at scale.☆39,343Updated this week
- Document to Markdown OCR library with Llama 3.2 vision☆2,425Jan 20, 2025Updated last year
- Structured data extraction and instruction calling with ML, LLM and Vision LLM☆5,151Mar 29, 2026Updated 2 weeks ago
- A Python library to orchestrate LLMs in a neural network-inspired structure☆53Oct 4, 2024Updated last year
- ☆2,277Mar 17, 2025Updated last year
- Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean…☆14,425Updated this week
- ☆198Apr 6, 2026Updated last week
- Simple package to extract text with coordinates from programmatic PDFs☆262Apr 8, 2026Updated last week
- ContextGem: Effortless LLM extraction from documents☆1,821Mar 16, 2026Updated last month
- Sample applications built on the Graphlit Platform☆78Oct 11, 2025Updated 6 months ago
- Fast and efficient unstructured data extraction. Written in Rust with bindings for many languages.☆1,742Dec 21, 2024Updated last year
- The platform for LLM evaluations and AI agent testing☆3,189Updated this week
- Scira (Formerly MiniPerplx) is a minimalistic AI-powered search engine that helps you find information on the internet and cites it too. …☆11,592Mar 20, 2026Updated 3 weeks ago
- 🪄 Create rich visualizations with AI☆15,208Updated this week
- LLM-Driven Extraction of Unstructured Data — Built for API Deployments & ETL Pipeline Workflows☆6,537Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,150Mar 17, 2026Updated 3 weeks ago
- 📃 A better UX for chat, writing content, and coding with LLMs.☆5,421Feb 25, 2026Updated last month
- structured outputs for llms☆12,749Updated this week
- 🚀🤖 Crawl4AI: Open-source LLM Friendly Web Crawler & Scraper. Don't be shy, join here: https://discord.gg/jP8KfhDhyN☆63,955Updated this week
- ☆15Apr 10, 2024Updated 2 years ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆845Jan 28, 2025Updated last year
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆404Jun 26, 2025Updated 9 months ago
- Debug, evaluate, and monitor your LLM applications, RAG systems, and agentic workflows with comprehensive tracing, automated evaluations,…☆18,745Updated this week