drmingler / docling-apiLinks
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆729Updated 9 months ago
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below
Sorting:
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.☆800Updated last week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,457Updated 3 months ago
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,388Updated 7 months ago
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,119Updated this week
- Make any LLM to think like OpenAI o1 and deepseek R1☆492Updated 10 months ago
- Reasoning Augmented Generation☆889Updated 4 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆563Updated 3 weeks ago
- ContextGem: Effortless LLM extraction from documents☆1,738Updated 3 weeks ago
- Running Docling as an API service☆1,020Updated 2 weeks ago
- The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale.☆864Updated last week
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)☆473Updated 3 months ago
- This repository hosts a suite of specialized agents designed to power your brainstorming sessions. Each agent brings a unique perspective…☆532Updated 4 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆402Updated 5 months ago
- AI-first Search & Answer Engine for work. Open-source alternative to Glean.☆642Updated this week
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.…☆670Updated last year
- Quickly and securely turn your code projects into LLM prompts, all locally on your own machine!☆684Updated 9 months ago
- A Chrome extension for asking questions over websites☆354Updated 10 months ago
- Eliminate hallucinations from your AI agents.☆811Updated this week
- openperplex is an opensource AI search engine☆886Updated last year
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep …☆463Updated 7 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,918Updated 2 months ago
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆621Updated 6 months ago
- Generic rag framework to apply the power of LLMs on any given dataset☆659Updated 3 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆489Updated 4 months ago
- Sample apps to help developers get started with Structured Outputs☆660Updated 11 months ago
- TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inf…☆209Updated 2 weeks ago
- OCR Benchmark☆597Updated last month
- 📥 An inbox UX for interacting with human-in-the-loop agents.☆888Updated 7 months ago
- Python package and backend for the Elysia platform app.☆1,820Updated last week
- A simple Python program to implement the search-extract-summarize flow.☆275Updated 5 months ago