drmingler / docling-api
Easily deployable and scalable backend server that efficiently converts various document formats (PDF, DOCX, PPTX, HTML, images, etc.) into Markdown. With support for both CPU and GPU processing, it is ideal for large-scale workflows and offers text/table extraction, OCR, and batch processing with sync/async endpoints.
★724 · Updated 8 months ago
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL ★1,104 · Updated 2 weeks ago
- ColiVara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st… ★1,370 · Updated 6 months ago
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows. ★1,455 · Updated 2 months ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search. ★800 · Updated last week
- Reasoning Augmented Generation ★889 · Updated 4 months ago
- Make any LLM think like OpenAI o1 and DeepSeek R1 ★492 · Updated 9 months ago
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python) ★472 · Updated 2 months ago
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though ★560 · Updated last month
- Running Docling as an API service ★944 · Updated 3 weeks ago
- The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale. ★864 · Updated 4 months ago
- ContextGem: Effortless LLM extraction from documents ★1,718 · Updated last week
- A Chrome extension for asking questions over websites ★351 · Updated 9 months ago
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo ★400 · Updated 4 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.… ★667 · Updated last year
- A minimal, open-source setup for serving Agents using FastAPI and Postgres. Built for speed, clarity, and dev happiness. ★332 · Updated 6 months ago
- Doctor is a tool for discovering, crawling, and indexing web sites to be exposed as an MCP server for LLM agents. ★462 · Updated 5 months ago
- RAT is a powerful tool that improves AI responses by leveraging DeepSeek's reasoning capabilities to guide other models through a structu… ★656 · Updated 9 months ago
- Helping you select an AI agent framework ★394 · Updated last week
- Optimize Document Retrieval with Fine-Tuned KnowledgeBases ★166 · Updated 2 weeks ago
- AI-first Search & Answer Engine for work. Open-source alternative to Glean. ★635 · Updated this week
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep … ★461 · Updated 7 months ago
- An inbox UX for interacting with human-in-the-loop agents. ★882 · Updated 6 months ago
- Quickly and securely turn your code projects into LLM prompts, all locally on your own machine! ★679 · Updated 8 months ago
- Generic RAG framework to apply the power of LLMs on any given dataset ★659 · Updated 2 months ago
- A simple Python program to implement the search-extract-summarize flow. ★275 · Updated 5 months ago
- xpander.ai is the runtime and control plane to build, run, and ship reliable AI agents fast and anywhere ★772 · Updated last week
- Turn your data into intelligent context for LLMs. Build knowledge graphs, deploy private LLMs, and launch agents with full data sovereign… ★741 · Updated this week
- OCR Benchmark ★591 · Updated last month
- Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl… ★604 · Updated 6 months ago
- ★418 · Updated 11 months ago