drmingler / docling-apiLinks
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
β674Updated 5 months ago
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below
Sorting:
- π₯€ RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQLβ1,055Updated 2 months ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.β776Updated this week
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has stβ¦β1,245Updated 3 months ago
- Reasoning Augmented Generationβ873Updated last month
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.β1,375Updated 3 weeks ago
- Running Docling as an API serviceβ629Updated this week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard thoughβ556Updated 2 months ago
- ContextGem: Effortless LLM extraction from documentsβ1,459Updated last week
- Make any LLM to think like OpenAI o1 and deepseek R1β490Updated 6 months ago
- Optimize Document Retrieval with Fine-Tuned KnowledgeBasesβ154Updated 5 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.β¦β657Updated 9 months ago
- π¦ CHONK your texts with Chonkie β¨ β The no-nonsense RAG chunking libraryβ2,054Updated last week
- β¨ AI interface for tinkerers (Ollama, Haystack RAG, Python)β467Updated 2 months ago
- The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale.β849Updated last month
- openperplex is an opensource AI search engineβ873Updated last year
- Python package and backend for the Elysia platform app.β604Updated last week
- A Chrome extension for asking questions over websitesβ343Updated 6 months ago
- Production-Ready MCP Server Framework β’ Build, deploy & scale secure AI agent infrastructure β’ Includes Auth, Observability, Debugger, Teβ¦β733Updated this week
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demoβ391Updated 2 months ago
- AI-first Search & Answer Engine for work. Open-source alternative to Glean.β577Updated this week
- TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by infβ¦β201Updated 2 months ago
- The complete Agentic Context Engineering Foundation for building AI-native companies. Orchestrate LLMs with APIs or private deployments wβ¦β513Updated this week
- π₯ AI-powered data enrichment tool that transforms emails into rich datasets with company profiles, funding data, tech stacks, and more uβ¦β716Updated 2 months ago
- π₯ Instantly create AI chatbots for any website with RAG-powered search, streaming responses, and OpenAI-compatible API endpointsβ463Updated 2 months ago
- A simple Python program to implement the search-extract-summarize flow.β269Updated 2 months ago
- Helping you select an AI agent frameworkβ372Updated 2 weeks ago
- π π§ PageIndex: Document Index for Reasoning-based RAGβ1,177Updated this week
- π₯ Monitor websites for changes with Firecrawl's powerful change detection - see what and when websites have changedβ276Updated last month
- The open-source multi-agent chat interface that lets you manage multiple agents in one dynamic conversation and add MCP servers for deep β¦β443Updated 4 months ago
- Generic rag framework to apply the power of LLMs on any given datasetβ636Updated last month