drmingler / docling-apiLinks
Easily deployable and scalable backend server that efficiently converts various document formats (pdf, docx, pptx, html, images, etc) into Markdown. With support for both CPU and GPU processing, it is Ideal for large-scale workflows, it offers text/table extraction, OCR, and batch processing with sync/async endpoints.
☆730Updated 9 months ago
Alternatives and similar repositories for docling-api
Users that are interested in docling-api are comparing it to the libraries listed below
Sorting:
- Colivara is a suite of services that allows you to store, search, and retrieve documents based on their visual embedding. ColiVara has st…☆1,404Updated 7 months ago
- A Kubernetes deployable instance of GroundX for document parsing, storage, and search.☆801Updated this week
- 🥤 RAGLite is a Python toolkit for Retrieval-Augmented Generation (RAG) with DuckDB or PostgreSQL☆1,124Updated this week
- ExtractThinker is a Document Intelligence library for LLMs, offering ORM-style interaction for flexible and powerful document workflows.☆1,460Updated 3 months ago
- Running Docling as an API service☆1,043Updated 3 weeks ago
- Make any LLM to think like OpenAI o1 and deepseek R1☆492Updated 10 months ago
- Reasoning Augmented Generation☆891Updated 5 months ago
- An automated AI system (Python framework) designed to analyze any type of website content and generate structured reports using Claude 3.…☆670Updated last year
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆563Updated 3 weeks ago
- ✨ AI interface for tinkerers (Ollama, Haystack RAG, Python)☆473Updated 3 months ago
- ContextGem: Effortless LLM extraction from documents☆1,744Updated last month
- This repository hosts a suite of specialized agents designed to power your brainstorming sessions. Each agent brings a unique perspective…☆532Updated 4 months ago
- openperplex is an opensource AI search engine☆887Updated last year
- Serverless Modal + FastAPI + React + ColPali + Qdrant + GPT4o Vision RAG (V-RAG) Demo☆402Updated 5 months ago
- Vision infrastructure to turn complex documents into RAG/LLM-ready data☆2,918Updated 2 months ago
- The open-source alternative to Carbon.ai. Build powerful RAG applications with any data source, at any scale.☆865Updated last week
- Eliminate hallucinations from your AI agents.☆817Updated this week
- Quickly and securely turn your code projects into LLM prompts, all locally on your own machine!☆694Updated 9 months ago
- Generic rag framework to apply the power of LLMs on any given dataset☆658Updated this week
- Sample apps to help developers get started with Structured Outputs☆660Updated 11 months ago
- Parse PDFs into markdown using Vision LLMs☆452Updated 2 months ago
- Doctor is a tool for discovering, crawl, and indexing web sites to be exposed as an MCP server for LLM agents.☆461Updated 6 months ago
- A Chrome extension for asking questions over websites☆354Updated 10 months ago
- 📥 An inbox UX for interacting with human-in-the-loop agents.☆888Updated 7 months ago
- Python package and backend for the Elysia platform app.☆1,835Updated last week
- ➖ Stripped down, stable version of firecrawl optimized for self-hosting and ease of contribution. Billing logic and AI features are compl…☆624Updated 6 months ago
- Optimize Document Retrieval with Fine-Tuned KnowledgeBases☆175Updated last month
- The open LLM Ops platform - Traces, Analytics, Evaluations, Datasets and Prompt Optimization ✨☆2,689Updated this week
- TWIX is an open-source data extraction tool that reconstructs structured data from documents at scale, accurately and at low cost, by inf…☆209Updated 3 weeks ago
- The most accurate document search and store for building AI apps☆3,417Updated this week