docling-project / docling-coreLinks
A python library to define and validate data types in Docling.
☆173Updated this week
Alternatives and similar repositories for docling-core
Users that are interested in docling-core are comparing it to the libraries listed below
Sorting:
- Simple package to extract text with coordinates from programmatic PDFs☆176Updated last week
- ☆137Updated last month
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆55Updated 7 months ago
- Build document-native LLM applications☆54Updated 11 months ago
- Making docling agentic through MCP☆178Updated this week
- Examples using the Deep Search functionalities☆85Updated 7 months ago
- ☆122Updated 6 months ago
- ☆191Updated last week
- A fast, lightweight and easy-to-use Python library for splitting text into semantically meaningful chunks.☆362Updated 2 weeks ago
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆64Updated 10 months ago
- GLiNER model in a FastAPI microservice.☆45Updated 8 months ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆164Updated this week
- Running Docling as an API service☆646Updated this week
- Multimodal document parser for high quality data understanding and extraction☆78Updated this week
- Open source RAG evaluation package☆286Updated last week
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆213Updated 7 months ago
- ☆234Updated 2 months ago
- Docling LangChain integration☆42Updated 2 months ago
- Visualize Different Text Splitting Methods☆285Updated 7 months ago
- Workflows are an event-driven, async-first, step-based way to control the execution flow of AI applications like agents.☆166Updated last week
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆232Updated 3 weeks ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆174Updated 11 months ago
- Generalist and Lightweight Model for Text Classification☆156Updated 2 months ago
- A set of tools to create synthetically-generated data from documents☆25Updated 2 weeks ago
- A Lightweight Library for AI Observability☆250Updated 6 months ago
- Python API for https://vespa.ai, the open big data serving engine☆137Updated this week
- OCR Benchmark☆553Updated 3 months ago
- 🧪 Experimental features for Haystack☆48Updated last week
- DocLLM: A layout-aware generative language model for multimodal document understanding☆128Updated last year
- ☆127Updated 3 weeks ago