docling-project / docling-sdg
A set of tools to create synthetically-generated data from documents
☆20Updated last month
Alternatives and similar repositories for docling-sdg
Users who are interested in docling-sdg are comparing it to the libraries listed below.
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.☆55Updated 5 months ago
- Simple package to extract text with coordinates from programmatic PDFs☆141Updated last week
- Build document-native LLM applications☆53Updated 10 months ago
- Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs"☆13Updated last year
- Own your AI, search the web with it🌐😎☆86Updated 6 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆22Updated 9 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆20Updated 9 months ago
- Query Expansion for Better Query Embedding using LLMs☆54Updated 5 months ago
- Auto Thinking Mode switch for Qwen3 in Open WebUI☆66Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆63Updated 10 months ago
- ☆57Updated 5 months ago
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆25Updated 3 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆108Updated 3 months ago
- A Python library to define and validate data types in Docling.☆155Updated this week
- ☆50Updated this week
- Examples using the Deep Search functionalities☆81Updated 5 months ago
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆37Updated 3 months ago
- Elasticsearch integration into LangChain☆57Updated 5 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆35Updated last year
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆47Updated last month
- Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper.☆26Updated 4 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆100Updated 6 months ago
- Making docling agentic through MCP☆125Updated last week
- AIPE (AI Pipeline Engine) is a flexible and powerful tool for creating and executing complex AI workflows☆21Updated 11 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆74Updated 8 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆91Updated 5 months ago
- Public Goods Game (PGG) Benchmark: Contribute & Punish is a multi-agent benchmark that tests cooperative and self-interested strategies a…☆37Updated 3 months ago
- ☆95Updated last month
- ☆101Updated 10 months ago
- Unsloth Studio☆95Updated 3 months ago