DS4SD / docling-ibm-models
☆34 Updated last week
Related projects
Alternatives and complementary repositories for docling-ibm-models
- Create fast graph language models from converted PDF documents for knowledge extraction and Q&A. ☆21 Updated 3 weeks ago
- Examples using the Deep Search functionalities ☆44 Updated 3 months ago
- A python library to define and validate data types in Docling. ☆28 Updated this week
- Build document-native LLM applications ☆50 Updated 2 months ago
- Simple package to extract text with coordinates from programmatic PDFs ☆21 Updated last week
- Running Docling as an API service ☆13 Updated last month
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆133 Updated 3 weeks ago
- DocLayNet: A Large Human-Annotated Dataset for Document-Layout Analysis ☆266 Updated last year
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆112 Updated 10 months ago
- GLiNER model in a FastAPI microservice. ☆28 Updated 2 weeks ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆131 Updated last month
- Object Detection Model for Scanned Documents ☆82 Updated last year
- A Docker-powered service for PDF document layout analysis. This service provides a powerful and flexible PDF analysis service. The servic… ☆171 Updated last week
- A Faster LayoutReader Model based on LayoutLMv3, Sort OCR bboxes to reading order. ☆91 Updated 5 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024) ☆68 Updated 3 weeks ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆137 Updated 5 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆58 Updated 5 months ago
- Doc2Graph transforms documents into graphs and exploits a GNN to solve several tasks. ☆115 Updated last year
- We identify the desiderata for a comprehensive benchmark and propose Visually Rich Document Understanding (VRDU). VRDU contains two datas… ☆74 Updated last year
- YOLOv10 trained on DocLayNet dataset. ☆57 Updated last week
- A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating… ☆134 Updated 2 months ago
- YOLO models trained on DocLayNet - power your Document Intelligence with Layout Analysis ☆61 Updated last month
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆173 Updated last week
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆59 Updated last week
- My implementation of Kosmos2.5 from the paper: "KOSMOS-2.5: A Multimodal Literate Model" ☆68 Updated last week
- UniTable: Towards a Unified Table Foundation Model ☆373 Updated 5 months ago