DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆40Updated 3 weeks ago
Alternatives and similar repositories for deepsearch-glm:
Users that are interested in deepsearch-glm are comparing it to the libraries listed below
- Examples using the Deep Search functionalities☆63Updated 3 weeks ago
- Simple package to extract text with coordinates from programmatic PDFs☆68Updated this week
- A python library to define and validate data types in Docling.☆71Updated this week
- ☆75Updated this week
- Build document-native LLM applications☆51Updated 5 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆159Updated 3 weeks ago
- I have explained how to create superior RAG pipeline for complex pdfs using LlamaParse. We can extract text and tables from pdf and QA on…☆42Updated 11 months ago
- python package to parse pdfs with different parsers☆35Updated 2 months ago
- Efficient few-shot learning with cross-encoders.☆48Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]☆77Updated last month
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment.☆40Updated 4 months ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆27Updated 5 months ago
- Generalist and Lightweight Model for Text Classification☆79Updated this week
- DocLLM: A layout-aware generative language model for multimodal document understanding☆119Updated last year
- GraphRAG database - hybrid graph / vector db☆118Updated 5 months ago
- Running Docling as an API service☆98Updated this week
- GLiNER model in a FastAPI microservice.☆38Updated 2 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆65Updated 3 weeks ago
- A new novel multi-modality (Vision) RAG architecture☆23Updated 4 months ago
- Python API for https://vespa.ai, the open big data serving engine☆113Updated this week
- Lighter, cheaper and faster RAG toolkit (Graph RAG) supported by TargetPilot☆43Updated 4 months ago
- [WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System.☆38Updated last week
- ☆62Updated 7 months ago
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models"☆25Updated 2 weeks ago
- A Unified Toolkit for Deep Learning-Based Table Extraction☆30Updated 3 months ago
- GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction☆66Updated 6 months ago
- Multimodal LLM Application with PyMuPDF4LLM☆32Updated 4 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 4 months ago