DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆48 Updated 2 months ago
Alternatives and similar repositories for deepsearch-glm:
Users who are interested in deepsearch-glm are comparing it to the libraries listed below:
- A Python library to define and validate data types in Docling. ☆92 Updated this week
- Simple package to extract text with coordinates from programmatic PDFs ☆83 Updated 2 weeks ago
- Examples using the Deep Search functionalities ☆69 Updated last month
- ☆87 Updated 2 weeks ago
- Build document-native LLM applications ☆52 Updated 6 months ago
- Official code of the paper "SimGRAG: Leveraging Similar Subgraphs for Knowledge Graphs Driven Retrieval-Augmented Generation" ☆105 Updated 3 months ago
- Repository for ACL paper: "Statements: Universal Information Extraction from Tables with Large Language Models for ESG KPIs" ☆13 Updated 8 months ago
- I have explained how to create a superior RAG pipeline for complex PDFs using LlamaParse. We can extract text and tables from PDFs and QA on… ☆43 Updated last year
- Interact with the Deep Search platform for new knowledge explorations and discoveries ☆181 Updated 2 months ago
- ☆22 Updated last year
- DSPy in action with open-source LLMs. ☆68 Updated 11 months ago
- A novel multi-modality (Vision) RAG architecture ☆23 Updated 5 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte… ☆64 Updated 4 months ago
- GLiNER model in a FastAPI microservice. ☆39 Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 8 months ago
- [WWW 2025] A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System. ☆57 Updated this week
- LitePali is a minimal, efficient implementation of ColPali for image retrieval and indexing, optimized for cloud deployment. ☆44 Updated 5 months ago
- Lightweight continuous batching with OpenAI compatibility using HuggingFace Transformers, including T5 and Whisper. ☆20 Updated last week
- DocLLM: A layout-aware generative language model for multimodal document understanding ☆123 Updated last year
- Measuring RAG solutions throughput and latency ☆15 Updated 8 months ago
- Python package to parse PDFs with different parsers ☆35 Updated 3 months ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡ ☆67 Updated 4 months ago
- A repository for Multi-Meta-RAG: Improving RAG for Multi-Hop Queries using Database Filtering with LLM-Extracted Metadata ☆33 Updated 7 months ago
- Ready-to-go containerized RAG service. Implemented with text-embedding-inference + Qdrant/LanceDB. ☆62 Updated 3 months ago
- ☆119 Updated last month
- ☆24 Updated 2 months ago
- Code for the EMNLP'24 paper "Learning to Extract Structured Entities Using Language Models" ☆30 Updated last month
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets. ☆64 Updated 6 months ago
- ☆62 Updated 8 months ago
- Collection of text2cypher datasets, evaluations, and finetuning instructions ☆163 Updated 9 months ago