DS4SD / deepsearch-glm
Create fast graph language models from converted PDF documents for knowledge extraction and Q&A.
☆21Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for deepsearch-glm
- ☆34Updated last week
- Simple package to extract text with coordinates from programmatic PDFs☆21Updated last week
- Running Docling as an API service☆13Updated last month
- Examples using the Deep Search functionalities☆44Updated 3 months ago
- A python library to define and validate data types in Docling.☆28Updated this week
- Build document-native LLM applications☆50Updated 2 months ago
- Interact with the Deep Search platform for new knowledge explorations and discoveries☆133Updated 3 weeks ago
- Codes and datasets for the paper Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Ref…☆23Updated last month
- ☆43Updated 3 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆53Updated 2 weeks ago
- Code, datasets, and checkpoints for the paper "CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval an…☆25Updated last month
- GLiNER model in a FastAPI microservice.☆28Updated 2 weeks ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆14Updated last month
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆59Updated last week
- Using open source LLMs to build synthetic datasets for direct preference optimization☆40Updated 8 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- ☆47Updated last month
- A Python pipeline tool and plugin ecosystem for processing technical documents. Process papers from arXiv, SemanticScholar, PDF, with GRO…☆43Updated 3 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆58Updated 5 months ago
- Efficient few-shot learning with cross-encoders.☆40Updated 8 months ago
- ☆105Updated last month
- Structured outputs from DSPy and Jinja2☆14Updated last week
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆23Updated last week
- ☆20Updated 9 months ago
- Voyage AI Official Python Library☆40Updated last week
- ☆39Updated 3 weeks ago
- ☆16Updated last week
- Showcase how mxbai-embed-large-v1 can be used to produce binary embedding. Binary embeddings enabled 32x storage savings and 40x faster r…☆16Updated 7 months ago
- 🌟EasyAGI : A generalist agent that can go online and accomplish complex tasks.☆23Updated 11 months ago
- Data preparation code for Amber 7B LLM☆82Updated 6 months ago