illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆258 · Updated 5 months ago
Alternatives and similar repositories for vidore-benchmark
Users interested in vidore-benchmark are comparing it to the libraries listed below:
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. ☆352 · Updated 7 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆237 · Updated 3 months ago
- ☆147 · Updated last year
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 · Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆245 · Updated last year
- ☆82 · Updated 2 months ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆95 · Updated last month
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆241 · Updated last year
- ☆120 · Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆315 · Updated last year
- The first dense retrieval model that can be prompted like an LM ☆90 · Updated 8 months ago
- awesome synthetic (text) datasets ☆321 · Updated last week
- Repository for "PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers", NAACL24 ☆152 · Updated last year
- ☆162 · Updated last year
- Official repo for "Make Your LLM Fully Utilize the Context" ☆261 · Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆178 · Updated 8 months ago
- Complex Function Calling Benchmark. ☆162 · Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆183 · Updated last year
- Compare how Agent systems perform on several benchmarks. ☆103 · Updated 5 months ago
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆147 · Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆80 · Updated last year
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆194 · Updated 4 months ago
- Beating the GAIA benchmark with Transformers Agents. ☆144 · Updated 11 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆278 · Updated last year
- ☆104 · Updated 9 months ago
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆114 · Updated 9 months ago
- TF-ID: Table/Figure IDentifier for academic papers ☆245 · Updated last year
- This is the official repository for Auto-RAG. ☆232 · Updated 6 months ago
- Official repository for paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆213 · Updated 6 months ago
- Simple UI for debugging correlations of text embeddings ☆305 · Updated 7 months ago