illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆212 Updated 2 weeks ago
Alternatives and similar repositories for vidore-benchmark
Users interested in vidore-benchmark are comparing it to the libraries listed below.
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆313 Updated 2 weeks ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆211 Updated this week
- ☆118 Updated 9 months ago
- OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆76 Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters ☆260 Updated 11 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated 11 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆235 Updated 9 months ago
- ☆143 Updated 11 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆170 Updated 2 weeks ago
- awesome synthetic (text) datasets ☆282 Updated 7 months ago
- This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets. ☆72 Updated 9 months ago
- Dense X Retrieval: What Retrieval Granularity Should We Use? ☆157 Updated last year
- AWM: Agent Workflow Memory ☆275 Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆223 Updated 7 months ago
- Comprehensive benchmark for RAG ☆191 Updated last week
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper… ☆107 Updated last year
- Code for explaining and evaluating late chunking (chunked pooling) ☆403 Updated 5 months ago
- This is the official repository for Auto-RAG. ☆211 Updated 2 months ago
- LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024) ☆137 Updated 7 months ago
- Complex Function Calling Benchmark. ☆114 Updated 5 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking. ☆465 Updated last week
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆139 Updated last year
- Benchmarking library for RAG ☆209 Updated last week
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆240 Updated 8 months ago
- InstructDoc: A Dataset for Zero-Shot Generalization of Visual Document Understanding with Instructions (AAAI2024) ☆161 Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆180 Updated 9 months ago
- The first dense retrieval model that can be prompted like an LM ☆73 Updated last month
- code for training & evaluating Contextual Document Embedding models ☆194 Updated last month
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆164 Updated 5 months ago
- Simple UI for debugging correlations of text embeddings ☆276 Updated 3 weeks ago