illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
β126Updated this week
Related projects β
Alternatives and complementary repositories for vidore-benchmark
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β164Updated 3 weeks ago
- β102Updated 2 months ago
- β131Updated 3 months ago
- Routing on Random Forest (RoRF)β82Updated last month
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β48Updated 3 months ago
- awesome synthetic (text) datasetsβ239Updated last week
- β91Updated last month
- Experimental Code for StructuredRAG: Structured Outputs in Retrieval-Augmented Generationβ90Updated this week
- β204Updated 4 months ago
- β106Updated 2 weeks ago
- Simple examples using Argilla tools to build AIβ38Updated this week
- Late Interaction Models Training & Retrievalβ158Updated last week
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language Mβ¦β169Updated last week
- Just a bunch of benchmark logs for different LLMsβ113Updated 3 months ago
- A compact LLM pretrained in 9 days by using high quality dataβ260Updated last month
- Solving data for LLMs - Create quality synthetic datasets!β136Updated 3 weeks ago
- Attribute (or cite) statements generated by LLMs back to in-context information.β141Updated last month
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β76Updated 7 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β158Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper.β71Updated last month
- β116Updated 2 months ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ99Updated 9 months ago
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β129Updated last month
- β93Updated 2 months ago
- Let's build better datasets, together!β202Updated 3 months ago
- code for training & evaluating Contextual Document Embedding modelsβ92Updated this week
- β128Updated last week
- Manage scalable open LLM inference endpoints in Slurm clustersβ237Updated 3 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafteβ¦β52Updated last week
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokensβ105Updated last week