quotient-ai / judges
A small library of LLM judges
β161Updated 2 weeks ago
Alternatives and similar repositories for judges:
Users that are interested in judges are comparing it to the libraries listed below
- β150Updated 4 months ago
- Verdict is a library for scaling judge-time compute.β190Updated 2 weeks ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. π¨π»βπ³β264Updated 3 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information.β221Updated 5 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.β415Updated last year
- XTR/WARP is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.β121Updated 5 months ago
- β109Updated 3 weeks ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, impβ¦β173Updated 7 months ago
- Late Interaction Models Training & Retrievalβ264Updated last week
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ100Updated last year
- β208Updated 8 months ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).β80Updated last year
- β143Updated 8 months ago
- βοΈ Awesome LLM Judges βοΈβ87Updated last month
- Pre-train Static Word Embeddingsβ51Updated 3 weeks ago
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on taskβ¦β160Updated 6 months ago
- A flexible, adaptive classification system for dynamic text classificationβ142Updated 3 weeks ago
- Synthetic Data for LLM Fine-Tuningβ113Updated last year
- FastFit β‘ When LLMs are Unfit Use FastFit β‘ Fast and Effective Text Classification with Many Classesβ191Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ99Updated 11 months ago
- awesome synthetic (text) datasetsβ265Updated 5 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.β189Updated last week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embeddβ¦β97Updated 2 months ago
- A Lightweight Library for AI Observabilityβ238Updated last month
- β195Updated 10 months ago
- An Awesome list of curated DSPy resources.β301Updated last month
- Generalist and Lightweight Model for Text Classificationβ110Updated this week
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".β195Updated this week
- β76Updated 9 months ago