quotient-ai / judges
A small library of LLM judges
☆209 Updated this week
Alternatives and similar repositories for judges
Users who are interested in judges are comparing it to the libraries listed below.
- Scale your LLM-as-a-judge. ☆234 Updated last week
- ☆143 Updated 10 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆423 Updated last year
- ☆152 Updated 6 months ago
- Late Interaction Models Training & Retrieval ☆417 Updated this week
- A Lightweight Library for AI Observability ☆243 Updated 3 months ago
- Simple UI for debugging correlations of text embeddings ☆256 Updated last week
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆235 Updated 8 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆151 Updated last week
- awesome synthetic (text) datasets ☆281 Updated 7 months ago
- ☆176 Updated 6 months ago
- A flexible, adaptive classification system for dynamic text classification ☆199 Updated last month
- ☆195 Updated last year
- High-Performance Engine for Multi-Vector Search ☆80 Updated this week
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆131 Updated last month
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆179 Updated 9 months ago
- 🤗 Benchmark Large Language Models Reliably On Your Data ☆318 Updated last week
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆271 Updated this week
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆136 Updated 2 weeks ago
- ☆71 Updated 6 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆104 Updated last month
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆118 Updated 3 weeks ago
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832). ☆80 Updated last year
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper … ☆104 Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆111 Updated 2 weeks ago
- This package, developed as part of our research detailed in the Chroma Technical Report, provides tools for text chunking and evaluation.… ☆323 Updated 2 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 8 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated 10 months ago
- ☆132 Updated last week
- Generalist and Lightweight Model for Text Classification ☆128 Updated 2 weeks ago