quotient-ai / judges
A small library of LLM judges
☆154 · Updated this week
Alternatives and similar repositories for judges:
Users who are interested in judges are comparing it to the libraries listed below.
- ☆149 · Updated 3 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆99 · Updated 11 months ago
- Late Interaction Models Training & Retrieval ☆254 · Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆260 · Updated 2 months ago
- awesome synthetic (text) datasets ☆264 · Updated 4 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆124 · Updated 2 months ago
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use… ☆99 · Updated this week
- 📝 Reference-Free automatic summarization evaluation with potential hallucination detection ☆100 · Updated last year
- This is the reproduction repository for my 🤗 Hugging Face blog post on synthetic data ☆68 · Updated last year
- ☆110 · Updated 6 months ago
- ☆50 · Updated 9 months ago
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples. ☆410 · Updated last year
- ☆195 · Updated 10 months ago
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆107 · Updated last month
- A Lightweight Library for AI Observability ☆236 · Updated 3 weeks ago
- ☆58 · Updated 4 months ago
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆171 · Updated 6 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆212 · Updated 5 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆75 · Updated 5 months ago
- FastAPI wrapper around DSPy ☆234 · Updated last year
- ☆77 · Updated 9 months ago
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆230 · Updated this week
- Using various instructor clients to evaluate the quality and capabilities of extractions and reasoning. ☆49 · Updated 5 months ago
- Claudette is Claude's friend ☆224 · Updated this week
- A flexible, adaptive classification system for dynamic text classification ☆119 · Updated this week
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆157 · Updated 5 months ago