huggingface / yourbench
🤗 Benchmark Large Language Models Reliably On Your Data
☆423 Updated 3 weeks ago
Alternatives and similar repositories for yourbench
Users who are interested in yourbench are comparing it to the libraries listed below.
- Build datasets using natural language ☆558 Updated 4 months ago
- awesome synthetic (text) datasets ☆321 Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆352 Updated 7 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆495 Updated 4 months ago
- Simple UI for debugging correlations of text embeddings ☆305 Updated 7 months ago
- A Lightweight Library for AI Observability ☆253 Updated 11 months ago
- Automatically evaluate your LLMs in Google Colab ☆682 Updated last year
- A small library of LLM judges ☆314 Updated 5 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆276 Updated last year
- ☆237 Updated last month
- An Open Source Toolkit For LLM Distillation ☆823 Updated last month
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆315 Updated last year
- ☆161 Updated last year
- ☆138 Updated 5 months ago
- ☆695 Updated 8 months ago
- Let's build better datasets, together! ☆269 Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆245 Updated last year
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs ☆314 Updated 6 months ago
- Late Interaction Models Training & Retrieval ☆687 Updated last week
- Fast Semantic Text Deduplication & Filtering ☆866 Updated this week
- An interface library for RL post training with environments. ☆1,066 Updated this week
- A compact LLM pretrained in 9 days by using high quality data ☆340 Updated 9 months ago
- ☆158 Updated 9 months ago
- 📝 Automatically annotate papers using LLMs ☆398 Updated last month
- TapeAgents is a framework that facilitates all stages of the LLM Agent development lifecycle ☆302 Updated last month
- Generate High-Quality Synthetics, Train, Measure, and Evaluate in a Single Pipeline ☆811 Updated this week
- An open-source tool for LLM prompt optimization. ☆746 Updated last week
- Automatic evals for LLMs ☆575 Updated 3 weeks ago
- A flexible, adaptive classification system for dynamic text classification ☆522 Updated 3 months ago
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper. ☆258 Updated 5 months ago