huggingface / yourbenchLinks
🤗 Benchmark Large Language Models Reliably On Your Data
☆391Updated last week
Alternatives and similar repositories for yourbench
Users that are interested in yourbench are comparing it to the libraries listed below
Sorting:
- Build datasets using natural language☆523Updated 4 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆331Updated 3 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆428Updated 2 weeks ago
- Simple UI for debugging correlations of text embeddings☆290Updated 3 months ago
- awesome synthetic (text) datasets☆296Updated 2 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Attribute (or cite) statements generated by LLMs back to in-context information.☆276Updated 11 months ago
- Late Interaction Models Training & Retrieval☆576Updated this week
- A Lightweight Library for AI Observability☆251Updated 6 months ago
- A small library of LLM judges☆280Updated last month
- 📝 Automatically annotate papers using LLMs☆349Updated 4 months ago
- ☆118Updated last year
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆235Updated last month
- ☆155Updated 9 months ago
- An open-source tool for general prompt optimization.☆616Updated 3 weeks ago
- Generate large synthetic data☆441Updated last week
- [ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs☆314Updated 2 months ago
- Let's build better datasets, together!☆263Updated 8 months ago
- ☆155Updated 4 months ago
- An Open Source Toolkit For LLM Distillation☆721Updated 2 months ago
- ☆134Updated 3 weeks ago
- ☆231Updated 2 months ago
- Automatically evaluate your LLMs in Google Colab☆658Updated last year
- code for training & evaluating Contextual Document Embedding models☆197Updated 3 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆267Updated last month
- Fast Semantic Text Deduplication & Filtering☆800Updated this week
- ☆679Updated 4 months ago
- Automatic evals for LLMs☆524Updated 2 months ago
- ☆262Updated 2 months ago
- A compact LLM pretrained in 9 days by using high quality data☆323Updated 5 months ago