huggingface / yourbenchLinks
🤗 Benchmark Large Language Models Reliably On Your Data
☆398Updated this week
Alternatives and similar repositories for yourbench
Users that are interested in yourbench are comparing it to the libraries listed below
Sorting:
- Build datasets using natural language☆529Updated 2 weeks ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆451Updated last month
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆336Updated 4 months ago
- A small library of LLM judges☆287Updated 2 months ago
- A Lightweight Library for AI Observability☆251Updated 7 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated last year
- Simple UI for debugging correlations of text embeddings☆291Updated 4 months ago
- Automatically evaluate your LLMs in Google Colab☆661Updated last year
- awesome synthetic (text) datasets☆297Updated 2 months ago
- An Open Source Toolkit For LLM Distillation☆729Updated 2 months ago
- Fast Semantic Text Deduplication & Filtering☆810Updated 3 weeks ago
- ☆159Updated 10 months ago
- A compact LLM pretrained in 9 days by using high quality data☆327Updated 5 months ago
- Late Interaction Models Training & Retrieval☆608Updated last week
- ☆155Updated 5 months ago
- ☆682Updated 5 months ago
- ☆232Updated 3 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆450Updated last year
- ☆135Updated last month
- 📝 Automatically annotate papers using LLMs☆355Updated 5 months ago
- A simple tool that let's you explore different possible paths that an LLM might sample.☆190Updated 4 months ago