huggingface / yourbenchLinks
🤗 Benchmark Large Language Models Reliably On Your Data
☆354Updated last week
Alternatives and similar repositories for yourbench
Users that are interested in yourbench are comparing it to the libraries listed below
Sorting:
- Build datasets using natural language☆498Updated 2 months ago
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆315Updated last month
- Simple UI for debugging correlations of text embeddings☆286Updated last month
- awesome synthetic (text) datasets☆286Updated this week
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆307Updated 3 months ago
- A Lightweight Library for AI Observability☆246Updated 4 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆229Updated 8 months ago
- 📝 Automatically annotate papers using LLMs☆328Updated 2 months ago
- An Open Source Toolkit For LLM Distillation☆669Updated last month
- Automatic evals for LLMs☆461Updated 2 weeks ago
- ☆154Updated 7 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆276Updated 11 months ago
- Automatically evaluate your LLMs in Google Colab☆646Updated last year
- ☆259Updated 2 weeks ago
- ☆156Updated 2 months ago
- ☆127Updated 3 months ago
- A compact LLM pretrained in 9 days by using high quality data☆317Updated 3 months ago
- Generate large synthetic data using an LLM☆432Updated this week
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆447Updated 9 months ago
- ☆213Updated last week
- Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.☆214Updated 2 weeks ago
- Together Open Deep Research☆318Updated 2 months ago
- ☆673Updated 2 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆273Updated 2 weeks ago
- Tool for generating high quality Synthetic datasets☆1,010Updated this week
- Attribute (or cite) statements generated by LLMs back to in-context information.☆245Updated 9 months ago
- code for training & evaluating Contextual Document Embedding models☆194Updated last month
- ☆118Updated 10 months ago
- [EMNLP 2024 Demo] TinyAgent: Function Calling at the Edge!☆420Updated 10 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆223Updated last week