mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆575Updated 8 months ago
Alternatives and similar repositories for llm-autoeval:
Users that are interested in llm-autoeval are comparing it to the libraries listed below
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆970Updated this week
- A library for easily merging multiple LLM experts, and efficiently train the merged LLM.☆426Updated 4 months ago
- An Open Source Toolkit For LLM Distillation☆425Updated last week
- Evaluate your LLM's response with Prometheus and GPT4 💯☆841Updated last week
- Generate textbook-quality synthetic LLM pretraining data☆492Updated last year
- ☆484Updated last month
- awesome synthetic (text) datasets☆253Updated 2 months ago
- A bagel, with everything.☆315Updated 9 months ago
- Guide for fine-tuning Llama/Mistral/CodeLlama models and more☆555Updated 4 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆684Updated 9 months ago
- Official repository for ORPO☆430Updated 7 months ago
- Tutorial for building LLM router☆170Updated 5 months ago
- Best practices for distilling large language models.☆424Updated 11 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆259Updated last week
- ☆493Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization☆268Updated 6 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,879Updated this week
- ☆446Updated 9 months ago
- [ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning☆634Updated 7 months ago
- This project showcases an LLMOps pipeline that fine-tunes a small-size LLM model to prepare for the outage of the service LLM.☆294Updated last month
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆183Updated last year
- A compact LLM pretrained in 9 days by using high quality data☆279Updated last month
- Domain Adapted Language Modeling Toolkit - E2E RAG☆313Updated 2 months ago
- Easily embed, cluster and semantically label text datasets☆488Updated 9 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆99Updated 3 months ago
- Fine-Tuning Embedding for RAG with Synthetic Data☆477Updated last year
- A lightweight library for generating synthetic instruction tuning datasets for your data without GPT.☆728Updated 4 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆247Updated 6 months ago
- Code and data for "Lumos: Learning Agents with Unified Data, Modular Design, and Open-Source LLMs"☆459Updated 9 months ago