huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆4,680Updated last month
Related projects ⓘ
Alternatives and complementary repositories for alignment-handbook
- Tools for merging pretrained large language models.☆4,816Updated 2 weeks ago
- PyTorch native finetuning library☆4,336Updated this week
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆3,699Updated last month
- Train transformer language models with reinforcement learning.☆10,086Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆6,127Updated this week
- A framework for few-shot evaluation of language models.☆6,990Updated this week
- Go ahead and axolotl questions☆7,930Updated this week
- A quick guide (especially) for trending instruction finetuning datasets☆2,644Updated 11 months ago
- Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verifi…☆1,634Updated this week
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,282Updated 2 weeks ago
- Reference implementation for DPO (Direct Preference Optimization)☆2,188Updated 3 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)☆4,502Updated 10 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,045Updated this week
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,205Updated this week
- Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09…☆1,948Updated this week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆7,919Updated 6 months ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,336Updated 7 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,529Updated last week
- ☆2,746Updated 2 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,565Updated 3 months ago
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,755Updated 10 months ago
- Aligning pretrained language models with instruction data generated by themselves.☆4,164Updated last year
- Large Language Model Text Generation Inference☆9,122Updated this week
- Modeling, training, eval, and inference code for OLMo☆4,645Updated this week
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,624Updated last year
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆3,680Updated this week
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,391Updated 8 months ago
- A unified evaluation framework for large language models☆2,465Updated 3 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,059Updated 5 months ago