huggingface / alignment-handbook
Robust recipes to align language models with human and AI preferences
☆4,933Updated 2 months ago
Alternatives and similar repositories for alignment-handbook:
Users that are interested in alignment-handbook are comparing it to the libraries listed below
- A framework for few-shot evaluation of language models.☆7,576Updated this week
- Tools for merging pretrained large language models.☆5,157Updated this week
- Go ahead and axolotl questions☆8,395Updated this week
- PyTorch native post-training library☆4,765Updated this week
- Train transformer language models with reinforcement learning.☆10,781Updated this week
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆6,770Updated 6 months ago
- Accessible large language models via k-bit quantization for PyTorch.☆6,557Updated this week
- SGLang is a fast serving framework for large language models and vision language models.☆7,967Updated this week
- A quick guide (especially) for trending instruction finetuning datasets☆2,798Updated last year
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆3,806Updated 2 weeks ago
- Aligning pretrained language models with instruction data generated by themselves.☆4,247Updated last year
- [EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which ach…☆4,828Updated this week
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.☆2,167Updated this week
- Large Language Model Text Generation Inference☆9,646Updated this week
- ☆2,341Updated this week
- Reference implementation for DPO (Direct Preference Optimization)☆2,340Updated 5 months ago
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.☆11,290Updated this week
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.☆4,650Updated last week
- The RedPajama-Data repository contains code for preparing large datasets for training large language models.☆4,624Updated last month
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆8,126Updated 8 months ago
- An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.☆1,623Updated last month
- Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.☆5,767Updated last month
- An Open-source Toolkit for LLM Development☆2,747Updated 2 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆10,185Updated 7 months ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…☆6,022Updated 4 months ago
- Modeling, training, eval, and inference code for OLMo☆5,059Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,359Updated 9 months ago
- A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)☆2,341Updated 2 months ago
- Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs☆2,321Updated this week
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆1,668Updated 5 months ago