huggingface / trlLinks
Train transformer language models with reinforcement learning.
β14,435Updated this week
Alternatives and similar repositories for trl
Users that are interested in trl are comparing it to the libraries listed below
Sorting:
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β18,976Updated this week
- Fast and memory-efficient exact attentionβ18,150Updated this week
- Accessible large language models via k-bit quantization for PyTorch.β7,192Updated this week
- A framework for few-shot evaluation of language models.β9,464Updated this week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β8,896Updated this week
- Tools for merging pretrained large language models.β5,965Updated 2 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMsβ10,532Updated last year
- An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Asyβ¦β7,287Updated this week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)β4,675Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMsβ10,431Updated this week
- PyTorch native post-training libraryβ5,306Updated this week
- Go ahead and axolotl questionsβ9,810Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"β12,194Updated 6 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMsβ4,043Updated last week
- Hackable and optimized Transformers building blocks, supporting a composable construction.β9,670Updated last week
- Ongoing research training transformer models at scaleβ12,751Updated this week
- 20+ high-performance LLMs with recipes to pretrain, finetune and deploy at scale.β12,445Updated this week
- Large Language Model Text Generation Inferenceβ10,290Updated this week
- SGLang is a fast serving framework for large language models and vision language models.β15,747Updated this week
- A curated list of reinforcement learning with human feedback resources (continually updated)β4,030Updated this week
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs.β6,672Updated this week
- A modular RL library to fine-tune language models to human preferencesβ2,325Updated last year
- An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.β4,890Updated 2 months ago
- Robust recipes to align language models with human and AI preferencesβ5,250Updated 2 months ago
- Aligning pretrained language models with instruction data generated by themselves.β4,408Updated 2 years ago
- Reference implementation for DPO (Direct Preference Optimization)β2,630Updated 10 months ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinksβ6,921Updated 11 months ago
- Retrieval and Retrieval-augmented LLMsβ10,091Updated last month
- β4,084Updated last year
- Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We alsβ¦β17,574Updated this week