huggingface / trl
Train transformer language models with reinforcement learning.
⭐17,115 · Updated this week
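Before scanning the alternatives, a quick sense of what trl does may help. The snippet below is a minimal supervised fine-tuning sketch using trl's SFTTrainer; it is a sketch under assumptions, not an official example, and the checkpoint and dataset names (Qwen/Qwen2.5-0.5B, trl-lib/Capybara) are illustrative choices rather than requirements.

```python
# Minimal SFT sketch with trl (pip install trl datasets).
# Checkpoint and dataset names below are illustrative assumptions.
from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

# Load a small chat-style dataset from the Hub.
dataset = load_dataset("trl-lib/Capybara", split="train")

trainer = SFTTrainer(
    model="Qwen/Qwen2.5-0.5B",                # any causal-LM checkpoint works here
    args=SFTConfig(output_dir="sft-output"),  # TrainingArguments-style config
    train_dataset=dataset,
)
trainer.train()
```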
Alternatives and similar repositories for trl
Users interested in trl are comparing it to the libraries listed below.
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning (see the LoRA sketch after this list). ⭐20,502 · Updated this week
- Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models". ⭐13,199 · Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs. ⭐10,822 · Updated last year
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ⭐9,461 · Updated last week
- An easy-to-use, scalable, and high-performance agentic RL framework based on Ray (PPO & DAPO & REINFORCE++ & TIS & vLLM & Ray & Async RL). ⭐8,851 · Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ⭐7,912 · Updated this week
- A framework for few-shot evaluation of language models. ⭐11,246 · Updated last week
- verl: Volcano Engine Reinforcement Learning for LLMs. ⭐18,535 · Updated last week
- Fast and memory-efficient exact attention. ⭐21,773 · Updated this week
- PyTorch native post-training library. ⭐5,654 · Updated this week
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF). ⭐4,738 · Updated 2 years ago
- Ongoing research training transformer models at scale. ⭐15,016 · Updated this week
- Robust recipes to align language models with human and AI preferences. ⭐5,481 · Updated 4 months ago
- Welcome to the Llama Cookbook! This is your go-to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We als… ⭐18,167 · Updated 2 months ago
- Tools for merging pretrained large language models. ⭐6,696 · Updated 3 weeks ago
- 20+ high-performance LLMs with recipes to pretrain, finetune, and deploy at scale. ⭐13,110 · Updated last week
- Reference implementation for DPO (Direct Preference Optimization). ⭐2,832 · Updated last year
- LMDeploy is a toolkit for compressing, deploying, and serving LLMs. ⭐7,544 · Updated this week
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4-bit quantization, LoRA and LLaMA-Ad… ⭐6,093 · Updated 6 months ago
- Aligning pretrained language models with instruction data generated by themselves. ⭐4,564 · Updated 2 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow instructions within 1 hour and 1.2M parameters. ⭐5,936 · Updated last year
- Retrieval and retrieval-augmented LLMs. ⭐11,187 · Updated last month
- Example models using DeepSpeed. ⭐6,777 · Updated last month
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens. ⭐8,880 · Updated last year
- OpenCompass is an LLM evaluation platform supporting a wide range of models (Llama3, Mistral, InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, … ⭐6,588 · Updated last week
- General technology for enabling AI capabilities with LLMs and MLLMs. ⭐4,262 · Updated last month
- Use PEFT or full-parameter training to CPT/SFT/DPO/GRPO 600+ LLMs (Qwen3, Qwen3-MoE, DeepSeek-R1, GLM4.5, InternLM3, Llama4, ...) and 300+ MLLMs (… ⭐12,337 · Updated this week
- AllenAI's post-training codebase. ⭐3,538 · Updated this week
- Modeling, training, eval, and inference code for OLMo. ⭐6,294 · Updated 2 months ago
- A curated list of reinforcement learning with human feedback resources (continually updated). ⭐4,275 · Updated last month
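As noted in the PEFT entry at the top of the list, a minimal LoRA sketch with peft looks like the following. It assumes a causal LM loaded via transformers; the base checkpoint and the target_modules names are assumptions that vary by architecture.

```python
# Minimal LoRA sketch with peft (pip install peft transformers).
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

# Base checkpoint is an illustrative assumption; any causal LM on the Hub works.
model = AutoModelForCausalLM.from_pretrained("Qwen/Qwen2.5-0.5B")

config = LoraConfig(
    r=16,                                 # rank of the low-rank update matrices
    lora_alpha=32,                        # scaling factor applied to the update
    target_modules=["q_proj", "v_proj"],  # attention projections; names vary by model
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```

The two libraries also compose: trl's trainers accept a peft_config argument, so an adapter like the one above can be trained directly inside an SFT or preference-tuning run.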