thomfoster / minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
☆86 · Updated 2 years ago
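Since the repository's focus is PPO-based RLHF fine-tuning, the snippet below is a minimal, generic sketch of PPO's clipped surrogate loss over response-token log-probabilities. The function name, arguments, and dummy values are illustrative assumptions and do not reflect minRLHF's actual API.

```python
# Generic sketch of the PPO clipped objective used in RLHF fine-tuning.
# Not minRLHF's API; names (ppo_clip_loss, clip_ratio, ...) are illustrative.
import torch

def ppo_clip_loss(new_logprobs, old_logprobs, advantages, clip_ratio=0.2):
    """Clipped surrogate policy loss over sampled response tokens."""
    ratio = torch.exp(new_logprobs - old_logprobs)          # pi_new / pi_old per token
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1 - clip_ratio, 1 + clip_ratio) * advantages
    return -torch.mean(torch.minimum(unclipped, clipped))   # negate to maximize the surrogate

# Example with dummy per-token values:
new_lp = torch.tensor([-1.2, -0.7, -2.0])
old_lp = torch.tensor([-1.0, -0.9, -2.1])
adv = torch.tensor([0.5, -0.3, 1.0])
print(ppo_clip_loss(new_lp, old_lp, adv))
```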
Alternatives and similar repositories for minRLHF:
Users interested in minRLHF are comparing it to the libraries listed below
- ☆96 · Updated last year
- RLHF implementation details of OAI's 2019 codebase ☆166 · Updated last year
- ☆125 · Updated last month
- Code accompanying the paper Pretraining Language Models with Human Preferences ☆180 · Updated 11 months ago
- ☆161 · Updated last year
- A repository for transformer critique learning and generation ☆88 · Updated last year
- Simple next-token-prediction for RLHF ☆222 · Updated last year
- ☆93 · Updated 6 months ago
- ☆81 · Updated this week
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL" ☆124 · Updated 9 months ago
- ☆30 · Updated 2 months ago
- Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extreme Lengths (ICLR 2024) ☆204 · Updated 7 months ago
- Curation of resources for LLM mathematical reasoning, most of which are screened by @tongyx361 to ensure high quality and accompanied wit… ☆102 · Updated 6 months ago
- Self-Alignment with Principle-Following Reward Models ☆150 · Updated 10 months ago
- "Improving Mathematical Reasoning with Process Supervision" by OPENAI☆100Updated this week
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment" ☆71 · Updated 7 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024) ☆171 · Updated 3 months ago
- A toolkit for scaling law research ⚖ ☆43 · Updated last month
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment" ☆111 · Updated 2 months ago
- ☆93 · Updated 3 months ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning" ☆203 · Updated last year
- ☆89 · Updated this week
- ☆115 · Updated 3 months ago
- ☆119 · Updated last month
- ☆75 · Updated 6 months ago
- ☆265 · Updated last week
- Code and data used in the paper: "Training on Incorrect Synthetic Data via RL Scales LLM Math Reasoning Eight-Fold" ☆27 · Updated 7 months ago
- Reproducible, flexible LLM evaluations ☆118 · Updated last month
- DSIR large-scale data selection framework for language model training ☆242 · Updated 9 months ago
- Reference implementation for Token-level Direct Preference Optimization (TDPO) ☆124 · Updated 6 months ago