tomekkorbak / pretraining-with-human-feedback

Code accompanying the paper Pretraining Language Models with Human Preferences

☆182

Alternatives and similar repositories for pretraining-with-human-feedback

Users interested in pretraining-with-human-feedback are comparing it to the repositories listed below

tianjunz / HIR
☆159 Updated 2 years ago
CarperAI / autocrit
A repository for transformer critique learning and generation
☆90 Updated last year
haoliuhl / chain-of-hindsight
Simple next-token-prediction for RLHF
☆227 Updated last year
Sea-Snell / Implicit-Language-Q-Learning
Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"
☆208 Updated 2 years ago
IBM / SALMON
Self-Alignment with Principle-Following Reward Models
☆162 Updated 2 months ago
anthropics / ConstitutionalHarmlessnessPaper
☆239 Updated 2 years ago
Dahoas / reward-modeling
☆96 Updated 2 years ago
seonghyeonye / TAPP
[AAAI 2024] Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following
☆79 Updated 10 months ago
thomfoster / minRLHF
A (somewhat) minimal library for finetuning language models with PPO on human feedback.
☆85 Updated 2 years ago
princeton-nlp / TransformerPrograms
[NeurIPS 2023] Learning Transformer Programs
☆162 Updated last year
orhonovich / unnatural-instructions
☆180 Updated 2 years ago
meg-tong / sycophancy-eval
Datasets from the paper "Towards Understanding Sycophancy in Language Models"
☆85 Updated last year
architsharma97 / dpo-rlaif
☆99 Updated last year
veronica320 / Faithful-COT
Code and data accompanying our paper on arXiv "Faithful Chain-of-Thought Reasoning".
☆162 Updated last year
facebookresearch / Shepherd
This is the repo for the paper Shepherd -- A Critic for Language Model Generation
☆219 Updated last year
yizhongw / Tk-Instruct
Tk-Instruct is a Transformer model that is tuned to solve many NLP tasks by following instructions.
☆181 Updated 2 years ago
microsoft / RLHF-APA
RL algorithm: Advantage induced policy alignment
☆65 Updated last year
jlin816 / dialop
DialOp: Decision-oriented dialogue environments for collaborative language agents
☆109 Updated 8 months ago
CarperAI / InstructGPT
For experiments involving InstructGPT. Currently used for documenting open research questions.
☆71 Updated 2 years ago
ruiqi-zhong / D5
The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions
☆70 Updated 2 years ago
neelsjain / BYOD
The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"
☆107 Updated last year
FranxYao / GPT-Bargaining
Code for arXiv 2023: Improving Language Model Negotiation with Self-Play and In-Context Learning from AI Feedback
☆208 Updated 2 years ago
seonghyeonye / Flipped-Learning
[ICLR 2023] Guess the Instruction! Flipped Learning Makes Language Models Stronger Zero-Shot Learners
☆116 Updated last month
jayelm / gisting
Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467
☆289 Updated 5 months ago
joeljang / RLPHF
Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging
☆108 Updated last year
debjitpaul / refiner
The corresponding code from our paper "REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…
☆70 Updated last year
vwxyzjn / summarize_from_feedback_details
☆147 Updated 8 months ago
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆116 Updated 2 years ago
hamishivi / EasyLM
Large language models (LLMs) made easy. EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…
☆75 Updated 11 months ago
allenai / Lila
A unified benchmark for math reasoning
☆88 Updated 2 years ago