microsoft / RLHF-APA
RL algorithm: Advantage induced policy alignment
☆62Updated last year
Alternatives and similar repositories for RLHF-APA:
Users that are interested in RLHF-APA are comparing it to the libraries listed below
- ☆93Updated 6 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆39Updated 11 months ago
- Code for ACL2024 paper - Adversarial Preference Optimization (APO).☆49Updated 7 months ago
- ☆75Updated 6 months ago
- Directional Preference Alignment☆54Updated 3 months ago
- Reproduction of "RLCD Reinforcement Learning from Contrast Distillation for Language Model Alignment☆64Updated last year
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆41Updated 5 months ago
- ICML 2024 - Official Repository for EXO: Towards Efficient Exact Optimization of Language Model Alignment☆49Updated 7 months ago
- A repository for transformer critique learning and generation☆88Updated last year
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards☆49Updated 8 months ago
- ☆30Updated 2 months ago
- Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"☆111Updated 2 months ago
- Rewarded soups official implementation☆54Updated last year
- Code accompanying the paper Pretraining Language Models with Human Preferences☆180Updated 11 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆111Updated 4 months ago
- ☆125Updated last month
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…☆93Updated 5 months ago
- Code and Configs for Asynchronous RLHF: Faster and More Efficient RL for Language Models☆22Updated last month
- Research Code for "ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL"☆124Updated 9 months ago
- ☆26Updated 6 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆64Updated 5 months ago
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated 11 months ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 4 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆98Updated last year
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆55Updated 6 months ago
- Building modular LMs with parameter-efficient fine-tuning.☆93Updated this week
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆51Updated 9 months ago
- This is an official implementation of the paper ``Building Math Agents with Multi-Turn Iterative Preference Learning'' with multi-turn DP…☆20Updated last month
- ☆81Updated this week
- ☆34Updated 11 months ago