An index of algorithms for reinforcement learning from human feedback (RLHF)
☆92Apr 17, 2024Updated last year
Alternatives and similar repositories for awesome-rlhf
Users that are interested in awesome-rlhf are comparing it to the libraries listed below.
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆29Dec 19, 2023Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop)☆11Jul 4, 2022Updated 3 years ago
- Code to reproduce the experiments in The Mirage of Action-Dependent Baselines in Reinforcement Learning.☆17Aug 2, 2018Updated 7 years ago
- A recipe for online RLHF and online iterative DPO.☆543Dec 28, 2024Updated last year
- A curated list of reinforcement learning with human feedback resources (continually updated)☆4,340Dec 9, 2025Updated 4 months ago
- OpenLLMDE: An open source data engineering framework for LLMs☆18Sep 9, 2023Updated 2 years ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Feb 22, 2024Updated 2 years ago
- RewardBench: the first evaluation tool for reward models.☆707Feb 16, 2026Updated last month
- Repo of "Large Language Model-based Human-Agent Collaboration for Complex Task Solving" (EMNLP 2024 Findings)☆34Sep 20, 2024Updated last year
- Cross-domain word representation learning☆10May 23, 2015Updated 10 years ago
- ☆27Mar 13, 2024Updated 2 years ago
- Official repo for NeurIPS'24 paper "WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models"☆19Dec 16, 2024Updated last year
- Deep Weighted Averaging Classifiers☆22Feb 4, 2019Updated 7 years ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL).☆81Mar 27, 2024Updated 2 years ago
- Directional Preference Alignment☆61Sep 23, 2024Updated last year
- Lipschitz Lifelong RL☆11Nov 6, 2020Updated 5 years ago
- ☆39Jul 25, 2024Updated last year
- The server portion of the Neural Chat project to deploy chatbots on web. This code is accompanied by another repository that includes the…☆37Jun 10, 2021Updated 4 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- ☆60Jun 13, 2024Updated last year
- Recipes to train reward model for RLHF.☆1,527Apr 24, 2025Updated 11 months ago
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback☆1,595Nov 24, 2025Updated 4 months ago
- [ICML 2023] Official code for "DevFormer: A Symmetric Transformer for Context-Aware Device Placement"☆21Dec 7, 2024Updated last year
- Code for Contrastive Preference Learning (CPL)☆180Nov 22, 2024Updated last year
- ☆25Dec 8, 2020Updated 5 years ago
- ☆51Oct 28, 2024Updated last year
- This repo supports automatic line plots for multi-seed event files from TensorBoard☆12Jun 23, 2022Updated 3 years ago
- Learning algorithm implementation and experiments in the paper "A Composable Specification Language for Reinforcement Learning Tasks" (ht…☆18Nov 23, 2020Updated 5 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Aug 31, 2024Updated last year
- Supervised Contrastive Learning for Downstream Optimized Sequence Representations☆26Nov 9, 2021Updated 4 years ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"☆30Oct 1, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- This repository is the official implementation of Bidirectional Learning for Offline Infinite-width Model-based Optimization (NeurIPS 202…☆14Jan 19, 2023Updated 3 years ago
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆88Mar 15, 2024Updated 2 years ago