An index of algorithms for reinforcement learning from human feedback (RLHF)
☆91 · Apr 17, 2024 · Updated 2 years ago
Alternatives and similar repositories for awesome-rlhf
Users that are interested in awesome-rlhf are comparing it to the libraries listed below.
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data) ☆29 · Dec 19, 2023 · Updated 2 years ago
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations" ☆35 · Jan 5, 2023 · Updated 3 years ago
- Facebear's minimal implementation of SBAC (Soft behavior regularized actor critic, NIPS22 offline RL workshop) ☆11 · Jul 4, 2022 · Updated 3 years ago
- A recipe for online RLHF and online iterative DPO. ☆545 · Dec 28, 2024 · Updated last year
- A curated list of reinforcement learning with human feedback resources (continually updated) ☆4,378 · May 20, 2026 · Updated 3 weeks ago
- OpenLLMDE: An open source data engineering framework for LLMs ☆18 · Sep 9, 2023 · Updated 2 years ago
- The official implementation of Self-Exploring Language Models (SELM) ☆63 · Jun 4, 2024 · Updated 2 years ago
- RewardBench: the first evaluation tool for reward models. ☆720 · Feb 16, 2026 · Updated 3 months ago
- Cross-domain word representation learning ☆10 · May 23, 2015 · Updated 11 years ago
- Training and testing scripts for the prediction model used in the "Interaction-Aware Sampling-Based MPC with Learned Local Goal Predictio… ☆21 · Nov 14, 2023 · Updated 2 years ago
- ☆16 · Oct 5, 2021 · Updated 4 years ago
- Collection of papers and resources for data augmentation (DA) in visual reinforcement learning (RL). ☆81 · Mar 27, 2024 · Updated 2 years ago
- Lipschitz Lifelong RL ☆11 · Nov 6, 2020 · Updated 5 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition" ☆19 · Mar 17, 2022 · Updated 4 years ago
- ☆60 · Jun 13, 2024 · Updated last year
- Recipes to train reward model for RLHF. ☆1,533 · Apr 24, 2025 · Updated last year
- ☆10 · May 17, 2024 · Updated 2 years ago
- Code and results accompanying our paper titled Mixture Proportion Estimation and PU Learning: A Modern Approach at Neurips 2021 (Spotligh… ☆46 · Mar 12, 2024 · Updated 2 years ago
- Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback ☆1,604 · Nov 24, 2025 · Updated 6 months ago
- Code for Contrastive Preference Learning (CPL) ☆182 · Nov 22, 2024 · Updated last year
- AI Alignment: A Comprehensive Survey ☆137 · Nov 2, 2023 · Updated 2 years ago
- ☆50 · Oct 28, 2024 · Updated last year
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025) ☆74 · Jun 25, 2024 · Updated last year
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning ☆26 · Jan 16, 2023 · Updated 3 years ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals in Curiosity-Driven Exploration. ☆11 · Mar 5, 2021 · Updated 5 years ago
- Seamlessly integrate IoT data with AI agents, enabling the effortless parsing, processing, and utilization of IoT data streams. ☆11 · Jan 27, 2025 · Updated last year
- A PyTorch implementation for the paper 'Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observatio… ☆14 · Sep 22, 2021 · Updated 4 years ago
- Supervised Contrastive Learning for Downstream Optimized Sequence Representations ☆26 · Nov 9, 2021 · Updated 4 years ago
- Self-Supervised Alignment with Mutual Information ☆20 · May 24, 2024 · Updated 2 years ago
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment" ☆10 · May 5, 2024 · Updated 2 years ago
- ☆25 · Apr 24, 2019 · Updated 7 years ago
- NAACL 2019 paper: Density Matching for Bilingual Word Embedding (Zhou et al., 2019) ☆63 · Dec 8, 2022 · Updated 3 years ago
- ☆11 · Jun 13, 2024 · Updated last year
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX… ☆89 · Mar 15, 2024 · Updated 2 years ago
- A curated reading list for large language model (LLM) alignment. Take a look at our new survey "Large Language Model Alignment: A Survey"… ☆82 · Sep 28, 2023 · Updated 2 years ago
- Natural Language for Optimization Modelling ☆72 · Jun 11, 2025 · Updated last year
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf… ☆11 · Oct 9, 2023 · Updated 2 years ago
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism ☆30 · Jul 17, 2024 · Updated last year
- ☆12 · Jun 4, 2026 · Updated last week